📚 The CoCalc Library - books, templates and other resources

cocalc-examples / scikit-image-tutorials / book / lessons / 4_segmentation.ipynb

¹³²⁹²⁹ views
License: OTHER

Kernel: Python 3

In [48]:

%matplotlib inline

Segmentation

Segmentation is the division of an image into "meaningful" regions. If you've seen The Terminator, you've seen image segmentation:

In scikit-image, you can find segmentation functions in the segmentation package, with one exception: the watershed function is in morphology, because it's a bit of both. We'll use watershed and region boundary agglomeration. Functions such as segmentation.slic are useful for images in which the objects you want to segment have different colors. We won't cover them here but you should be aware they exist.

In [49]:

import numpy as np
import matplotlib.pyplot as plt

Segmenting with filters

In many images, the contrast between regions is not sufficient to distinguish them, but there is a clear boundary between them. Using an edge detector on these images, followed by a watershed, often gives very good segmentation. For example, look at the output of the Sobel filter on the coins image:

In [50]:

from skimage import data
from skimage import filters
from matplotlib import cm

In [51]:

coins = data.coins()
edges = filters.sobel(coins)

plt.imshow(edges, cmap='gray');

Out[51]:

The watershed algorithm finds the regions between these edges. It does so by envisioning the pixel intensity as height on a topographic map. It then "floods" the map from the bottom up, starting from seed points. These flood areas are called "watershed basins" and when they meet, they form the image segmentation.

Let's look at a one-dimensional example:

In [52]:

from skimage.morphology import watershed
from scipy import ndimage as ndi

x = np.arange(12)
y = np.array([1, 0, 1, 2, 1, 3, 2, 0, 2, 4, 1, 0])

seeds = ndi.label(y == 0)[0]
seed_positions = np.argwhere(seeds)[:, 0]

print("Seeds:", seeds)
print("Seed positions:", seed_positions)

Out[52]:

Seeds: [0 1 0 0 0 0 0 2 0 0 0 3]
Seed positions: [ 1  7 11]

In [53]:

result = watershed(y, seeds)
print(result)

Out[53]:

[1 1 1 1 1 2 2 2 2 3 3 3]

In [54]:

# You can ignore the code below--it's just
# to make a pretty plot of the results.
fig, ax = plt.subplots(figsize=(10, 5))
ax.plot(y, '-o', label='Image slice', linewidth=3)
ax.plot(seed_positions, np.zeros_like(seed_positions), 'r^',
        label='Seeds', markersize=15)

for n, label in enumerate(np.unique(result)):
    mask = (result == label)
    ax.bar(x[mask][:-1], result[mask][:-1],
           width=1, label='Region %d' % n,
           alpha=0.1)

ax.vlines(np.argwhere(np.diff(result)) + 0.5, -0.2, 4.1, 'm',
          linewidth=3, linestyle='--')

from scipy.interpolate import interp1d

#c = interp1d(x, y, kind='cubic')
#t = np.linspace(0, len(y) - 1, 100)
#ax.plot(t, c(t), 'g')

ax.legend(loc='upper left', numpoints=1)
ax.axis('off')
ax.set_ylim(-0.2, 4.1);

Out[54]:

Let's find some seeds for coins. First, we compute the distance transform of a thresholded version of edges:

In [55]:

threshold = filters.threshold_otsu(edges)
print(threshold)
# Euclidean distance transform
# How far do we have to travel from a non-edge to find an edge?
non_edges = (edges < threshold)
distance_from_edge = ndi.distance_transform_edt(non_edges)

plt.imshow(distance_from_edge, cmap='viridis');

Out[55]:

0.143982851338

Then, we find the peaks in that image--the background points furthest away from any edges--which will act as the seeds.

In [56]:

from skimage import feature

# -------------------------------------------------#
peaks = feature.peak_local_max(distance_from_edge, min_distance=10)
print("Peaks shape:", peaks.shape)
# -------------------------------------------------#

peaks_image = np.zeros(coins.shape, np.bool)
peaks_image[tuple(np.transpose(peaks))] = True
seeds, num_seeds = ndi.label(peaks_image)

plt.imshow(edges, cmap='gray')
plt.plot(peaks[:, 1], peaks[:, 0], 'ro');
plt.axis('image')

Out[56]:

Peaks shape: (83, 2)

(-0.5, 383.5, 302.5, -0.5)

We are now ready to perform the watershed:

In [57]:

ws = watershed(edges, seeds)

from skimage import color
plt.imshow(color.label2rgb(ws, coins));

Out[57]:

Examining the resulting segmentation

That's pretty good! Some coins are perfectly segmented, with only one missing. We can't do much about the missing one (yet), but we can merge regions to fix the remaining coins, and the background.

In [58]:

from skimage.future import graph

Because mean boundary agglomeration won't be available until scikit-image 0.13, we have to monkey patch the RAG class to use it.

In [59]:

def merge_nodes(self, src, dst, weight_func=None, in_place=True,
                extra_arguments=[], extra_keywords={}):
    src_nbrs = set(self.neighbors(src))
    dst_nbrs = set(self.neighbors(dst))
    neighbors = (src_nbrs | dst_nbrs) - set([src, dst])

    if in_place:
        new = dst
    else:
        new = self.next_id()
        self.add_node(new)

    for neighbor in neighbors:
        data = weight_func(self, src, new, neighbor, *extra_arguments,
                           **extra_keywords)
        self.add_edge(neighbor, new, attr_dict=data)

    self.node[new]['labels'] = (self.node[src]['labels'] +
                                self.node[dst]['labels'])
    self.remove_node(src)

    if not in_place:
        self.remove_node(dst)

    return new

graph.RAG.merge_nodes = merge_nodes

Now we can make a RAG that will be mergeable:

In [60]:

g = graph.rag_boundary(ws, edges)

Look at the skimage.future.graph.merge_hierarchical API. Although it's still being worked on (that's why it's in future, you can use it now!

In [61]:

from skimage.future import graph

graph.merge_hierarchical?

Note that it needs both a merge function and a weight function, which together define how merging nodes affects the graph. In our case, we want any edges to reflect the mean of the pixels at their boundary.

In [62]:

from skimage.future import graph

def weight_boundary(graph, src, dst, n):
    default = {'weight': 0.0, 'count': 0}

    count_src = graph[src].get(n, default)['count']
    count_dst = graph[dst].get(n, default)['count']

    weight_src = graph[src].get(n, default)['weight']
    weight_dst = graph[dst].get(n, default)['weight']

    count = count_src + count_dst
    return {
        'count': count,
        'weight': (count_src * weight_src + count_dst * weight_dst)/count
    }

def do_nothing(*args, **kwargs):
    pass

seg_coins = graph.merge_hierarchical(ws, g, thresh=0.155, rag_copy=True,
                                     in_place_merge=True,
                                     merge_func=do_nothing,
                                     weight_func=weight_boundary)

In [63]:

from skimage import segmentation
plt.imshow(segmentation.mark_boundaries(coins, seg_coins))

Out[63]:

<matplotlib.image.AxesImage at 0x7f2c64497978>

Joining the seeds of doubt

Watershed combined with region agglomeration makes a very good segmentation, but we missed a coin. How can we improve this result?

In [ ]:

Segmentation

Segmenting with filters

Examining the resulting segmentation

Joining the seeds of doubt

Product

Resources

Company