- Partition image pixels using k-means
- Extract connected components
- For each connected component extract the following features:
- Color histogram and a measure of consistency
- Stroke Width Transform histogram and a measure of consistency
- Number of holes in a component
- For every pair of connected components compute the following similarity metrics:
- Color Similarity
- Stroke Width Similarity
- Pixel distance in x and y directions separately
- Graph distance between every pair of elements (this means how many times you need to move from one component to a neighboring component to reach from component A to component B).
- Merge component according to the similarity metrics. We have a hand-learned criteria.
- Classify each component according to whether it looks like text or not. (We have a hand-set criteria.) Use the following features for text classification:
- Area (the number of pixels)
- Stroke Width Consistency
- Width/Height
- relative Stroke Length: Area/(Stroke_Width^2)
- Sparsity
- Display text regions together with their bounding boxes
Here are the outputs:
https://dl.dropboxusercontent.com/u/20022261/reports/text_segmentation_benchmark.html
No comments:
Post a Comment