Reverse Engineering Graphic Designs: Text detection pipeline

Monday, July 7, 2014

Text detection pipeline

Partition image pixels using k-means
Extract connected components
For each connected component extract the following features:

Color histogram and a measure of consistency
Stroke Width Transform histogram and a measure of consistency
Number of holes in a component

For every pair of connected components compute the following similarity metrics:

Color Similarity
Stroke Width Similarity
Pixel distance in x and y directions separately
Graph distance between every pair of elements (this means how many times you need to move from one component to a neighboring component to reach from component A to component B).

Merge component according to the similarity metrics. We have a hand-learned criteria.
Classify each component according to whether it looks like text or not. (We have a hand-set criteria.) Use the following features for text classification:

Area (the number of pixels)
Stroke Width Consistency
Width/Height
relative Stroke Length: Area/(Stroke_Width^2)
Sparsity

Display text regions together with their bounding boxes

Here are the outputs:
https://dl.dropboxusercontent.com/u/20022261/reports/text_segmentation_benchmark.html

No comments:

Post a Comment

Subscribe to: Post Comments (Atom)