| |
|
 |
The compression method using form dropout works well when there is a known set of scanned forms.
|
 |
The method works on the following principles:
|
 |
identifying the type of form (from the pre-defined set), or by identifying some recognized field stamped on the form,
or by identifying the "form signature."
|
 |
straightening out the scanned form against a template image, which includes: moving, straightening and correction of linear (zoom) and non-linear
deviations (creases, mechanical instability of the scanner, etc.)
|
 |
subtraction of the scanned form from the template image, to receive the resulting "dropped out" image.
|
 |
conventional compression of the dropped out form (MMR).
|
 |
The "form dropout" method improves the compression by a ratio of (x 10)
relative to conventional compression, but naturally, this depends on the type
of form and the average amount of written fill-in relative to the amount of
printed permanent text.
|
 |
Additional advantages are:
|
 |
When the form is identified, you can tell what type it is.
|
 |
When the form is straightened, you can anticipate the precise location of the identification fields.
|
 |
If dropout fails, you can tell that, apparently, the scanning was particularly poor.
|
 |
Disadvantages:
|
 |
over-subtraction, dropout of overlapping areas, between printed text and written fill-in.
|
 |
sensitivity to noise.
|
 |
problems when a form is not identified, or if dropout has failed.
|
 |
Reconstruction is executed by combining the dropped out image of the written fill-in with the image of the form template.
|