|
changes, somewhat, the definition of editing tasks. We do not wish any more to receive a file in which the respondent's incorrect answers and the enumerator's errors in filling out the questionnaire have been corrected. Rather, we want a file which reflects the respondent's answers as they appeared on the questionnaire, using only functional correction of the process fields which the enumerator fill in. | ||||||||||
| ||||||||||
of the respondent with maximum accuracy. The last task means receiving a file in structural units as they were meant to be received in the field. The structural units are: Enumeration Area (EA) (which also serves as the work unit in the data entry process), household record, individual record, a building identified according to a coded address, a page from the questionnaire, and a complete questionnaire (all pages). Most of these units are defined by values which the enumerator places in the appropriate fields. Proper values in these fields enables automatic definition of the structural units. For example, in order to link two parts of a questionnaire to form an extended individual record, the enumerator must fill in three common fields: year of birth, column number, and gender. If automatic definition of the extended individual record is unsuccessful, this is a result of the enumerator's error in filling out these fields. The functional correction of the enumerator's fields in this case is assigning the right column number. |
editing item is the working unit that includes all the problems (failed edit checks) that have been found in one household. |
those relating to definition of the units, enabled the differentiation of the tasks and the specialization of the two types of editors: senior editors, who receive problems involving more than one questionnaire (usually problems in defining the household); and regular editors, who receive problems within the household. |
number printed on it. The problem of defining the household during the editing stage usually involves pages which bear different questionnaire numbers, and therefore, the problem is given to a senior editor for handling. |
during the editing and coding stage |
identified as characteristic to one stage are integrated in other stages as well. This means that editing tasks and other procedures which serve the editing stage are performed from the beginning of the data entry process until the end; in scanning' keying, editing stages and in between. |
and individual. | ||||||||||||||||||||||
| ||||||||||||||||||||||
pages of the questionnaire. | ||||||||||||||||||||||
and the National Population Register is performed; following this stage, the system contains: | ||||||||||||||||||||||
| ||||||||||||||||||||||
automatically performed: | ||||||||||||||
| ||||||||||||||
known as an "editing item". | ||||||||||||||
editors, there is a priority to the senior editor. His editing process may result in joining pages of different questionnaires to one household and than more fields are to be involved in the edit checks and new logical inconsistencies within that household can be found. Therefore the work procedure is as follows: |
| ||||||||
preparation for editing stage, but also during the editing stage itself. Edit checks are performed throughout the editing process, both at the PC stations of each editor and at the server: | ||||||||
| ||||||||
Special Coding items | ||||||||
editors, however, coding items are created for coding open questions. The stipulation for creating them is the existence of text in alphabetic fields or a logical failure involving one of these fields. The stipulations for creating special coding items are: | ||||||||
| ||||||||
branch and coding for occupation, respectively. |
activities |
interface which enables: |
| ||||||||||||||||||
resources and difficulty of their performance. These relatively quick and simple actions are actions whose objective is to correct or confirm values in the questionnaire fields. Even coding open "other" categories is not complicated since the number of entries in the coding dictionary for these variables is relatively small and finite. | ||||||||||||||||||
The single-value variable which identifies the EA is the EA number. An editor who receives a questionnaire from a household with an EA number that is different from the others being handled in the system at the same time, uses the address listed on the first page of the questionnaire to verify that this is, indeed, a questionnaire from a different EA, and transfers it to the virtual box. At a later time, the questionnaire is allocated to its appropriate enumeration area. |
expressed by linking two census records belonging to that individual (the short demographic section and the long socio-economic section, which is filled in by 20% of those respondents over age 15), and linking the individual's census record to the record of the same individual in the National Population Register. |
the enumerator fills in. Errors in these fields create an editing problem and the editor must correct the column number in order to link both parts. |
entry process, from the preparation for keying stage through the editing actions. The linkage itself is divided into two components: locating the census identification number in the Register, and verifying the identification of the individual using rigid criteria, based on demographic variables. Several manipulations are performed on the identification number, in order to completely match the census number with the Register number (see also paper 3.5). |
editor can perform flexible queries to the Register using a flexible variable sample or by flexible definition of values. This is done by entering free strings which represent a character or several characters, for the values in the query. For example, if the third number in the year of birth is not clear, the query can be of a profile including a variable that looks like that: 19%7. The system opens a window with suggestions that are relevant to the query. In this case, all the records of people with identical values as defined in the profile that were born in 1907, 1917, 1927, 1937... would appear in the window. |
Register, with an additional 2% not found in the Register at all (tourists, foreigners). In other words, the scope of unsuccessful linkage is no more than 3%. |
duplicate record for an individual in the same EA or due to deletion in the field by a large X on the individual column in the questionnaire. The individual record was opened inspite of the X because the X went through fields that define the opening of an individual record. |
and coding stations. Defining the household is the most complex among the editing tasks and the need for it arises when the enumerator does not follow instructions. These problems are basically system related problems, since the optical scanning required a qlong form questionnaire with separate pages. |
| ||||||||||||||
occupation. Coding takes place via a process which simulates manual coding; a query is sent on the text containing key words selected by the coder, or with a numeric value, if the coder remember the expected code. Whether the query was an alphabetic query or a numeric query, the coder must choose the suitable option from the suggestion window he gets, and must not rely on his memory. |
the data capture process, only if approved by the experts. In other words, coding improves over time due to the assimilation of the learning process in the dictionaries themselves. |
speeds the process and simplifies it, but does not change it in principle. Coding control is carried out on stand-alone PCs, and not within the ODE. The texts are coded independently a second time and each case of a mismatch between the first code and the second code is sent to an expert for decision. A learning process is also assimilated within this process, so that at the beginning of the process, each coding item was sent for coding a second time, but as knowledge was accumulated regarding problematic codes, it was possible to go to the sample and more effectively utilize the resources offered by man and machine. |
|
the objective of the entire process: production of a structured file containing raw census data. The technological environment and the logical instructions were planned for their functionality, efficiency, and effectiveness in reaching this objective. |
the values recorded on the questionnaire, and a similar rate in defining the structural units. |
maintenance of rigid criteria for the performance of automatic and manual procedures, and the integration of quality control at each of the system's components, all contributed a great deal in the achievement of this goal. |
Copyright © 1997-1999 The State of Israel. All rights reserved.
See "Terms of Use" for the conditions
under which this service may be used.
![]()
![]() |
![]() |
![]() |
![]() |