Data File Round Wise

The data file contains observations of all members of the households of 15 years of age or more. One household could have more than one member in the file. Each member of the household is uniquely identified by an identity number and each household is also uniquely identified by an identity number. The first row in the data file contains variable names (column headers). Each row after the first row of the data file consists of observations of one member of one household. Variables in the data file used for the estimations are described below:
  • HH_ID: This is a unique identifier of a household.
  • COUNTRY: This is the country in which the household is located.
  • STATE: This corresponds to the states of India, in which the household is located.
  • HR: This corresponds to Region number.
  • TOWN_VILLAGE: This indicates the village number in case of rural regions and town number in case of urban regions.
  • REGION_TYPE: This indicates whether the household is in rural or urban Homogeneous regions.
  • SURVEY_STATUS: This is the status of survey for a household whether it was "Accepted" by the ORV team.
  • FPC_REGION_TV_HR_REGTYPE: This is the number of towns (surveyed and non-surveyed) in the urban Homogeneous region or the number of villages (surveyed and non-surveyed) in the rural Homogeneous region as per Census 2011, depending on whether REGION_TYPE is urban or rural, respectively.
  • MEM_WEIGHT_GE15_HR_REGTYPE_SAMPLE: This is the weight assigned to a member for round wise estimations. For towns, it is the ratio of estimated population (> 15 years) in the urban Homogeneous region in the round to surveyed population (> 15 years) of the urban Homogeneous region in the round, respectively. And for villages, it is the ratio of estimated population (> 15 years) in the rural Homogeneous region in the round to surveyed population (> 15 years) of the rural Homogeneous region in the round, respectively. Estimated population of members (> = 15 years) in urban Homogeneous region in the round is calculated by using the Compounded Annual Growth Rate (CARG) on the Census 2001-Census 2011 data for the urban population of the Homogeneous region , compounded on a daily basis. Similarly, estimated population of members (> = 15 years) in rural Homogeneous region in the round is calculated by using the Compounded Annual Growth Rate (CARG) for the rural population of the Homogeneous on the Census 2001-Census 2011 data, compounded on a daily basis. Surveyed members (> = 15 years) in urban Homogeneous region on a day are members of all the urban households of the Homogeneous region that were surveyed on the day and were accepted by an independent validations system on the same day. Surveyed members (> = 15 years) in rural Homogeneous region on a day are members of all the rural households of the Homogeneous region that were surveyed on the day and were accepted by an independent validations system on the same day.
  • MEM_ID: This is a unique identifier of a member.
  • EMPLOYMENT_STATUS: This is the employment status as stated by the surveyed member. This variable takes one of the following four values:
    • Employed
    • Unemployed, willing to work and looking for a job
    • Unemployed, willing to work but not looking for a job
    • Unemployed, not willing to work and not looking for a job
  • GENDER: The gender of a person. This is a binary response.
  • AGE_YRS: The age in years of a person. This is a numeric variable.
  • AGE_MTH: The age in months of a person. This is a numeric variable.
  • NON_RESPONSE_FACTOR_MEM_GE15_HR_REGTYPE: This is the adjusting factor to take the non-responses in the sample into account. It is the ratio of the sample households in a stratum in the round to the accepted households in that stratum in that round.
Dummy Variables
Two dummy variables are added to the Data file. These are:
  • UNEMPLOYED: Members who have the EMPLOYMENT_STATUS as "Unemployed, willing to work and looking for a job", are assigned numerical value one, else, are assigned numerical value zero.
  • LABOUR_FORCE: Members who have the EMPLOYMENT_STATUS as "Employed" or "Unemployed, willing to work and looking for a job", are assigned numerical one, else, are assigned numerical zero. After adding the dummy variables, the Date file contains 17 variables- 15 raw data variables and 2 derived dummy variables.