Data File 30 Day Trailing

The data file contains observations of all household members aged 15 years or more. A household can have more than one member in the file. Each member is uniquely identified by an identity number, and each household is also uniquely identified by an identity number. The first row of the data file contains the variable names (column headers). Each subsequent row consists of the observations of one member of one household. The variables in the data file used for the estimations are described below:
  • HH_ID: This is a unique identifier of a household.
  • COUNTRY: This is the country in which the household is located.
  • STATE: This is the state of India in which the household is located.
  • HR: This is the Homogeneous Region number.
  • TOWN_VILLAGE: This indicates the village number in case of rural regions and town number in case of urban regions.
  • REGION_TYPE: This indicates whether the household is in rural or urban regions.
  • MONTH_SLOT: This indicates the month in which the household was scheduled for execution.
  • EXECUTION_DATE: This is the date on which the survey was conducted.
  • ORV_DATE: This is the date on which the ORV (independent validation) team commenced validation of the household's data.
  • SURVEY_STATUS: This is the status of the household's survey, indicating whether it was 'Accepted' by the ORV team.
  • STRATA: The strata as defined in sample design. This is the combination of State and Region Type.
  • FPC_REGION_HR_STATE_TOTAL: This is the number of Homogeneous Regions (surveyed and non-surveyed) in a state.
  • FPC_REGION_TV_HR_TOTAL: This is the number of towns (surveyed and non-surveyed) in the urban region of a state or the number of villages (surveyed and non-surveyed) in the rural region of a state as per Census 2011, depending on whether REGION_TYPE is urban or rural, respectively.
  • FPC_REGION_HH_TV: This is the estimated number of Households (surveyed and non-surveyed) in the towns or villages (surveyed and non-surveyed) as per Census 2011 depending on whether REGION_TYPE is urban or rural, respectively.
  • MEM_WEIGHT_GE15_STATE_REGTYPE_ACC: This is the weight assigned to a member for trailing 30 day estimations. For towns, it is the ratio of the estimated population (>= 15 years) of the urban region of a State in the trailing 30 days to the surveyed population (>= 15 years) of the urban region of that State in the trailing 30 days. For villages, it is the corresponding ratio for the rural region of the State. The estimated population of members (>= 15 years) in the urban region of a State in the trailing 30 days is calculated by applying the Compounded Annual Growth Rate (CAGR), derived from the Census 2001 and Census 2011 data for the urban population of the State and compounded on a daily basis. The estimated population of members (>= 15 years) in the rural region of a State is calculated in the same way from the Census 2001 and Census 2011 data for the rural population of the State. Surveyed members (>= 15 years) in the urban region of a State on a day are the members of all urban households of the State that were surveyed on that day and accepted by an independent validation system on the same day. Surveyed members (>= 15 years) in the rural region of a State on a day are defined in the same way for the rural households of the State.
  • MEM_ID: This is a unique identifier of a member.
  • EMPLOYMENT_STATUS: This is the employment status as stated by the surveyed member. This variable takes one of the following four values:
    1. Employed
    2. Unemployed, willing to work and looking for a job
    3. Unemployed, willing to work but not looking for a job
    4. Unemployed, not willing to work and not looking for a job
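The description of MEM_WEIGHT_GE15_STATE_REGTYPE_ACC above can be illustrated with a minimal sketch. It is not the procedure actually used to build the file. It assumes that the CAGR is taken over the 10 years between the two censuses, that daily compounding uses a 365-day year, and that the projection starts from a Census 2011 reference date of 1 March 2011. The function names and parameters are hypothetical.

```python
from datetime import date


def daily_compounded_projection(pop_2001, pop_2011, target,
                                census_ref=date(2011, 3, 1)):
    """Project a census population to `target` using the 2001-2011 CAGR.

    Assumptions (not stated in the data description): a 10-year gap
    between censuses, a 365-day year, and `census_ref` as the start date.
    """
    cagr = (pop_2011 / pop_2001) ** (1 / 10) - 1    # annual growth rate
    daily_rate = (1 + cagr) ** (1 / 365) - 1        # equivalent daily rate
    days = (target - census_ref).days               # days elapsed since Census 2011
    return pop_2011 * (1 + daily_rate) ** days


def member_weight(estimated_population, surveyed_members):
    """Weight = estimated population (>= 15) / surveyed and accepted members (>= 15)."""
    return estimated_population / surveyed_members
```

Under these assumptions, every surveyed member in the same State and region type receives the same weight, because the numerator and the denominator are both State-by-region-type totals.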
Dummy Variables
Two dummy variables are added to the data file. These are:
  • UNEMPLOYED: Members whose EMPLOYMENT_STATUS is "Unemployed, willing to work and looking for a job" are assigned the value 1; all other members are assigned the value 0.
  • LABOUR_FORCE: Members whose EMPLOYMENT_STATUS is "Employed" or "Unemployed, willing to work and looking for a job" are assigned the value 1; all other members are assigned the value 0.
After adding the dummy variables, the data file contains 19 variables: 17 raw data variables and 2 derived dummy variables.
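The two dummy variables can be derived from EMPLOYMENT_STATUS as in the sketch below. The sample rows and their weights are invented for illustration and include only the columns needed here. The final ratio, weighted unemployed members over weighted labour force members, is one plausible use of the dummies and is an assumption, not a formula stated in this description.

```python
import csv
import io

# Hypothetical rows with invented values; a real file has all 17 raw columns.
SAMPLE = """MEM_ID,EMPLOYMENT_STATUS,MEM_WEIGHT_GE15_STATE_REGTYPE_ACC
1,Employed,100.0
2,"Unemployed, willing to work and looking for a job",100.0
3,"Unemployed, willing to work but not looking for a job",50.0
4,"Unemployed, not willing to work and not looking for a job",50.0
"""

LOOKING = "Unemployed, willing to work and looking for a job"
WEIGHT = "MEM_WEIGHT_GE15_STATE_REGTYPE_ACC"


def add_dummies(row):
    """Add the UNEMPLOYED and LABOUR_FORCE dummies to one member row."""
    status = row["EMPLOYMENT_STATUS"]
    row["UNEMPLOYED"] = 1 if status == LOOKING else 0
    row["LABOUR_FORCE"] = 1 if status in ("Employed", LOOKING) else 0
    return row


def weighted_total(rows, flag):
    """Sum the member weights of the rows where the dummy `flag` is 1."""
    return sum(float(r[WEIGHT]) * r[flag] for r in rows)


# The first row of the file holds the column headers, so DictReader fits.
rows = [add_dummies(r) for r in csv.DictReader(io.StringIO(SAMPLE))]

# Assumed estimator: weighted unemployed / weighted labour force.
unemployment_rate = (weighted_total(rows, "UNEMPLOYED")
                     / weighted_total(rows, "LABOUR_FORCE"))
```

Note that the status strings contain commas, so they must be quoted in a comma-separated file, as in the sample above.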