AC Survey

Data summary

Main Variables

##  [1] "ivr_mobile_number"        "age"                     
##  [3] "gender"                   "audience_name"           
##  [5] "ivr_response_status"      "ivr_response_status_name"
##  [7] "ac_name"                  "pc_name"                 
##  [9] "district_name"            "state_name"              
## [11] "q1_response"              "q2_response"

Variable summary

state_namen_calln_acn_pcq1_response_rateq2_response_rateq1_and_q2_response_rateq1_or_q2_response_rate
Gujarat21112170262.732.901.873.76
Rajasthan31314198257.499.225.9710.74
Telangana2802791159.439.657.2511.83
Uttar Pradesh15090393805.887.674.638.91
West Bengal19195284424.685.703.446.94

state_namemean_call_acmean_q1_or_q2_response_acmean_age_acmean_male_pc
Gujarat124.194.67NaNNA
Rajasthan158.1516.9945.7374.87
Telangana307.9936.4436.7466.56
Uttar Pradesh38.403.4237.8584.06
West Bengal67.594.6925.0444.71

States


Calls & Responses

States

The following plot shows the proportion and the number of calls made and responded in each state along with the break-up of the gender. It is evident that, there is a huge gender bias in the calls made in the all three states except West Bengal. Also, among the four states, Uttar Pradesh seems to have received the least number of calls contradicting its population size.

Throughout the analysis, we count a responded call as the ones that have answered at least one of the two questions. It is quite evident that the number of respondents is relatively much less, averaging around 5% of the calls made. At the same time, the gender proportion of the respondents is identical to the callers.


We use density plots to observe the distribution and the differences of Number of calls , Number of responses and the Response rate. among the states. Response rate is simply the percentage of the calls that received response to at least one of the the two questions. We observe significant variation among the states on all three variables.


Assembly constituencies

We explore the variation within the state by looking at the assembly constituency level.The following box plots illustrates the distribution of the calls, response and the response rate in all unique assembly constituencies in each state. The jittered dots represents the relavant value of the variables in every unique constituency in each states.


Gender

Density plots along the lines of gender does reflect the sample bias in the gender as expected . Telengana and Rajasthan have a fair distribution given we take the sample bias into account while Uttar Pradesh has a inconsistent distribution. West Bengal seems to have a symmetrical distribution.

Age

The following plot depicts the age distribution along with gender of the the callers. Looking at the aggregate picture, we observe that the proportion of calls to individuals less than 25 years is quite high among women.

Once we look further into the states, the general trend fades away and looks more like the gender is fairly distributed along the age across the states. All the states except West Bengal has a fair distribution of age among both genders, while West Bengal one is extremely right skewed with a high number of young responders under 25 years of age.

Demography

Urban

In this section, we are attempting to ensure that there is no urban/rural bias in the sample. We define urban ACs as the constituencies with more than 40% of urban areas. The distribution of urban/rural ACs in the dataset looks like this.

state_nameurban_ycountn_accalls_per_acresponse_per_acresponse_rate_per_ac
Gujarat01131885133.155.253.94
Gujarat1543847115.704.513.90
Rajasthan027612170162.4217.5810.82
Rajasthan1368826141.8514.5410.25
Uttar Pradesh01352233540.363.548.76
Uttar Pradesh113815326.062.539.70
West Bengal01452521069.175.077.33
West Bengal140856364.843.785.83

Calls

We see that both mean values and calls per constituency is considerably less for urban constituencies. We scrutinise this further by looking at a box plot.

Response rate

It seems that there is a rural bias in the sampling even though the response rate from urban and rural area looks similar. We can statistically confirm this using a t-test.

## 
## 	Welch Two Sample t-test
## 
## data:  survey_all_urban$count by survey_all_urban$urban_y
## t = 24.685, df = 22658, p-value < 2.2e-16
## alternative hypothesis: true difference in means is not equal to 0
## 95 percent confidence interval:
##  15.10509 17.71079
## sample estimates:
## mean in group 0 mean in group 1 
##        126.7733        110.3653

We observe that there is a difference in the mean’s of both urban and rural constituencies and it is significant at .005 level. Hence, it is safe to conclude that there is a rural bias in the sample.

SC/ST

In the following table, sample_prop is the proportion of calls made to the that particular category and the electorate_prop is the electorate proportion of that category.

State_Namereservationelectorate_propstate_namecalls_propresponse_prop
GujaratGEN0.79Gujarat0.930.94
GujaratSC/ST0.21Gujarat0.070.06
RajasthanGEN0.71Rajasthan0.700.69
RajasthanSC/ST0.29Rajasthan0.300.31
Uttar_PradeshGEN0.79Uttar Pradesh0.770.77
Uttar_PradeshSC/ST0.21Uttar Pradesh0.230.23

In both UP and Rajasthan, there is a fair representation of SC/ST community in the calls made and responses received. But, in Gujarat, representation of SC/ST in this samples falls way below of their electoral proportion.

Answers

This analysis only includes only individuals who have atleast responded either one of the two questions.

All India

Education

In this grapph, we look at how people’s response differes with their education levels.

Gender

Here, we check if we observe any difference in the answers with regards to gender. As we can see, clearly there no difference along the gender lines.

States

In the following charts, we observe how answers variate in different states with regards to education and gender.



Gender

Abdul Najah
Research Assitant