Problem Statement

Imagine you work as a data scientist for "ZEE Telugu," one of the most popular entertainment channels in the Telugu states. Your team analyzes user behavior on their OTT platform "ZEE5," which streams popular shows like "Sa Re Ga Ma Pa," "Trinayani," and movies featuring stars like Allu Arjun and Mahesh Babu. You observe a strong positive correlation between the number of features a user interacts with on the ZEE5 platform (such as creating watchlists, using the recommendation engine, and participating in polls about shows like "Intinti Gruhalakshmi") and their total session time, especially during prime time when popular serials like "Rama Sakkani Seetha" air.

1

Correlation vs. Causation in Features

MODERATE

Can you conclude that adding more interactive features like live quizzes about characters from "Ninne Pelladatha" or AR filters featuring Chiranjeevi will directly cause users to spend more time on the ZEE5 platform? Explain why or why not, referencing the difference between correlation and causation, particularly in the context of Telugu viewers' entertainment preferences.

2

Identifying Confounding Variables

MODERATE

What could be a confounding variable in this relationship between feature interaction and viewing time for audiences across regions like Telangana, Rayalaseema, and Coastal Andhra who might have different viewing habits and preferences for content starring actors like Jr NTR or Prabhas?

3

Generating Causal Hypotheses

ADVANCED

How might you descriptively explore this further to generate hypotheses about causation that could help ZEE Telugu's content strategy team led by Ms. Anuradha Gudur make better decisions about future interactive features for major events like their annual Zee Mahotsavam awards ceremony?

 

Nerchuko Academy · Free DS Interview Prep