Dunzo Delivery Partner Incentives
The Challenge: Evaluating Delivery Partner Incentives
Dunzo is providing extra payment (incentives) to its delivery partners during Hyderabad's peak traffic hours (e.g., 7-10 PM on weekdays) and also during specific festival seasons known for high demand. As a data scientist/product analyst, how would you measure the effectiveness of this incentive program in terms of its primary goal: meeting customer demand better? What are the key metrics, analytical approaches, potential challenges, and how would you assess ROI?
Initial Thoughts & Clarifications
- Define "Effectiveness" & "Meeting Customer Demand": What specific, measurable outcomes define success? (e.g., lower wait times, higher order fulfillment, faster delivery).
- Incentive Structure: How are the "extra payments" structured? (Fixed bonus per peak hour trip, surge multiplier on earnings, guaranteed minimum earnings for working during peak). This impacts partner behavior and cost.
- Scope of "Peak Hours" & "Festival Seasons": Are these uniformly defined, or do they vary? (e.g., weekday evenings vs. specific festival days like Diwali, Sankranti in Hyderabad).
- Data Availability: What data can we access? (Order data: timestamps, locations, status; Partner data: login/logout times, location, acceptance/rejection rates, earnings; Customer data: ratings, feedback, cancellation reasons; External data: traffic, weather, city events).
- Causal Inference Challenge: How to isolate the effect of the incentive from naturally higher demand during peak/festivals, or from other concurrent changes (e.g., marketing campaigns, app updates, competitor actions)?
- Control Group Feasibility: Can we create a true control group of partners/areas not receiving incentives during these times, or do we rely on quasi-experimental methods?
- Segmentation: Does the incentive impact different types of partners (e.g., full-time vs. part-time) or different areas within Hyderabad differently?
- Potential Negative Consequences: Could incentives lead to undesirable partner behavior (e.g., only working during incentive periods, selectively rejecting non-incentivized orders, congregating in specific zones)?
- ROI Calculation: How to define and measure Return on Investment for the incentive spend? (Balancing increased operational cost vs. increased revenue/customer satisfaction/partner retention).
- Define Objectives & Key Metrics:
- Primary Goal: Improve ability to meet customer demand during peak/festival times.
- Supply-Side Metrics: Partner availability (density, online hours), acceptance rates, active partners.
- Demand-Side/Service Quality Metrics: Order fulfillment rate, customer wait time for partner assignment, average delivery time, order cancellation rates (system/customer), customer satisfaction (CSAT).
- Financial Metrics: Incremental orders, incremental GMV/revenue, cost of incentives, ROI.
- Establish a Baseline:
- Collect data on all key metrics for a period before the incentive program was introduced (or from comparable periods/areas without incentives).
- Causal Inference Design (Isolating Incentive Effect):
- Ideal (if possible): Phased rollout with geographic A/B testing (some zones in Hyderabad get incentives, comparable zones don't, for a limited time). Hard due to partner mobility/awareness.
- Quasi-Experimental Methods (more likely):
- Difference-in-Differences (DiD): Compare changes in metrics in Hyderabad (treatment) before and after incentive introduction, versus changes in comparable cities without the incentive (control cities) over the same period. Requires good control city selection.
- Interrupted Time Series (ITS): Analyze the trend of metrics in Hyderabad before the incentive and see if there's a significant change in level or slope after its introduction, controlling for seasonality.
- Regression Discontinuity Design (RDD): If incentives kick in sharply at a specific time (e.g., exactly 7 PM) or based on a clear threshold, compare outcomes just before vs. just after.
- Control for confounders: Day of week, time of day (within peak), weather, specific festival days vs. regular peak days, traffic conditions, overall demand levels.
- Data Collection & Quality:
- Ensure accurate tracking of partner online times, locations, order acceptance/rejection with reasons, incentive payouts.
- Monitor for data anomalies or system issues affecting metrics.
- Analysis & Interpretation:
- Calculate lift in key metrics for incentive periods vs. non-incentive periods/areas, adjusting for baselines and confounders.
- Segment analysis: Impact on different partner segments, different zones in Hyderabad, different types of orders.
- Analyze for unintended consequences (e.g., drop in partner availability just outside incentive hours).
- ROI Calculation:
Benefit = (Value of Incremental Orders Fulfilled) + (Value of Reduced Customer Churn due to Better Service) + (Value of Increased Partner Retention/Happiness) ...Cost = Total Incentive Payouts + Operational Overhead of Program.ROI = (Benefit - Cost) / Cost.
- Reporting & Recommendations:
- Present findings clearly to different stakeholders (Product, Ops, Finance, Leadership).
- Recommend continuing, modifying (e.g., incentive amount, timing, targeting), or discontinuing the program based on effectiveness and ROI.
- Suggest A/B tests for optimizing incentive structure if the program shows promise.
Simulated Conversation
- Clearly define "meeting customer demand" with specific, measurable metrics.
- Establish a robust methodology to isolate the incremental impact of these incentives – this is a causal inference problem.
- Analyze both direct outcomes on service quality and supply, as well as potential unintended consequences or behavioral changes in partners.
- Assess the program's financial viability and Return on Investment (ROI).
- Finally, provide actionable recommendations based on the findings.
To start, it's essential to have baseline data: metrics from Hyderabad during these peak/festival periods before the incentive program was launched, and also data from comparable non-incentivized periods or even comparable cities that don't have this specific program, if possible.
Key Metrics for "Meeting Customer Demand":
- Supply-Side Metrics (Impact of Incentives on Partners):
- Partner Availability/Online Hours: Increase in the number of active partners online per hour in target zones (Hyderabad, peak times/festivals).
- Partner Density in Hotspots: Number of available partners per sq km in historically high-demand micro-zones.
- Order Acceptance Rate: Percentage of orders offered to nearby partners that are accepted. (Target: Increase).
- Service Quality & Demand Fulfillment Metrics (Impact on Customers):
- Order Fulfillment Rate (OFR): Percentage of successfully placed orders that are matched with a partner and completed. (Target: Increase).
- Customer Wait Time (for partner assignment): Time from order placement until a partner accepts. (Target: Decrease).
- Average Estimated Time of Arrival (ETA) vs. Actual Delivery Time: We want to reduce overall delivery time, but also improve ETA accuracy. (Target: Decrease average delivery time, decrease ETA variance).
- Order Cancellation Rates:
- System-Initiated (No Partner Available): This is a direct measure of unmet demand due to supply shortage. (Target: Decrease significantly).
- Customer-Initiated (e.g., due to long wait/ETA): Also indicates demand not being met satisfactorily. (Target: Decrease).
- Platform Health Metrics:
- Surge Incidence/Magnitude (if Dunzo uses customer-facing surge): Ideally, increased partner supply due to incentives should reduce the need for or intensity of customer-facing surge pricing.
These metrics give a balanced view of whether the increased supply (due to incentives) translates into better service for customers.
Causal Inference Strategy:
- Treatment Group: Hyderabad (where the incentive program is active during peak/festival periods).
- Control Group Identification (Crucial & Hard):
- Option A: Comparable City/Cities: Identify one or more other Indian cities served by Dunzo that are demographically and economically similar to Hyderabad, share similar baseline demand patterns for Dunzo, but do not have this specific incentive program. This is the ideal control for DiD.
- Challenges: Finding truly comparable cities; ensuring they aren't affected by other unique, large-scale interventions.
- Option B: Non-Incentivized Hours/Days in Hyderabad (Less Ideal but a Fallback): Compare peak/festival incentive periods (7-10 PM, festival days) with non-incentive periods (e.g., 2-5 PM on regular weekdays) within Hyderabad.
- Challenges: These periods are inherently different in terms of demand, traffic, and partner behavior, making it hard to isolate the incentive effect. Requires careful modeling to control for time-of-day effects.
- Option C: Pre-Program Historical Data as Control (Interrupted Time Series - ITS): Analyze the trend of our key metrics in Hyderabad leading up to the incentive program's launch, and then look for a statistically significant change in the level or slope of the trend after launch.
- Challenges: Need sufficient pre-program data; sensitive to other unobserved events coinciding with the launch.
- Option A: Comparable City/Cities: Identify one or more other Indian cities served by Dunzo that are demographically and economically similar to Hyderabad, share similar baseline demand patterns for Dunzo, but do not have this specific incentive program. This is the ideal control for DiD.
- Difference-in-Differences (DiD) Analysis (if Option A is feasible):
- The DiD estimator would be:
Effect = (Metric_Hyderabad_PostIncentive - Metric_Hyderabad_PreIncentive) - (Metric_ControlCity_PostPeriod - Metric_ControlCity_PrePeriod) - This controls for common time trends and factors affecting both Hyderabad and the control city (e.g., broader economic shifts, nationwide app updates).
- We must validate the "parallel trends" assumption: before the incentive, Hyderabad and the control city should have had similar trends for the key metrics.
- The DiD estimator would be:
- Data Collection Periods:
- Pre-Period: At least 4-8 weeks of data before the incentive program started.
- Post-Period: At least 4-8 weeks of data after the program is stable. For festival analysis, this would be the specific festival period.
If a clean control city isn't available, ITS combined with careful modeling of seasonality and time-of-day effects for Hyderabad data would be the next best, acknowledging its limitations in fully ruling out confounders.
Refined Approach for Festival Periods:
- Granular Analysis Tracks within Festivals:
- Festival vs. Non-Festival Peak Hours: Analyze the incentive effect during regular peak hours (7-10 PM on non-festival weekdays) separately from its effect during specific festival days/hours. The incentive structure might even be different.
- This Festival Year vs. Previous Festival Years (Hyderabad only):
- This is an ITS-like approach specifically for the festival. Collect data for the same festival (e.g., Ugadi) in Hyderabad for the past 2-3 years (when there was no such incentive or a much smaller one).
- Model the typical festival uplift for key metrics (e.g., partner online hours, orders) based on historical data, controlling for day-of-week of festival, overall year-on-year growth.
- The "treatment effect" would be the deviation in the current festival year (with incentive) from this historically predicted festival pattern.
# Conceptual: Historical festival uplift model # Sales_festival_hist = β₀ + β₁(Year) + β₂(DayOfWeekOfFestival) + ε # Predicted_Sales_current_festival_NO_INCENTIVE = model.predict(current_year_features) # Incentive_Effect = Actual_Sales_current_festival_WITH_INCENTIVE - Predicted_Sales_current_festival_NO_INCENTIVE
- Control for Festival Intensity (if possible):
- If the festival has multiple days of varying importance (e.g., main festival day vs. preceding/succeeding days), analyze these separately. The incentive's incremental effect might be larger on shoulder days than on the absolute peak day when supply is already maxed out by natural festive spirit.
- Use external data if possible: e.g., Google Trends for "Ugadi shopping" in Hyderabad vs. control city to gauge relative interest/intensity.
- Partner Behavior Specific to Festivals:
- Survey partners: Are they primarily motivated by the festival spirit to work, or is the incentive the key driver during the festival? This qualitative data can add context.
- Analyze if the incentive primarily brings new/less active partners online during festivals, or just makes already active partners work more hours.
- Refined DiD with Festival Interaction:
- If using a control city, interact the DiD term with a "festival_period" dummy:
Metric_it = ... + β₃(Treat_i * Post_t) + β₄(Treat_i * Post_t * FestivalPeriod_t) + ...Here, `β₄` would capture the additional effect of the incentive during festival periods over and above its effect in regular peak times. This requires the control city to also have some form of "FestivalPeriod_t" even if the celebration is minor.
- If using a control city, interact the DiD term with a "festival_period" dummy:
This requires careful, segmented analysis. The cleanest signal for the incentive's effect might come from non-festival peak hours. For festival periods, we'd be estimating the incentive's power to further boost supply beyond the natural festival surge, which is a harder but crucial estimation.
ROI Analysis Framework:
A. Quantifying Benefits (Incremental Value):
- Incremental Orders & GMV/Revenue:
- Using the causal inference methods (DiD, ITS), estimate the number of additional orders fulfilled during incentive periods that would not have been fulfilled otherwise (due to lack of partner supply).
- Calculate the Gross Merchandise Value (GMV) and Net Revenue (GMV - refunds/cancellations - marketplace commissions paid to merchants if applicable) from these incremental orders.
- Value of Reduced Lost Sales:
- Estimate the revenue saved from orders that would have been system-cancelled (no partner) or customer-cancelled (due to long wait) without the improved supply from incentives.
- Long-Term Value (Harder to quantify but important):
- Improved Customer Retention: Did customers who experienced better service (faster delivery, higher fulfillment) during these peak/festival times show higher retention rates or purchase frequency in subsequent months compared to those in control groups/periods or those who had poor experiences? This requires cohort tracking.
- New Customer Acquisition/Conversion: If improved service availability attracts new users or converts first-time users impressed by peak-time reliability, their LTV contributes.
- Increased Partner Retention: Do partners receiving these incentives show higher retention on the platform? Reduced partner churn saves on acquisition and training costs for new partners. (This is a cost saving).
- Brand & Reputation Value (Qualitative initially, potentially quantitative later):
- Monitor CSAT scores, app store reviews, social media sentiment regarding Dunzo's reliability during peak times in Hyderabad. Positive shifts have long-term value.
B. Quantifying Costs:
- Total Incentive Payouts: The direct cost of the extra payments made to delivery partners during the program.
- Operational Overhead: Any additional costs for managing, monitoring, and supporting this incentive program (e.g., data analysis effort, support team queries about incentives). Usually smaller but should be considered.
- Potential Cannibalization (if incentives are too rich): If partners only work during incentive hours and reduce their availability during non-incentivized but still necessary hours, this could be a hidden cost or negative impact elsewhere.
C. Calculating ROI:
- Short-Term ROI:
(Incremental Net Revenue from Promo - Total Incentive Payouts - Incremental Ops Costs) / (Total Incentive Payouts + Incremental Ops Costs)Focus here might be on contribution margin from incremental orders. - Long-Term ROI (More Holistic):
(Sum of all Quantified Incremental Benefits including LTV uplift - Total Costs) / Total Costs
A key challenge is accurately attributing the "Value of Reduced Customer Churn" or "LTV uplift" causally to the incentive program. This requires careful cohort analysis of customers who ordered during incentivized vs. non-incentivized conditions (in treatment vs. control settings).
Primary Data Sources Required:
- Order Management System (OMS) Data:
- Order timestamps (creation, acceptance, pickup, delivery), order value, items, customer ID, delivery location (lat/long, Pincode), order status (completed, cancelled - by whom and reason).
- ETAs provided vs. actual delivery times.
- Delivery Partner Platform Data:
- Partner ID, login/logout timestamps, online duration, current location (GPS pings), order acceptance/rejection timestamps and reasons, active on-trip time vs. idle time.
- Historical performance metrics for partners (ratings, past earnings, tenure).
- Partner Payout & Incentives System:
- Detailed records of all incentive payments made, type of incentive, criteria met, associated order IDs (if applicable), regular earnings.
- Customer Relationship Management (CRM) & Feedback System:
- Customer ratings for deliveries/partners, customer complaints (categorized), CSAT survey results if available for these periods.
- Customer segmentation data (new/existing, LTV tier, demographics if available).
- Pricing & Promotion Engine Data:
- Details of when incentives were active, for which zones/partners, the exact structure of the incentive.
- Data on any customer-facing surge pricing active concurrently.
- External Data (To Control for Confounders):
- Weather data for Hyderabad (rain can heavily impact delivery).
- Traffic data (e.g., from Google Maps API or similar for general traffic conditions).
- Local events calendar for Hyderabad (public holidays, major city events beyond just the big festivals we are targeting).
Data Quality Assurance Measures:
- Completeness Checks: Ensure no major data gaps for critical fields during the analysis periods. Investigate any system outages that might have affected data logging.
- Accuracy Validation:
- Cross-validate timestamps across systems (e.g., order placed in OMS vs. offer sent to partner app).
- For GPS data, implement cleaning logic to handle inaccuracies or jumps.
- Reconcile incentive payouts with actual partner activity and order completion.
- Consistency Checks: Ensure definitions (e.g., "active hour," "peak period") are consistently applied across all data sources and analyses.
- Outlier Detection: Identify and investigate extreme values in delivery times, wait times, or partner earnings, which might indicate data errors or unique situations.
Potential Biases to Watch For:
- Selection Bias (Partner Self-Selection): Partners who choose to work during incentive periods might be inherently different (e.g., more experienced, more motivated by earnings, different vehicle types) than those who don't. This can affect how they respond to incentives and their baseline performance. Randomization in an A/B test helps, but in observational studies, we need to control for these characteristics.
- Observer Effect / Hawthorne Effect (if partners know they are being heavily monitored for an "experiment"): Unlikely for a broad incentive program but possible in smaller pilots.
- Reporting Bias / Gaming by Partners:
- Partners might artificially stay online just within incentive zones/times without intending to take many orders if the incentive is partly for just being online.
- They might collude to manipulate surge or demand signals if the incentive structure is exploitable.
- Selectively accepting only high-value/easy orders during incentive periods, worsening service for other orders.
- Survivorship Bias: If we only analyze completed orders, we miss insights from unfulfilled demand or orders cancelled due to lack of partners. Need to look at the "intent to order" or "order request" stage.
- Confounding External Events: A major traffic disruption, unexpected city event, or competitor action coinciding with an incentive period can distort perceived effectiveness. This is why control groups/methods are vital.
- Time Lag Bias: The full impact of improved customer satisfaction or partner retention might not be visible in the short term, leading to underestimation of long-term ROI.
Stakeholder-Specific Reporting:
1. For CEO / Senior Leadership (Strategic Focus):
- Format: Concise executive summary (1-2 pages) + short presentation.
- Content:
- Overall Program Effectiveness: Did it meet the primary goal of improving customer demand fulfillment during critical periods? Show top 2-3 KPIs (e.g., lift in Order Fulfillment Rate, reduction in Customer Wait Time).
- Financial Impact (ROI): Clear statement on short-term and projected long-term ROI. Was it profitable? By how much? What are the confidence intervals or scenarios (base, bull, bear)?
- Strategic Implications: Should we continue, expand, modify, or discontinue the program? Impact on market share in Hyderabad, competitive positioning.
- Key Risks & Opportunities: Highlight major learnings and future opportunities for optimization.
- Visuals: High-level trend charts, summary scorecards, clear ROI figures.
2. For Operations Team (Operational Efficiency Focus):
- Format: Detailed operational dashboard + regular review meetings.
- Content:
- Partner Supply Dynamics: Deep dive into partner availability, online hours, acceptance rates, partner density in hotspots during incentive vs. non-incentive periods. Impact on specific zones within Hyderabad.
- Efficiency Metrics: Changes in average delivery time components (wait time, travel time), partner utilization rates, orders delivered per partner hour.
- Identification of Gaming/Negative Behaviors: Any evidence of partners exploiting the system, and its impact.
- Cost Per Incremental Order Fulfilled: How much incentive did it take to get one extra order delivered?
- Recommendations: Specific suggestions for optimizing incentive amounts, timing, geographical targeting, or rules to improve operational outcomes.
- Visuals: Heatmaps of partner density, time-series plots of operational metrics, detailed cost breakdowns.
3. For Product Team (Customer & Partner Experience Focus):
- Format: Product review presentation + interactive dashboard.
- Content:
- Customer Experience Metrics: Impact on OFR, wait times, delivery times, cancellation rates, CSAT scores, customer complaints specifically related to peak/festival deliveries.
- Partner Experience Metrics: Partner earnings during incentive periods (gross and net, if estimable), partner satisfaction surveys (if conducted), partner churn/retention rates for those participating in incentives.
- App/Platform Impact: Any strain on the system during incentivized peak loads? Impact on ETA accuracy algorithms?
- A/B Test Results for different incentive structures (if run).
- Recommendations: Product changes to better support partners during peak times, improvements to incentive communication in-app, ideas for tiered incentives based on partner performance.
- Visuals: Customer journey maps during peak, partner earnings distributions, CSAT trends.
4. For Finance Team (Financial Viability Focus):
- Format: Detailed financial model and report.
- Content:
- Precise calculation of incremental revenue, cost of incentives, other variable costs, and net profit/loss attributable to the program.
- Sensitivity analysis of ROI based on different assumptions (e.g., partner response rate, discount depth, long-term customer retention).
- Budget variance: Actual spend vs. planned spend for incentives.
- Projections for scaling the program to other cities or future festivals.
- Visuals: Waterfall charts for profit breakdown, scenario analysis graphs, LTV projection models.
Across all stakeholders, I'd emphasize the methodology used to ensure confidence in the findings, clearly state assumptions, and highlight the statistical significance (or lack thereof) of observed changes.
Success Criteria for the Incentive Program:
These would be multi-dimensional, reflecting the program's goals:
- Improved Demand Fulfillment (Primary Goal):
- Statistically significant increase in Order Fulfillment Rate (OFR) during target incentive periods by at least, say, 10-15% compared to the control/baseline.
- Statistically significant reduction in Average Customer Wait Time for partner assignment by at least, say, 20-25%.
- Statistically significant reduction in System-Initiated Cancellations (due to no partner) by at least, say, 30-40%.
- Positive Return on Investment (Financial Viability):
- Demonstrably positive short-term incremental profit contribution OR a clear, modeled path to positive long-term ROI (e.g., through improved customer LTV due to better service) within a defined payback period (e.g., 6-9 months). A target ROI might be >10-20%.
- Sustainable Partner Engagement (Supply Health):
- Increased partner online hours and active partner density in target zones during incentive periods without a significant corresponding drop in availability during adjacent non-incentivized periods (i.e., avoiding merely shifting supply).
- Partner satisfaction with the incentive program (e.g., >70% satisfaction in surveys, or positive trend in qualitative feedback).
- No significant increase in detrimental partner gaming behaviors that negatively impact service quality.
- Maintained or Improved Customer Satisfaction (Guardrail):
- No statistically significant decrease in overall CSAT scores for orders placed during incentive periods. Ideally, an improvement due to better service.
Discontinuation or Significant Overhaul Triggers:
- Persistent Negative ROI: If, after a reasonable period (e.g., 3-6 months of operation and analysis, including attempts to optimize), the program consistently shows a negative net financial impact, and projected LTV gains don't credibly offset this.
- Failure to Meet Primary Service Level Targets: If, despite the incentives, there's no statistically significant improvement in OFR or customer wait times, or if improvements are marginal and don't justify the cost.
- Widespread Negative Unintended Consequences:
- Significant increase in partner gaming that degrades overall service quality (e.g., artificially inflating wait times to trigger higher surge components of incentives, excessive order rejections for non-lucrative trips).
- Severe cannibalization of partner supply from other critical times/zones.
- Deterioration in customer satisfaction or trust due to perceived unfairness or issues arising from the incentive system.
- Discovery of More Cost-Effective Alternatives: If other strategies (e.g., better dispatch algorithms, different non-monetary partner engagement programs, minor customer-facing surge) are A/B tested and found to achieve similar or better demand fulfillment at a lower cost.
- Partner Over-Reliance & Unsustainability: If partners become so reliant on incentives that baseline service collapses when incentives are even slightly reduced, indicating the core earning proposition isn't viable without constant artificial boosts. This creates an unsustainable cost model.
Any decision to overhaul or discontinue would involve a thorough review of why the program isn't meeting expectations and whether modifications (e.g., different incentive amounts, structures, targeting) could salvage it before a full stop. The key is an iterative, data-driven approach.
What to Learn from This Case
- Structure is Key: Start with a clear problem definition and a structured analytical framework.
- Define Measurable KPIs: Clearly define what "success" or "effectiveness" means in terms of specific, measurable metrics for both supply and demand sides.
- Focus on Causal Inference: The core challenge is often isolating the true impact of an intervention. Understand and propose appropriate experimental or quasi-experimental designs (e.g., DiD, ITS, RCT if feasible).
- Address Confounding Variables: Actively identify and plan to control for factors that could distort results (e.g., seasonality, external events, inherent differences between groups).
- Consider ROI Holistically: Think about both direct costs/benefits and indirect/long-term value (LTV, brand perception, partner retention).
- Anticipate Data Challenges: Be aware of data quality issues, necessary data sources, and potential biases in data collection or program implementation.
- Tailor Communication: Understand that different stakeholders (CEO, Ops, Product, Finance) need different levels of detail and types of insights.
- Set Clear Success/Failure Criteria: Define objective thresholds for program continuation, modification, or discontinuation based on data.
- Think About Unintended Consequences: Consider how the program might lead to undesirable behaviors (e.g., partner gaming) and how to monitor for them.
- Iterative Approach: Emphasize that analysis and program management are often iterative processes of testing, learning, and refining.