Can better incentives improve government efficiency?

Senior Editors

Guo Xu

Assistant Professor, Berkeley Haas School of Business

Co-editors

Downloads:

Download

Chapter 3

Improving bureaucrat performance through incentives

A common observation for public sector organisations is that explicit incentives (e.g. bonus pay and promotion contests) are rarely used (Holmstrom and Tirole 1989, Dewatripont et al. 1999a, Dewatripont et al. 1999b). Firing costs are often high for public servants, and formal incentives are rarely a feature of bureaucratic remuneration.

The new public management literature of the 1980s and 90s led to a number of policy experiments on incentives and proved to be controversial (see Hood 1995 for a review). Important sources of controversy surrounded issues of multi-tasking, perverse incentives and high transaction costs due to the increased need of specifying and monitoring contracts (Williamson 1979). In recent years, however, there has been a renewed interest in whether there is greater scope for such rewards. These studies of incentives face a series of challenges that are particularly salient in the public sector context.

Difficulties of measuring performance

The implementation of incentive contracts requires a mapping between a bureaucrat’s output and reward. In the public sector, the overarching challenge in the implementation of incentive contracts is the measurement of performance. Bureaucrats complete complex tasks that are particularly difficult to quantify. To measure output, work in private sector settings would focus on specific production processes, such as the installation of windshields (Lazear 2000), the picking of fruits (Bandiera et al. 2009), or line-level productivity in factories (Atkin et al. 2017). While such well-defined tasks may exist for front-line providers such as nurses or teachers (Ashraf et al. 2014, Deserranno et al. 2022, Muralidharan and Sundararaman 2011, Duflo et al. 2012), measuring output for more senior bureaucrats who implement policy and design rules is more difficult and sometimes performance is proxied by compliance with rules. More generally, a major challenge in low state capacity settings is limited ability to verify reported compliance (Andrews et al. 2017).

Furthermore, organisational goals in the public sector are most often multi-dimensional and non-verifiable. The first of these raises the issue of how different dimensions of performance are aggregated and/or traded-off against each other. The second implies that it is difficult, even ex post, to establish whether a particular goal was met. Another issue, which we refer to in greater detail below, is the attribution of individual contributions in team production, where the measure of performance cannot be disentangled across agents (Holmstrom 1982). Finally, with most transactions occurring inside the organisation, output is seldom evaluated in markets, thus making it hard to value (Downs 1965).

Measurement issues are further compounded by challenges of mission design. Bureaucracies by their nature are not geared towards narrow goals based on financial criteria. Thinking of bureaucracies as mission-driven organisations is most closely associated with Wilson (1989) and is also emphasised in Tirole (1994). The notion of a mission is a catch-all for a range of outcomes that a bureaucracy might pursue.[1]1. There has been an increased effort in trying to establish how bureaucracies are performing. As a result, there has been great interest in cross-country comparisons. One famous example is the World Bank's Doing Business project which provides evaluation and ranking across a range of state roles in supporting markets. For example, there is an attempt to measure how costly it is to set up a new business.

The microeconomic literature has taken several approaches to performance measurement. The most common approach restricts the analysis to bureaucrats and tasks that can be more easily measured, like agricultural extension workers (Dal Bó et al. 2021), revenue collectors (Khan et al. 2016, Khan et al. 2019, Aman-Rana 2022), health care providers (Ashraf and Bandiera 2018, Deserranno et al. 2022, Khan 2020), teachers (Akhtari et al. 2022, Leaver et al. 2021, Brown and Andrabi 2021), procurement officers (Bandiera et al. 2021, Best et al. 2019) or judges (Dahis et al. 2023, Mehmood 2021). The clear advantage of this approach is the direct mapping from an individual to a comparable outcome. The disadvantage is that this approach works – with exceptions – mostly for lower tier civil servants who are more specialised. An exception perhaps is the use of attendance data, which provides an extreme measure of non-performance that can be applied to all workers (Chaudhury et al. 2006, Dhaliwal and Hanna 2017, Callen et al. 2023).

To make progress, a second approach has followed the CEO literature (Bertrand and Schoar 2003) by attempting to map higher-level individuals to an aggregate outcome. In the private sector setting, CEO traits may be related to company-specific outcomes such as profits or stock market returns. Examples in public organisations include provincial governors and GDP growth (Jia 2017), governors and colony-level revenue generation (Xu 2018), field office managers and office-level outcomes (Fenizia 2022), and district-level development outcomes (Gulzar and Pasquale 2017). While this approach allows the study of the impact of more senior officers on aggregate outcomes, the exact mechanism through which they affect outcomes is hard to pin down. Furthermore, this approach is also limited to organisations with many comparable high level units that serve the same functions, such as field offices or districts.

Finally, another strand of the literature uses subjective performance measures. Such ratings are frequently found in internal evaluations across both private and public organisations. Rasul and Rogger (2018), for example, code administrative project reports to obtain project completion ratings and relate them to management practices. Limodio (2021) uses internal project performance ratings of the World Bank to study the allocation of World Bank staff. Bertrand et al. (2020) field a large-scale survey in which they collected subjective assessments of senior Indian civil servants among dimensions such as effectiveness, probity or pro-poor orientation. The advantage of this approach is that it can be applied to any task and output (including qualitative), providing a more holistic measure. The disadvantage is that perceptions could be biased, thus calling for the need of objective measures to validate or complement.

Overall, the measurement of bureaucrat performance remains a challenge. Approaches in the literature are highly context-specific, depending on the type of public officials studied, and the complexity of the job they execute.

Multi-tasking and implementation challenges

Even when output measures are available, the choice of how to map output to reward remains an open question. Assuming that output only has a single dimension is often unrealistic. Bureaucrats also frequently work across multiple tasks and can therefore choose which tasks to concentrate their effort on and hence which outputs are favoured (Holmstrom and Milgrom 1991).[2]2. Another dimension of a bureaucrat's action could also be whether to ask for a bribe. Then whether a bribe is paid becomes a dimension of (non)-performance.

The main implication of multi-tasking, which has been discussed extensively, arises when some tasks are more easily measured than others and are incentivised. This is particularly problematic when the efforts put in different dimensions are substitutes. A classic example is when teachers who are incentivised according to test results focus excessively on this rather than on all round performance. One way around this is to get better measures of performance on alternative dimensions and the other is to have a less steep incentive scheme. In many settings, it is easier to implement a non-linear compensation scheme, such as a bonus paid for the highest performer (e.g. a monthly competition), or a threshold rule (e.g. bonus paid for each student with straight As). While such compensation schemes are abundant, such nonlinearities have distortionary effects. In the education setting, for example, conditioning teacher remuneration on test score outcomes could lead teachers to spend more time developing test-taking skills rather than general instruction (Glewwe et al. 2010). If teachers are compensated based on the number of students passing an exam, teachers may also divert effort away from the inframarginal students towards the marginal students close to the passing threshold (Neal and Schanzenbach 2010, Ahn and Vigdor 2014). Similarly, when incentive contracts reflect tournaments, the incentive effect may be large for those who are marginal but absent for those who are inframarginal. In Khan et al. (2019), for example, high performing revenue officers are rewarded with the transfer to their preferred work locality. The incentive effects, however, depend on how many other officers compete over the same locality. Officers competing over popular localities may thus be disincentivised if they perceive their chances to “win” to be low. Similarly, officers who prefer less popular districts may have little incentive to exert effort if they stand to receive their allocation anyway.

Relatedly, there is a choice of “who to incentivise.”’ Public service provision requires coordination between multiple types of stakeholders and incentive design must reflect that. Deserranno et al. (2022) provide first empirical evidence that the allocation of incentives across the hierarchy of an organisation matters in the context of health-care provision in Sierra Leone. They find that sharing incentives equally between the lower and upper layer of the hierarchy raises output by 61% compared to unilateral allocations that are typical in public organisations. Another challenge arises from the very nature of public services that often involves inputs from both providers and citizens. If providers believe that all their effort and resources will be substituted away by citizens, incentives may not work. This has been documented as an important factor behind the limited effectiveness of input augmentation policies in education (Pop-Eleches and Urquiola 2013, Mbiti et al. 2019). Finally, there is a general concern that incentives are harder to implement for more senior civil servants. To prevent influence activities and political interference, classic bureaucracies have typically relied on easily measurable characteristics such as seniority (Prendergast 1999, de Janvry et al. 2020). A downside of such rigid rules however is that they may disincentivise performance (Bertrand et al. 2020).

The second consideration for designing incentive contracts is whether to contract on output or input. Output is often not only imperfectly observed but also subject to shocks beyond the control of the bureaucrat. However, in some contexts inputs are easier to observe and more closely reflect deliberate choices made. In the teacher example, performance pay could be either based on output – e.g. test scores (Muralidharan and Sundararaman 2011) – or based on inputs – e.g. teacher attendance (Duflo et al. 2012). The key difference lies in how much autonomy is granted to the bureaucrat. By contracting on inputs, the designer implicitly commits to a specific mapping between input and output. To the extent that the production function is more complex, however, the bureaucrat – by virtue of expertise – may possess better information about the optimal mix of inputs. Dal Bó et al. (2021) provide evidence for this in the context of agricultural extension workers in Paraguay. When provided with a monitoring technology to supervise subordinate workers, they find that middle-managers prioritise those subordinates who would be more responsive to the treatment. In general, incentivising on inputs may make more sense when outcome is difficult to measure or monitor, e.g. patient health, and when production inputs are clearly identifiable, feasibly measured, and non-substitutable, e.g. teacher attendance. Incentivising on outputs may make more sense when production inputs are difficult to identify, measure, or monitor, e.g. tax collector’s effort, and when outcome must meet a threshold, e.g. test scores.

Despite challenges in the design and implementation of incentives that can make these fail or even backfire, recent research does suggest that incentives “work” if well designed. There is now a large body of work that documents the role of incentives for front-line public service providers such as health care workers (Ashraf et al. 2014), teachers (Muralidharan and Sundararaman 2011, Leaver et al. 2021) and tax collectors (Khan et al. 2016). These studies focus on tasks for which performance is easier to measure. Multi-tasking concerns are often directly anticipated and built into the research design, mostly by attempting to measure both incentivised and non-incentivised outcomes. Khan et al. (2016), for example, designed an incentive scheme to reward tax collectors based on revenue collection. To test for the role of multi-tasking, they include two additional treatment arms: a “revenue plus” that ties the bonus not only on revenue generation but also taxpayer satisfaction, and a “flexible bonus” that is based on a more holistic subjective evaluation.

Non-monetary incentives

Despite the renewed interest in incentives in public organisations, the use of explicit, monetary incentives remains the exception rather than the norm. Instead, bureaucracies have relied on alternative, indirect and non-monetary means to incentivise performance, such as leveraging heterogeneity in the desirability of (same-seniority) postings either along vertical traits (e.g. prestige) (Iyer and Mani 2012, Jia 2017) or horizontal traits (e.g. personal preference) (Khan et al. 2019). The implementation of such incentive schemes, however, still hinges critically on the accurate measurement of performance. It is perhaps for that very reason that indirect means of inducing performance, for example through rotations or high turnover, have often been excessively used for political purposes (Akhtari et al. 2022) and satirised as a “bureaucratic merry-go-around” (De Zwart 1994). For more senior positions an important determinant of effort and choices made are accountability systems that punish non-compliance with rules through criminal charges or career consequences, serving as a de facto negative non-monetary incentive. There is an incipient literature on how such systems can lead to waste in government (Bosio et al. 2022, Bandiera et al. 2021). Considering how pervasive these systems are in developing countries, more work is needed on that front.

Finally, it is often suggested that those who work in bureaucracies are mission-motivated and thus care about the output even if their monetary compensation is not explicitly tied to it (Ashraf and Bandiera 2018, Besley and Ghatak 2018, Bénabou and Tirole 2006). Mission may thus be a potentially cost-effective way to incentivise performance when bureaucracies have limited budgets. Khan (2020) provides experimental evidence from healthcare workers in Pakistan that greater mission emphasis helps increase worker performance and improve health outcomes. Importantly, the greater focus on mission helps increase performance even on dimensions that were not explicitly incentivised, suggesting that mission-motivation may also help alleviate multitasking problems. In the contemporary US setting, Spenkuch et al. (forthcoming) show that ideological alignment of procurement officers with the serving President increases performance and self-reported morale.

References

Ahn, T, and J Vigdor (2014), “When Incentives Matter Too Much: Explaining Significant Responses to Irrelevant Information”, National Bureau of Economic Research Working Paper No. 20321.

Akhtari, M, D Moreira, and L Trucco (2022), “Political Turnover, Bureaucratic Turnover, and the Quality of Public Services”, American Economic Review, 112(2): 442-493.

Aman-Rana, S (2022), “Meritocracy in a Bureaucracy”, Working Paper.

Andrews, M, L Pritchett, and M J V Woolcock (2017), Building State Capability: Evidence, Analysis, Action, Oxford University Press.

Ashraf, N, and O Bandiera (2018), “Social Incentives in Organizations”, Annual Review of Economics, 10: 439–463.

Ashraf, N, O Bandiera, and B K Jack (2014), “No margin, no mission? A field experiment on incentives for public service delivery”, Journal of Public Economics, 120: 1–17.

Bandiera, O, I Barankay, and I Rasul (2009), “Social Connections and Incentives in the Workplace: Evidence From Personnel Data”, Econometrica, 77(4): 1047–1094.

Bandiera, O, M C Best, A Q Khan, and A Prat (2021), "The Allocation of Authority in Organizations: A Field Experiment with Bureaucrats", Quarterly Journal of Economics, 136(4): 2195–2242.

Bénabou, R, and J Tirole (2006), “Incentives and Prosocial Behavior”, American Economic Review, 96(5): 1652–1678.

Bertrand, M, R Burgess, A Chawla, and G Xu (2020), “The Glittering Prizes: Career Incentives and Bureaucrat Performance”, Review of Economic Studies, 87(2): 626–655.

Bertrand, M, and A Schoar (2003), “Managing with Style: The Effect of Managers on Firm Policies”, Quarterly Journal of Economics, 118(4): 1169–1208.

Besley, T, and M Ghatak (2018), “Prosocial Motivation and Incentives”, Annual Review of Economics, 10: 411–438.

Bosio, E, S Djankov, E Glaeser, and A Shleifer (2022), “Public Procurement in Law and Practice”, American Economic Review, 112(4): 1091-1117.

Brown, C, and T Andrabi (2021), “Inducing Positive Sorting through Performance Pay: Experimental Evidence from Pakistani Schools”, Working Paper.

Callen, M, S Gulzar, A Hasanain, M Y Khan, and A Rezaee (2023), “The Political Economy of Public Sector Absence”, Journal of Public Economics, 218: 104787.

Chaudhury, N, J Hammer, M Kremer, K Muralidharan, and F H Rogers (2006), "Missing in action: teacher and health worker absence in developing countries", Journal of Economic Perspectives, 20(1): 91-116.

Dahis, R, L Schiavon, and T Scot (2023), “Selecting Top Bureaucrats: Admission Exams and Performance in Brazil”, Review of Economics and Statistics, 1-47.

Dal Bó, E, F Finan, N Y Li, and L Schechter (2021), “Information Technology and Government Decentralization: Experimental Evidence From Paraguay”, Econometrica, 89(2): 677–701.

De Zwart, F (1994), The Bureaucratic Merry-Go-Round: Manipulating the Transfer of Indian Civil Servants, Leiden University Press.

Deserranno, E, S Caria, P Kastrau, and G Leon (2022), "The Allocation of Incentives in Multi-Layered Organizations", Working Paper.

Dewatripont, M, I Jewitt, and J Tirole (1999a), “The economics of career concerns, part I: Comparing information structures”, Review of Economic Studies, 66(1): 183–198.

Dewatripont, M, I Jewitt, and J Tirole (1999b), “The economics of career concerns, part II: Application to missions and accountability of government agencies”, Review of Economic Studies, 66(1): 199–217.

Dhaliwal, I, and R Hanna (2017), "The devil is in the details: The successes and limitations of bureaucratic reform in India", Journal of Development Economics, 124: 1-21.

Downs, A (1965), “A theory of bureaucracy”, American Economic Review, 55(1/2): 439–446.

Duflo, E, R Hanna, and S P Ryan (2012), “Incentives work: Getting teachers to come to school”, American Economic Review, 102(4): 1241–78.

Fenizia, A (2022), “Managers and Productivity in the Public Sector”, Econometrica, 90(3): 1063-1084.

Glewwe, P, N Ilias, and M Kremer (2010), “Teacher incentives”, American Economic Journal: Applied Economics, 2(3): 205–27.

Gulzar, S, and B J Pasquale (2017), “Politicians, bureaucrats, and development: Evidence from India”, American Political Science Review, 111(1): 162–183.

Holmstrom, B (1982), “Moral hazard in teams”, The Bell Journal of Economics, 324–340.

Holmstrom, B, and P Milgrom (1991), “Multitask principal-agent analyses: Incentive contracts, asset ownership, and job design”, Journal of Law, Economics, & Organization, 7: 24.

Holmstrom, B, and J Tirole (1989), “The theory of the firm”, Handbook of Industrial Organization, 1: 61–133.

Hood, C (1995), “The “new public management” in the 1980s: Variations on a theme”, Accounting, Organizations and Society, 20(2-3): 93–109.

Iyer, L, and A Mani (2012), “Traveling Agents: Political Change and Bureaucratic Turnover in India”, Review of Economics and Statistics, 94(3): 723–739.

Jia, R (2017), “Pollution for Promotion”, Working Paper.

Khan, A Q, A I Khwaja, and B A Olken (2016), “Tax Farming Redux: Experimental Evidence on Performance Pay for Tax Collectors”, Quarterly Journal of Economics, 131(1): 219–271.

Khan, A Q, A I Khwaja, and B A Olken (2019), “Making Moves Matter: Experimental Evidence on Incentivizing Bureaucrats through Performance-Based Postings”, American Economic Review, 109(1): 237–270.

Khan, M Y (2020), “Mission Motivation and Public Sector Performance: Experimental Evidence from Pakistan”, Working Paper.

Lazear, E P (2000), “Performance Pay and Productivity”, American Economic Review, 90(5): 1346–1361.

Leaver, C, O Ozier, P Serneels, and A Zeitlin (2021), “Recruitment, effort, and retention effects of performance contracts for civil servants: Experimental evidence from Rwandan primary schools”, American Economic Review, 111(7): 2213-2246.

Limodio, N (2021), “Bureaucrat Allocation in the Public Sector: Evidence from the World Bank”, The Economic Journal, 131(639): 3012-3040.

Mbiti, I, K Muralidharan, M Romero, Y Schipper, C Manda, and R Rajani (2019), “Inputs, Incentives, and Complementarities in Education: Experimental Evidence from Tanzania”, Quarterly Journal of Economics, 134(3): 1627-1673.

Mehmood, S (2021), “The Impact of Presidential Appointment of Judges: Montesquieu or the Federalists?”, AMSE Working Paper No. 2118.

Muralidharan, K, and V Sundararaman (2011), “Teacher Performance Pay: Experimental Evidence from India”, Journal of Political Economy, 119(1): 39–77.

Neal, D, and D W Schanzenbach (2010), “Left behind by design: Proficiency counts and test-based accountability”, Review of Economics and Statistics, 92(2): 263–283.

Pop-Eleches, C, and M Urquiola (2013), “Going to a Better School: Effects and Behavioral Responses”, American Economic Review, 103(4): 1289-1324.

Prendergast, C (1999), “The Provision of Incentives in Firms”, Journal of Economic Literature, 37(1): 7–63.

Rasul, I, and D Rogger (2018), “Management of Bureaucrats and Public Service Delivery: Evidence from the Nigerian Civil Service”, The Economic Journal, 128(608): 413–446.

Spenkuch, J L, E Teso, and G Xu (2023), “Ideology and Performance in Public Organizations”, Econometrica, forthcoming.

Tirole, J (1994), “The Internal Organization of Government”, Oxford Economic Papers, 49(1): 1-29.

Williamson, O E (1979), “Transaction-Cost Economics: The Governance of Contractual Relations”, Journal of Law Economics, 22(2): 233–261.

Wilson, J Q (1989), Bureaucracy: What Government Agencies Do and Why they Do it, New York: Basic Books.

Xu, G (2018), “The Costs of Patronage: Evidence from the British Empire”, American Economic Review, 108(11): 3170–98.

Introduction - Bureaucracy

Improving bureaucrat performance through selection

Contact VoxDev

If you have questions, feedback, or would like more information about this article, please feel free to reach out to the VoxDev team. We’re here to help with any inquiries and to provide further insights on our research and content.

Bureaucracy

Guo Xu

Difficulties of measuring performance

Multi-tasking and implementation challenges

Non-monetary incentives

References

Stay up to date