World Wide Web: LyonLabs, LLC and Barrett Lyon, licensed under CC BY-NC 4.0
Visual representation of the Internet

Unit 21 Innovation, information, and the networked economy

Innovations that enhance our wellbeing are a hallmark of capitalism. Making the most of human creativity and inventiveness is a public policy challenge

  • Innovation depends on many factors: the state of knowledge, individual creativity, public policy, economic institutions, and social norms.
  • Individuals or companies who introduce socially beneficial innovations are rewarded with profits above the opportunity cost of capital, referred to as innovation rents.
  • Innovation rents are eventually competed away by imitators who spread the new knowledge by using it.
  • The production and use of new knowledge is unusual in three ways: knowledge is a non-rival good, producing new knowledge is initially costly, but once produced it can be distributed and used for free, and innovations generally become more useful as more people use them.
  • Innovating firms often have little immediate competition and can profit by setting prices far above the marginal costs of production, which disadvantages consumers.
  • But innovating firms still cannot capture all of the benefits their innovations generate, so may invest too little in innovation.
  • Public policy therefore seeks to spread socially beneficial innovations, while at the same time providing adequate rewards for those producing innovations.
  • Given this trade-off, intellectual property rights can be either ‘too strong’, preventing new innovations from spreading, or ‘too weak’, providing innovation rents that are too small to sufficiently reward innovators.
  • Digital technologies support ‘two-sided markets’ like Facebook, eBay, and Airbnb, which match individuals who can mutually benefit from exchanges.
  • These technologies have altered the nature of economic competition, but exhibit many of the same market failures observed in the production of knowledge.

Around the turn of the present century, South Africa had one of the highest rates of people living with HIV in the world: about 5 million South Africans, 1 in 10 of the population, were HIV positive. But in 1998, Bristol-Myers Squibb, Merck, and 37 other multinational pharmaceutical companies brought a lawsuit against the government of South Africa, seeking to pre­vent it from importing generic (non-brand name) drugs, other inexpensive antiretroviral drugs, and other AIDS treatments from around the world.

Street protests erupted in South Africa, and both the European Union and the World Health Organization announced their support for the South African government’s position. Al Gore, then US vice president, who had represented the interests of pharmaceutical companies in negotiations with South Africa, was confronted by AIDS activists chanting, ‘Gore’s greed kills!’ In September 1999, the US government—previously the drug companies’ strongest ally—said that it would not impose sanctions on poor countries that are affected by the HIV epidemic, even if US patent laws were broken, so long as the countries abided by international treaties governing intellectual property. The pharmaceutical giants pushed back, engaging an army of intellectual property rights lawyers to promote their case. They closed factories in South Africa and cancelled planned investments.

But three years later, with millions of dollars spent on litigation and with the even greater cost to their reputations, the companies backed down (even paying the South African government’s legal fees). Jean-Pierre Garnier, the chief executive officer of GlaxoSmithKline, telephoned Kofi Annan, secretary general of the United Nations, to ask him to help make a deal with President Thabo Mbeki of South Africa. ‘We’re not insensitive to public opinion. That is a factor in our decision-making,’ Garnier later explained.1 2

It was too late: the damage had already been done. ‘This has been a public relations disaster for the companies,’ commented Hemant Shah, an industry analyst. ‘The probability of any drug company suing a developing country on a life-saving medicine is now extremely low based on what they learned in South Africa.’

Of course, pharmaceutical company owners cannot sell an AIDS treatment at less than what it cost them to manufacture it and still stay in business. Moreover, few of the industry’s research projects lead to a marketable product (research in 2016 estimated the industry’s success rate as just over 4%). The sales of a successful product must therefore cover the costs of many failed projects because, of course, it is impossible to predict which research projects will succeed.

In this instance, the drug companies went to court in South Africa to protect their patents. In the pharmaceutical industry, the patent system gives the innovating company a time-limited monopoly that allows the company to charge a price much higher than the cost of producing the drug (sometimes by a factor of 10) during the years of patent protection. The prospect of high profits provides an incentive for companies to invest in risky research and development.

By creating a government-imposed monopoly, patent protection often conflicts with the equally important objective of making goods and services available at their marginal cost (recall from Unit 7 that a monopolist will set a price above the marginal cost). The high price—sufficient to cover the cost of research and development, including investments on failed projects—means that many of those who could benefit from access to the drug will not get it. This is an example of the deadweight efficiency losses resulting from monopoly pricing studied in Unit 7.

Conflicts between competing objectives—in this case, the production of new knowledge on the one hand and its rapid diffusion on the other—are unavoidable in the economy, and are particularly difficult to resolve when they concern innovation, as we will see.

But sometimes, new technologies allow for win-win outcomes.

Recall the problem of the fishermen and fish buyers of Kerala that we described at the beginning of Unit 11. On returning to port to sell their daily catch of sardines to fish dealers, fishermen often found that there was excess supply in the market. The result was higher prices for the consumer, on average, and lower incomes for the fishermen.3 4

law of one price
Holds when a good is traded at the same price across all buyers and sellers. If a good were sold at different prices in different places, a trader could buy it cheaply in one place and sell it at a higher price in another. See also: arbitrage.

When the fishermen got mobile phones, they would phone the many coastal fish markets from out at sea, and pick the one where the prices that day were highest. The mobile phone made it possible to implement the law of one price in Keralan fish markets, to the benefit of fishermen and consumers. It was not entirely win-win, however. The mobile phones greatly increased the competition among the dealers who were the intermediaries between fishermen and buyers, because a fisherman could bargain for higher prices before choosing which market to enter. The dealers were the losers from this innovation.

But the mobile phone had much weaker effects in other parts of the world, such as Uttar Pradesh and Rajasthan in India, where the lack of roads and storage facilities prevented farmers from profiting from information on price differences. A small farmer in Allahabad remarked that price information that he could get on his phone was not worth much to him because there were ‘no roads to go there’. In this case the innovation was of little use, because of a lack of public investment in the necessary infrastructure.

Similarly, when mobile phones came to Niger, in West Africa, farmers lacked the means to transport their cowpeas and other crops to alternative markets, and so traders who transported the goods took much of the benefit. The fishermen did not face this problem because the boats used to catch the fish were also a means of transport, allowing them to choose among markets.

In this unit, we will show how economic concepts can make sense of the South African government’s policies to make AIDS treatments more widely available, the conflict that the policies caused, and the contrasting impact of the mobile phone on fishermen in Kerala and farmers in other Indian states.

To understand innovation, you will have to forget about the image of an eccentric inventor, working alone, creating a ‘better mousetrap,’ and getting rich as a reward for the public benefit of his inspiration. Innovation is not a one-off event set off by a spark of genius. Instead:

  • Innovation is a process: It is a fundamental source of change in our life that itself is constantly undergoing change.
  • Innovation is also systemic: It connects networks of users, private firms, individuals, and government bodies.

We discuss innovation as a process and as a system in the next two sections.

Exercise 21.1 Patents and innovation in the pharmaceutical industry

  1. According to Scherer in the ‘Economist in action’ video, what are the key features of the pharmaceutical market that differentiate it from other markets?
  2. According to the video, what prevented the same drug from being made available in both high and low-income countries, and how was this issue resolved?

21.1 The innovation process: Invention and diffusion

innovation
The process of invention and diffusion considered as a whole.
invention
The development of new methods of production and new products.
diffusion
The spread of the invention throughout the economy. See also: diffusion gap.
process innovation
An innovation that allows a good or service to be produced at lower cost than its competitors.
product innovation
An innovation that produces a new good or service at a cost that will attract buyers.
innovation rents
Profits in excess of the opportunity cost of capital that an innovator gets by introducing a new technology, organizational form, or marketing strategy. Also known as: Schumpeterian rents.

We begin with a few new terms. We use the word innovation to refer to both the development of new methods of production and new products (invention) and the spread of the invention throughout the economy (diffusion). An innovating firm can produce a good or service at a cost lower than its competitors, or a new good at a cost that will attract buyers. The first is called a process innovation and the second is called a product innovation.

Invention and innovation

The descriptive term invention is sometimes reserved for major breakthroughs, but we use it to refer to:

Radical innovation

Radical innovation introduces a brand new technology or idea that had not been previously available. The invention of incandescent lighting (producing light by running electricity through a filament) was a major advance over light made by burning oil or kerosene. The MP3 format allowed music to be compressed in a manner that enabled easy storage on hard drives and transmission over the Internet, offering a vastly different way to store music than CD or vinyl.

Incremental innovation

This improves an existing product or process cumulatively. After Edison and Swan patented their designs for the incandescent electric light bulb in 1880 and started working together in 1883, all subsequent improvements in the filament that generates the light were incremental innovations in light­ing. You have already learned about the incremental improvement of the spinning jenny, one of the major inventions of the Industrial Revolution, which began with just eight spindles and eventually operated hundreds.5 6

Many of the concepts that are useful for the study of innovation have already been introduced in earlier units. They are listed in Figure 21.1, and you will encounter them again throughout this unit. Before going on, make sure you understand these concepts.

Concepts Previously in Units
Innovation rents 1, 2
External effects and public goods 4, 12
Strategic interactions 4, 5, 6
Property rights, including IPR 1, 2, 5, 12
Economies of scale 7
Complements and substitutes 7, 16
Mutual gains and conflicts over their distribution 5
Creative destruction 2, 16
Institutions and social norms 4, 5, 16

Figure 21.1 Concepts relevant to innovation that you have studied.

Recall from Unit 2 that at the going price, a company introducing a successful invention makes profits in excess of the profits that other firms make, termed innovation rents. In Figure 21.2, the research, development, and implementation costs of undertaking an innovation are shown along with the temporary innovation rents (profits above the opportunity cost of capital) from a successful invention.

Diffusion

The prospect of these innovation rents then induces others to try to copy the invention. If they are successful, the temporary rents of the innovator are eventually entirely competed away. The result of this copying process is that eventually the initial innovator will again earn profits that just cover the opportunity cost of capital, so economic profit returns to zero.

Latecomers are also eventually pushed to adopt the innovation, because the falling prices that result when the new methods become widely adopted typically mean that sticking with the old technology is a recipe for bank­ruptcy. A firm that does not innovate will make negative economic profits, meaning that its revenues fail to cover the opportunity cost of capital. This carrot-and-stick combination of the promise of rents from successful innovation and the threat of bankruptcy if firms fail to keep up with innov­ators has proved a powerful force in reducing the amount of labour required to produce goods and services, thereby raising our living standards.

In this diagram, the horizontal axis displays time. The vertical axis displays cost and profits per unit of time. A horizontal line above the horizontal axis shows zero economic profits. Cost and profits is a step function that is negative during the period of invention, above the zero economic profit line after invention, and on the zero economic profit line after the invention is copied by others. The area between the horizontal axis and the lower portion of the step function is the cost of invention. The area between the upper portion of the step function and the zero economic profits line is innovation rents.
Fullscreen

Figure 21.2 The costs and rents associated with innovations.

general-purpose technologies
Technological advances that can be applied to many sectors, and spawn further innovations. Information and communications technology (ICT), and electricity are two common examples.

Although there have been inventions throughout human history, the acceleration of the innovation process started in England around 1750 (as we saw in Unit 2) with some key new technologies introduced in textiles, energy, and transportation. It did not end with the Industrial Revolution. Important new technologies with applications to many industries such as the steam engine, electricity, and transportation (canals, railroads, bicycles, airplanes) are called general-purpose technologies.

William Nordhaus, an economist whose analysis of the discount rate applied to environmental problems you read about in Unit 20, has estimated the speed of computation using an index that has a value of 1 for the speed of a computation done by hand (like dividing one number by another). For example, in 1920 a Japanese abacus master could perform computations 4.5 times faster than a mathematically competent person could do the same calculation by hand. This difference had probably been constant for many centuries, because the abacus is an ancient computational device.

But sometime around 1940, computational speed takes off. The IBM 1130 introduced in 1965 was 4,520 times faster than hand computation (and as you can see, it was below the line of best fit through the data points from 1920).

In this diagram, the horizontal axis displays years from 1800 to 2025. The vertical axis displays an index of computations per second in ratio scale and ranges from 1 to 10,000,000,000,000,000, so that the index increases tenfold at each consecutive step on the axis. Data for different technologies is shown. The two variables are positively correlated. The line of best fit has equation y = 8 times capital E - 20 times e to the 0.2495x.
Fullscreen

Figure 21.3 Innovation in computing power: Index of computing speed. Particular examples are shown in colour and labelled.

William D. Nordhaus. 2007. ‘Two Centuries of Productivity Growth in Computing’. The Journal of Economic History 67 (01), Index updated to 2010.

innovation system
The relationships among private firms, governments, educational institutions, individual scientists, and other actors involved in the invention, modification, and diffusion of new technologies, and the way that these social interac­tions are governed by a combination of laws, policies, know­ledge, and social norms in force.

The most recent entry in Figure 21.3, the SiCortex supercomputer, performs more than 1 billion computations per second. It is more than a quadrillion (count the zeros) times faster than you, and it is well above a line of best fit through the data points from 1920, so there is no indication that the process is slowing down.

But as the ‘When economists disagree’ box shows, engineers and economists disagree over whether improvements in computation or any other technology will continue at the pace given in Nordhaus’s chart, or instead will return to the modest pace of improvement that prevailed over most of human history.7

The stepped line in Figure 21.2 illustrated a simple theory of innovation and diffusion of technical progress. It clarifies how innovation rents, costs of innovation, and the copying of innovations are interrelated from the stand­point of a firm or individual that wants to develop a new product or process.

To understand this process, we need to know how inventions actually happen, how the costs and rents are decided, and when the process of copying takes place. To do this, we have to go beyond the point of view of the single firm in Figure 21.2 and think of innovation as the product of interactions among firms, the government, educational institutions, and many other players in the innovation system.

When economists disagree The end of the permanent technological revolution?

We began Unit 1 with the Industrial Revolution, the capitalist revolution, and history’s hockey sticks of rapid technological progress. In Unit 2, we explained how these advances translated into improvements in wellbeing. And we have just seen the dramatic (and possibly even accelerating) rate of technical advance in computation.

In Unit 16, we studied the long-run trend for the economy to produce more services relative to goods. If service productivity grows more slowly than manufacturing productivity, the shift from goods to services reduces overall productivity growth in the economy.

Will this limit the ability of technological progress to increase labour productivity at the rate that has occurred since the Industrial Revolution, and especially during the golden age of capitalism? It seems appropriate to begin this unit with the disagreement among economists about whether the ‘permanent’ technological revolution is ending.

In this diagram the horizontal axis displays years from 1400 to 2021. The vertical axis displays growth rates as 50-year (for GDP growth) and 20-year (for labour productivity) centered moving averages, and ranges from -0.5% to 3%. Series for GDP per capita in the UK and labour productivity in the US are shown. GDP per capita experienced virtually no growth until 1650, after which it fluctuated between 0% and 1.5% until the middle of the 20th century. During the late 20th centuries and early 21st century it fluctuated between 1.5% and 2.5%. Labour productivity in the US decreased from 2.5% in 1950 to 1.4% in the late 1990s, increased to 2% in 2000 and decreased to 1.5% in 2012.
Fullscreen

Figure 21.4 The growth rate of productivity over the long run (1400–2021).

Jutta Bolt and Jan Juiten van Zanden. 2013. ‘The First Update of the Maddison Project Re-Estimating Growth Before 1820’. Maddison-Project Working Paper WP-4, January; Stephen Broadberry. 2013. ‘Accounting for the Great Divergence’. London School of Economics and Political Science. The Conference Board. 2021. Total Economy Database.

Figure 21.4 shows the best available data on the advance of productivity of labour in the UK since 1400, and also for the US for the period in which the US has been the global technology leader. Robert Gordon, an economist who specializes in productivity and growth, has written extensively about productivity growth and its effects, particularly in the first chapter of his book The Rise and Fall of American Growth. He points to the downturn in the productivity growth rate series at the end of the period in the chart.

Gordon believes that the rapid growth era from the first half of the twentieth century is long gone, and slower growth lies ahead of us. In contrast, Erik Brynjolfsson and Andrew McAfee, both economists, advance the view that digital technology is opening up a ‘second machine age’. In a video broadcast by Swiss National Television and its second part, they explain their points of view.

Exercise 21.2 The permanent technological revolution

Use all the sources above, as well as Thomas Edsall’s New York Times article ‘Boom or Gloom’ and Lee Koromvokis’ PBS Newshour article ‘Are the best days of the U.S. economy over?’, to answer the following questions:

  1. According to Gordon, Brynjolfsson, and McAfee, which other factors, apart from technological innovation, affect the rate of GDP per capita growth? Why might it take a long time for today’s innovations to affect the economy’s growth rate?
  2. How well do you think GDP per capita growth measures the effect of innovation? Suggest alternative ways to measure the effects of innovation.
  3. According to Brynjolfsson and McAfee, how will technological progress affect inequality? Use the data and models from Units 16 and 19 to discuss whether you agree with Brynjolfsson and McAfee’s analysis of the relationship between technological progress and inequality.
  4. In this unit, we discuss how policies and institutions can help the process of innovation. How can policies and institutions also help the economy adjust to the effects of innovation?

Question 21.1 Choose the correct answer(s)

Which of the following statements regarding innovation is correct?

  • An innovation is the development of new methods of production and new products. The spread of these is not innovation.
  • A product innovation is when a firm produces a good or service at cost lower than its competitors.
  • A process innovation is when a firm produces a new good at a cost that will attract buyers.
  • Innovation comprises of both invention and diffusion.
  • The spreading of invention, known as diffusion, is also part of innovation.
  • This is process innovation.
  • This is product innovation.
  • This is the definition of innovation.

21.2 Innovation systems

Innovative activities are not spread evenly across the globe or even across a country. Think of the area now known as Silicon Valley in California, once a sleepy farming area centred on Santa Clara Valley. Silicon Valley got its nickname when high-growth firms in computing and semiconductor design moved in, later joined by innovators in biotech. In 2010, in a single US postal area (ZIP code 95054) in the centre of Silicon Valley, 20,000 patents were registered. Patent attorneys cluster in this part of Santa Clara. If this small area of 16.2 km2 were a country, it would have ranked 17th in the world in patents in 2010.8

The outpouring of patents from Silicon Valley is a measure of its output of what is termed codified knowledge, meaning knowledge that can be written down. But much of the knowledge produced cannot be written down, or at least not exactly. This non-codifiable knowledge is termed tacit knowledge.

The difference between codified and tacit knowledge can be illustrated this way. A recipe for a cake can be written down, as it is codified knowledge, but being able to read the recipe and follow it exactly does not get you a reputation for being an outstanding cook; on the other hand, the tacit knowledge of an exceptional chef is not something that you can easily write in a book.

codified knowledge
Knowledge that can be written down in a form that would allow it to be understood by others and reproduced, such as the chemical formula for a drug. See also: tacit knowledge.
tacit knowledge
Knowledge made up of the judgements, know-how, and other skills of those participating in the innovation process. The type of knowledge that cannot be accurately written down. See also: codified knowledge.

The importance of tacit knowledge is demonstrated in the destruction and re-emergence of the German chemical industry. After the First World War and again after the Second World War, German chemical companies had their factories in Germany disassembled and their facilities in the US and UK expropriated. All that remained were key personnel.

Had all of the necessary knowledge to build a modern chemical industry been codified, there is no particular reason why Germany should have resumed its leadership in this field. Any country with a large scientific and engineering labour force could have created the industry using the available codified knowledge, more or less like the cook following a recipe. But using their know-how and experience (the tacit knowledge), German companies nevertheless managed to resume dominant positions in some markets.

Silicon Valley is as famous for its tacit knowledge as it is for its patented codified knowledge. The extraordinary concentration of innovative businesses in Silicon Valley reflects the importance of external effects and public goods in the production and application of new technologies. The two words ‘Silicon Valley’ no longer just refer to a place. They now represent a particular way that innovation gets done. Silicon Valley has become associated with an innovation system.

As well as the legal institutions that protect codifiable knowledge and that govern how easily holders of tacit knowledge can move between firms, an innovation system includes financial institutions such as venture capital funds, banks, or technology-oriented firms that will finance projects that seek to commercialize innovations.

Different countries provide quite different innovation systems that often co-evolve with industries in which they specialize. For example, radical innovation is more prevalent in the US, where labour can move easily between firms and venture capital is well developed, and incremental innovation is more prevalent in Germany, where ties of workers to firms are stronger and finance for innovation comes from retained profits and banks rather than from venture capital.

non-compete contract
A contract of employment containing a provision or agreement by which the worker cannot leave to work for a competitor. This may reduce the reservation option of the worker, lowering the wage that the employer needs to pay.

Even within the US, Silicon Valley was unusual. During the 1960s, Silicon Valley was a minor player in technology compared to the Route 128 concentration near Boston, Massachusetts, which benefited from proximity to Harvard and MIT. But Route 128 differed from Silicon Valley in important ways, including the use of non-compete contracts that prohibited anyone leaving one firm from taking up employment with a competing firm, as a way of protecting information that a firm produced:

  • The state of Massachusetts enforced non-compete contracts: This limited inter-firm mobility and the information-sharing that resulted from it.
  • The state of California took the opposite position: It outlawed non-compete contracts, saying that: ‘Every contract by which anyone is restrained from engaging in a lawful profession, trade, or business of any kind is … void.’ The resulting circulation of engineers among firms in Silicon Valley promoted the rapid diffusion of new knowledge among firms.

The Silicon Valley innovation system

Why is innovation concentrated in Silicon Valley? Institutions and incentives reinforce each other to produce a radical innovation cluster. The Silicon Valley model is one of highly mobile entrepreneurs, investors, and employees linked within a small geographical area, with support from government and educational institutions.9 10

The Silicon Valley system consists of:

  1. Innovating firms: Most innovation takes place in firms specializing in producing new methods or products (start-ups) rather than in existing firms that produce goods and services.
  2. Other innovating institutions: In a partnership that began early in the 1900s, two universities, one public (University of California at Berkeley) and the other private (Stanford University) work closely with firms to commercialize innovations. An industrial park was set up in 1951 at Stanford with major corporations like General Electric, IBM, and Hewlett Packard. University, government, and private R&D labs are co-located in the Valley, even including the R&D centre of Walmart, the retail giant.
  3. Government: Military research in electronics and high-energy physics was funded at the universities and in private firms in the area, starting in the run-up to the Second World War. During the Cold War (from the end of the Second World War until the 1990s), this continued with Lockheed Missiles and Space the largest employer in the Valley. A change in the law in 1980, called the Bayh-Dole Act, enabled universities to gain ownership of their output and commercialize it even if the federal government had helped fund it. This brought private investors into the network.
  4. Social norms: A social norm for high-risk, high-return behaviour, which some say has its origins in the speculators who flooded into California to mine for gold in the nineteenth century, sustains a culture of serial entrepreneurship. Failed innovators can start again with a new idea. High firm failure rates and other reasons for employee mobility across firms distribute the tacit knowledge acquired in one firm to other firms. Some have concluded that this unintentional sharing of information among firms was key to Silicon Valley’s success.
  5. Finance: Entrepreneurs at an early stage will pitch their project to venture capital (VC) investors. When the VCs decide to invest and take a substantial ownership stake, usually for a period of 12 to 18 months, it creates strong incentives for the startup to grow rapidly and, if successful, means the VC investor can exit with a high rate of profit. The funding model for startups is a rapid-paced cycle of pitching a new business idea to investors, which is based on the commercialization of an invention, followed by recruiting key employees (often with earnings linked to the value of the firm when it is sold), market growth, and seeking more cash. Founders, investors, and employees all understand that failure is likely. Funders still benefit, because the few successful ventures produce large returns that compensate for many losses.

The German innovation system

Innovation in the US is concentrated in industries whose patents heavily cite scientific articles. This is one indicator of radical innovation. By contrast, the very successful export industries of Germany rely on incre­mental innovation, where patents are much less intensive in scientific citations and tacit knowledge tends to be more important. Networks are also crucial to the German innovation system but they work differently from those in Silicon Valley. Like Silicon Valley, innovation is concentrated geographically, with centres around Munich and Stuttgart in southwest Germany.

The German system consists of:

Read the introduction to: Peter A Hall and David Soskice. 2001. Varieties of Capitalism: The Institutional Foundations of Comparative Advantage. New York, NY: Oxford University Press.

  1. Innovating firms: Incremental innovation takes place in medium-size and large long-lived companies in Germany, and relies on long-term relationships between employers and workers, between firms and banks, and among firms linked through production relationships and ownership and control ties. To succeed in introducing new technology, firms face many coordination problems, which can be solved through cooperative and competitive relationships with employees, other firms, and banks.
  2. Government: The government supports the training of highly skilled workers through a government-subsidized apprenticeship system, which is supervised by industry associations. This system reduces training costs for firms and ensures high quality training. Apprentices contribute by accepting low training wages. Large firms are required to have elected bodies to represent workers in negotiations with managers. They help to devise ways to exploit all possible mutual gains and to distribute these gains in a way acceptable to all.
  3. Innovators: Skilled workers are needed for the successful introduction of process and product innovations. To make this possible, young people need to be assured of long-term, high-wage employment before they are willing to commit to multi-year apprenticeships. Similarly, workers engaging in innovation that could result in job cuts need to be assured that they will not lose their jobs. The vocational training scheme addresses these issues in a number of ways. As discussed above, the government sponsors and subsidizes high-quality apprenticeships. Training schemes are also certified. This assures trainees that their skills are valuable outside the firm, improving their reservation position should their job end, and helping to ensure high wages as long as the job continues.
  4. Social norms: Incremental innovation (for example, in the bicycle industry) requires industry-wide standards to make technology transfer easier. Long-term relationships and cross-ownership among firms are essential for facilitating technology transfer, because long-term employment contracts mean that the Silicon Valley-style technology transfer that occurs when workers move from one firm to another is much less common. Similarly, the assurance that firms’ highly trained workers will not be poached is not achieved through laws but by norms that are widely respected by the otherwise highly competitive firms.
  5. Finance: The system of ownership of large German firms differs sharply from that of US or UK firms. Takeovers are easier in the US or the UK, and allow for rapid changes in the use of the assets of a firm. Because ownership of firms is much more concentrated in Germany, it is virtually impossible for a hostile takeover—that is, one opposed by the management—to occur. Therefore, long-run inter-firm collaboration over technology development is possible, and industry-wide standards are easier to set. The financing of innovation in Germany comes from retained profits (profits not distributed to shareholders) and bank loans. Long-term finance provides reassurance for trainees who invest in acquiring company-specific skills, as well as for other companies investing in related technology developments.

Figure 21.5 compares the two systems. Both are successful, but in different ways. Silicon Valley-based firms dominate important digital technologies (ICT) associated with the latest general-purpose technology, while the German firms making up its distinctive innovation system have managed to sustain a much higher level of well-paid industrial jobs in the face of global competition, compared to the US or any other country outside of East Asia.

  Silicon Valley German innovation system
Innovation Radical codified, especially in ICT Incremental tacit, especially in capital goods and transport equipment
Innovating firms Entrepreneurial innovation specialists Established industrial and other firms
Government Military contracts, higher education Subsidies for training workers
Innovators Engineers, scientists, universities Skilled workers and engineers
Social norms Competitive; risk-taking Cooperative; risk-pooling
Finance Venture capital Bank loans, retained earnings
Property rights Patents of more importance Non-patent forms of protection of more importance

Figure 21.5 Two innovation systems: Silicon Valley and Germany.

The economics of innovation systems

Successful innovation can contribute to rising living standards by expanding the set of products available to consumers, and by reducing the prices of existing products. However, many societies struggle to innovate. Economist Lisa Cook of Michigan State University asks why the transition to capitalism in Russia in the 1990s did not spark a wave of innovation. She documents the late 19th century inventions contributed by African American inventors, including gas masks, traffic lights, and light bulb technology and how this burst of innovations was cut short by a wave of attacks and anti-black mob violence. Her insights on the political and economic conditions under which innovation will flourish are relevant to understanding the vast differences across the world today in the extent of innovation.

Compare the amount of innovation in capitalist economies to the amount in centrally planned economies of the Soviet Union and its allies during the twentieth century. In a list of 111 major non-military product and process innovations between 1917 and 1998, only one—synthetic rubber—came from Soviet bloc countries. Scholars have suggested that an important factor contributing to the collapse of the Soviet planned economies was the Communist Party’s failure to deliver innovation in consumer goods, which eroded the legitimacy of its rule.11

The successful capitalist innovation systems in Silicon Valley and Germany have two things in common:

  • The innovation system is not based on individual creativity: A single firm or an inventor relies on the relationships among all of the actors—owners, employees, governments, and sources of finance. Regions without these support networks are less successful at innovation.
  • There is a helping hand as well as an ‘invisible hand’: Successful innovation systems involve profit-seeking competition among individuals and firms, but the government also plays an essential role—military contracts in Silicon Valley and worker training in Germany, for example.

In the next three sections, we explore three aspects of invention and diffusion that make the innovation process a challenge to public policy, and why it has proven so difficult for other localities to copy the Silicon Valley or German innovation systems.

external effect
A positive or negative effect of a production, consumption, or other economic decision on another person or people that is not specified as a benefit or liability in a contract. It is called an external effect because the effect in question is outside the contract. Also known as: externality. See also: incomplete contract, market failure, external benefit, external cost.
public good
A good for which use by one person does not reduce its availability to others. Also known as: non-rival good. See also: non-excludable public good, artificially scarce good.
economies of scale
These occur when doubling all of the inputs to a production process more than doubles the output. The shape of a firm’s long-run average cost curve depends both on returns to scale in production and the effect of scale on the prices it pays for its inputs. Also known as: increasing returns to scale. See also: diseconomies of scale.

These aspects are:

  • External effects and the problem of coordination among innovators*: A firm’s successful invention almost always has either positive or negative effects on the value of other firms’ investments in the innovation process. Owners of a firm who are concerned solely about their profits will fail to take into account these external effects.
  • Public goods: Innovation can be seen as the production of new knowledge by the use of a combination of old knowledge and creativity. The fact that most forms of knowledge are non-rival—making it available to an additional user does not mean that some current user will be deprived of its use—makes the innovation process one that uses public goods to produce other public goods.
  • Economies of scale and winner-take-all competition: Big is beautiful when it comes to the knowledge-based economy. Average costs fall as more units of a good or service are provided, and this means that firms entering a market first often can take the entire market, at least temporarily.

Recall from Unit 12 that these three characteristics are all sources of market failure. Simply letting market competition regulate the process of innovation will generally not result in an efficient outcome. These same three aspects of the innovation process also pose challenges to governments that seek to address these market failures. This is because governments may lack the necessary information (or the motivation) to develop appropriate policies.

We begin with a model of the problem of external effects and the problem of coordination among innovators, simplified to just two firms considering investing in innovations, and a government that may assist in the innovation process.

Exercise 21.3 Comparing innovation systems

In this unit, we compared the Silicon Valley and German innovation systems.

Which of these two systems do you think would be more likely to be introduced and succeed in the country or region in which you are now living? Why or why not? (If you are in Germany, would the Silicon Valley system work where you are? If you are in California, would the German system work there?)

Question 21.2 Choose the correct answer(s)

Which of the following statements is correct regarding the Silicon Valley and German innovation systems?

  • The Silicon Valley innovation system is considered much more successful than the German innovation system.
  • Both Silicon Valley and German innovation systems rely on universities to provide highly skilled, and therefore highly paid, graduates.
  • The successes of Silicon Valley and German innovation systems are both due to the relationships among all of the actors (owners, employees, governments and financiers) that promote innovation.
  • Both Silicon Valley and German innovation systems benefit from a high level of financing from venture capitalists, whose high tolerance for business failure sustain a culture of entrepreneurship.
  • The two systems, though very different, are both regarded as very successful. Silicon Valley-based firms dominate important ICT technologies, while the German firms have managed to sustain a much higher level of well-paying industrial jobs despite global competition.
  • Whilst this statement is true for Silicon Valley, workers in Germany are trained through multi-year apprenticeships at low wages by the firms.
  • As explained in the text.
  • The financing of innovation in Germany comes from retained earnings and bank loans, which allows investment in more industry-specific skills.

21.3 External effects: Complements, substitutes, and coordination

complements
Two goods for which an increase in the price of one leads to a decrease in the quantity demanded of the other. See also: substitutes.
substitutes
Two goods for which an increase in the price of one leads to an increase in the quantity demanded of the other. See also: complements.

Innovations considered by a firm will typically either increase or decrease other firms’ profit levels, and affect those firms’ choices about innovation. Think about just two firms, each considering innovations that are either:

  • Complements: The value of one innovation is greater in the presence of the other. Tin cans were invented to store food in 1810 by Peter Durand, a British merchant, and the first canning factory began production in 1813. But the cans were very difficult to open and not widely used in the home until 1858, when Ezra Warner invented a simple can opener.
  • Substitutes: The two innovations are valuable alone, but less valuable when some other innovation has already occurred. A good example is the video format war during the 1980s between two competing standards, VHS and Betamax. Videos made using one format could not be played on machines designed to play the other. Either Sony Betamax or JVC’s rival VHS would have been a perfectly good single format for home video recording, but the introduction of both led to a costly rivalry.

In the absence of explicit government policies or private means of coordination among firms, the challenges posed by complementary innovations and substitute innovations are quite different:

  • When potential innovations are complements: Innovations sometimes do not occur even when it would have been socially beneficial, and profitable to the firms, if they had both occurred.
  • When potential innovations are substitutes: Both innovations sometimes occur, when having only one or the other would be more socially beneficial and profitable to the firms involved. Competition between substitutes may impose a high cost on both innovators.

We can use game theory to understand how two potential innovating firms interact strategically, and show why these contrasting problems arise and why they may be difficult to solve. (You may wish to review the introduction to game theory in Unit 4.)

Innovations that are complements

Here we have two hypothetical firms, Plugbike, which is considering developing a novel electric bicycle, and Bikeflex, which is weighing up the likely profits and costs of investing in a mobile network of battery exchanges. As above, the presence of Bikeflex makes Plugbike more valuable and vice versa, so they are complements. They will make their decisions (Innovate or Do not innovate) independently, but they know the profits and losses that will result in each of the four possible outcomes. They are given in the payoff matrix below. The row player is Plugbike, and its payoffs come first in each cell; the column player is Bikeflex, its payoffs are second in each cell. Positive numbers are profits for the company, while negative numbers are losses.

This diagram shows Plugbike and Bikeflex’s possible actions, which are to innovate or not to innovate. Payoffs are (Plugbike’s, Bikeflex’s). If both innovate, payoffs are (1, 1). If Plugbike innovates and Bikeflex does not innovate, payoffs are (-0.5, 0). If Plugbike does not innovate and Bikeflex innovates, payoffs are (0, -0.5). If neither innovates, payoffs are (0, 0). If Bikeflex innovates, Plugbike’s best response is to innovate. If Bikeflex does not innovate, Plugbike’s best response is to not innovate. If Plugbike innovates, Bikeflex’s best response is to innovate. If Plugbike does not innovate, Bikeflex’s best response is not to innovate. Both firms innovating and neither firm innovating are Nash equilibria.
Fullscreen

Figure 21.6 The decision to innovate when products are complements.

Begin with the row player
: This diagram shows Plugbike and Bikeflex’s possible actions, which are to innovate or not to innovate. Payoffs are (Plugbike’s, Bikeflex’s). If both innovate, payoffs are (1, 1). If Plugbike innovates and Bikeflex does not innovate, payoffs are (-0.5, 0). If Plugbike does not innovate and Bikeflex innovates, payoffs are (0, -0.5). If neither innovates, payoffs are (0, 0).
Fullscreen

Begin with the row player

Begin with the row player and ask: ‘What would be the best response to the column player’s decision to innovate?’

The best response
: This diagram shows Plugbike and Bikeflex’s possible actions, which are to innovate or not to innovate. Payoffs are (Plugbike’s, Bikeflex’s). If both innovate, payoffs are (1, 1). If Plugbike innovates and Bikeflex does not innovate, payoffs are (-0.5, 0). If Plugbike does not innovate and Bikeflex innovates, payoffs are (0, -0.5). If neither innovates, payoffs are (0, 0). If Bikeflex innovates, Plugbike’s best response is to innovate.
Fullscreen

The best response

The best response would be Innovate, since the payoff is 1 rather than 0. Place a dot in the top left-hand cell.

The row player’s response
: This diagram shows Plugbike and Bikeflex’s possible actions, which are to innovate or not to innovate. Payoffs are (Plugbike’s, Bikeflex’s). If both innovate, payoffs are (1, 1). If Plugbike innovates and Bikeflex does not innovate, payoffs are (-0.5, 0). If Plugbike does not innovate and Bikeflex innovates, payoffs are (0, -0.5). If neither innovates, payoffs are (0, 0). If Bikeflex innovates, Plugbike’s best response is to innovate. If Bikeflex does not innovate, Plugbike’s best response is to not innovate.
Fullscreen

The row player’s response

Then ask what the row player’s best response would be to the column player’s choice of Do not innovate: the answer is Do not innovate. Place a dot in the bottom right-hand cell.

The column player’s reasoning
: This diagram shows Plugbike and Bikeflex’s possible actions, which are to innovate or not to innovate. Payoffs are (Plugbike’s, Bikeflex’s). If both innovate, payoffs are (1, 1). If Plugbike innovates and Bikeflex does not innovate, payoffs are (-0.5, 0). If Plugbike does not innovate and Bikeflex innovates, payoffs are (0, -0.5). If neither innovates, payoffs are (0, 0). If Bikeflex innovates, Plugbike’s best response is to innovate. If Bikeflex does not innovate, Plugbike’s best response is to not innovate. If Plugbike innovates, Bikeflex’s best response is to innovate.
Fullscreen

The column player’s reasoning

Now turn to the column player. What would be the best response to the row player’s strategy of Innovate? The answer is Innovate. Place an open circle in the top left-hand cell—there will now be a dot inside a circle.

The column player’s response
: This diagram shows Plugbike and Bikeflex’s possible actions, which are to innovate or not to innovate. Payoffs are (Plugbike’s, Bikeflex’s). If both innovate, payoffs are (1, 1). If Plugbike innovates and Bikeflex does not innovate, payoffs are (-0.5, 0). If Plugbike does not innovate and Bikeflex innovates, payoffs are (0, -0.5). If neither innovates, payoffs are (0, 0). If Bikeflex innovates, Plugbike’s best response is to innovate. If Bikeflex does not innovate, Plugbike’s best response is to not innovate. If Plugbike innovates, Bikeflex’s best response is to innovate. If Plugbike does not innovate, Bikeflex’s best response is not to innovate.
Fullscreen

The column player’s response

Do the same for the column player’s response to row player’s strategy of Do not innovate. There is now a dot inside a circle.

Finding the Nash equilibria
: This diagram shows Plugbike and Bikeflex’s possible actions, which are to innovate or not to innovate. Payoffs are (Plugbike’s, Bikeflex’s). If both innovate, payoffs are (1, 1). If Plugbike innovates and Bikeflex does not innovate, payoffs are (-0.5, 0). If Plugbike does not innovate and Bikeflex innovates, payoffs are (0, -0.5). If neither innovates, payoffs are (0, 0). If Bikeflex innovates, Plugbike’s best response is to innovate. If Bikeflex does not innovate, Plugbike’s best response is to not innovate. If Plugbike innovates, Bikeflex’s best response is to innovate. If Plugbike does not innovate, Bikeflex’s best response is not to innovate. Both firms innovating and neither firm innovating are Nash equilibria.
Fullscreen

Finding the Nash equilibria

Wherever there is a dot inside a circle in a cell, this is a Nash equilibrium because it shows that each player is playing the best response to what the other does.

Imagine that you are Plugbike. If you do not innovate you will get zero, whatever Bikeflex does. If you knew that Bikeflex was not going to introduce its product, then you surely would not develop the Plugbike. What if Bikeflex does introduce its product? If you innovate you will get profits of 1. But you also stand to incur losses of 0.5 if Bikeflex does not innovate.

Unless you are pretty sure that Bikeflex is going to innovate, you may decide that you have better uses for your funds. If Bikeflex reasoned the same way, then neither firm might innovate even though they both would have profited from doing so.

Innovations that are substitutes

When two innovations are substitutes we have the opposite problem. A good example is the video format war during the 1980s between two competing standards, VHS (for ‘video home system’ developed by Victor Company of Japan (JVC)) and Sony’s Betamax format. As discussed above, videos using one format could not be played on machines designed to play the other, so both companies had an interest in making their format the most widely accepted.

We consider two hypothetical firms based on the Sony-JVC case. Here is the payoff matrix facing them. JVC is the row player, and Sony is the column player. As before, the first entry in each cell is the payoff of the row player.

This diagram shows JVC (VHS producer) and Sony’s (Betamax producer) possible actions, which are to innovate or not to innovate. Payoffs are (JVC’s, Sony’s). If both firms innovate, payoffs are (-1, -1). If JVC innovates and Sony does not innovate, payoffs are (2, -0.5). If JVC does not innovate and Sony innovates, payoffs are (-0.5, 2). If neither innovates, payoffs are (0, 0). If Sony innovates, JVC’s best response is not to innovate. If Sony does not innovate, JVC’s best response is to innovate. If JVC innovates, Sony’s best response is not to innovate. If JVC does not innovate, Sony’s best response is to innovate. There are two Nash equilibria: JVC innovating and Sony not innovating, or JVC not innovating and Sony innovating.
Fullscreen

Figure 21.7 The decision to innovate when products are substitutes.

Begin with the row player
: This diagram shows JVC (VHS producer) and Sony’s (Betamax producer) possible actions, which are to innovate or not to innovate. Payoffs are (JVC’s, Sony’s). If both firms innovate, payoffs are (-1, -1). If JVC innovates and Sony does not innovate, payoffs are (2, -0.5). If JVC does not innovate and Sony innovates, payoffs are (-0.5, 2). If neither innovates, payoffs are (0, 0).
Fullscreen

Begin with the row player

Begin with the row player and ask: ‘What would be the best response to the column player’s decision to innovate?’

The best response
: This diagram shows JVC (VHS producer) and Sony’s (Betamax producer) possible actions, which are to innovate or not to innovate. Payoffs are (JVC’s, Sony’s). If both firms innovate, payoffs are (-1, -1). If JVC innovates and Sony does not innovate, payoffs are (2, -0.5). If JVC does not innovate and Sony innovates, payoffs are (-0.5, 2). If neither innovates, payoffs are (0, 0). If Sony innovates, JVC’s best response is not to innovate.
Fullscreen

The best response

The best response would be Do not innovate, since the payoff is –0.5 rather than –1.0. Place a dot in the bottom left-hand cell.

The row player’s response
: This diagram shows JVC (VHS producer) and Sony’s (Betamax producer) possible actions, which are to innovate or not to innovate. Payoffs are (JVC’s, Sony’s). If both firms innovate, payoffs are (-1, -1). If JVC innovates and Sony does not innovate, payoffs are (2, -0.5). If JVC does not innovate and Sony innovates, payoffs are (-0.5, 2). If neither innovates, payoffs are (0, 0). If Sony innovates, JVC’s best response is not to innovate. If Sony does not innovate, JVC’s best response is to innovate.
Fullscreen

The row player’s response

Then ask what the row player’s best response would be to the column player’s choice of Do not innovate: the answer is Innovate. Place a dot in the top right-hand cell.

The column player’s reasoning
: This diagram shows JVC (VHS producer) and Sony’s (Betamax producer) possible actions, which are to innovate or not to innovate. Payoffs are (JVC’s, Sony’s). If both firms innovate, payoffs are (-1, -1). If JVC innovates and Sony does not innovate, payoffs are (2, -0.5). If JVC does not innovate and Sony innovates, payoffs are (-0.5, 2). If neither innovates, payoffs are (0, 0). If Sony innovates, JVC’s best response is not to innovate. If Sony does not innovate, JVC’s best response is to innovate. If JVC innovates, Sony’s best response is not to innovate.
Fullscreen

The column player’s reasoning

Now turn to the column player. What would be the best response to the row player’s strategy of Innovate? The answer is Do not innovate. Place an open circle in the top right-hand cell—there will now be a dot inside a circle.

The column player’s response
: This diagram shows JVC (VHS producer) and Sony’s (Betamax producer) possible actions, which are to innovate or not to innovate. Payoffs are (JVC’s, Sony’s). If both firms innovate, payoffs are (-1, -1). If JVC innovates and Sony does not innovate, payoffs are (2, -0.5). If JVC does not innovate and Sony innovates, payoffs are (-0.5, 2). If neither innovates, payoffs are (0, 0). If Sony innovates, JVC’s best response is not to innovate. If Sony does not innovate, JVC’s best response is to innovate. If JVC innovates, Sony’s best response is not to innovate. If JVC does not innovate, Sony’s best response is to innovate.
Fullscreen

The column player’s response

Do the same for the column player’s response to row player’s strategy of Do not innovate. There is now a dot inside a circle.

Finding the Nash equilibria
: This diagram shows JVC (VHS producer) and Sony’s (Betamax producer) possible actions, which are to innovate or not to innovate. Payoffs are (JVC’s, Sony’s). If both firms innovate, payoffs are (-1, -1). If JVC innovates and Sony does not innovate, payoffs are (2, -0.5). If JVC does not innovate and Sony innovates, payoffs are (-0.5, 2). If neither innovates, payoffs are (0, 0). If Sony innovates, JVC’s best response is not to innovate. If Sony does not innovate, JVC’s best response is to innovate. If JVC innovates, Sony’s best response is not to innovate. If JVC does not innovate, Sony’s best response is to innovate. There are two Nash equilibria: JVC innovating and Sony not innovating, or JVC not innovating and Sony innovating.
Fullscreen

Finding the Nash equilibria

Wherever there is a dot inside a circle in a cell, this is a Nash equilibrium because it shows that each player is playing the best response to what the other does.

If Sony is sure that JVC will innovate, then it will face a costly battle with big losses if JVC wins. The payoffs in the upper left-hand cell are negative for both firms, because the costs of developing the new product and competing for market share do not offset the uncertain prospect of profits should they win. Of course, if Sony knew that JVC was not going to invest, or if it was sure it would win a not-very-costly battle with its product should both invest, then Sony would definitely invest and enjoy the winner-take-all profits, while inflicting losses on JVC.

The result is that there is sometimes too little innovation for the good of society when ideas are complementary, and too much when the innovations are substitutes.

The role of public policy

Complements

If the payoffs in the matrix were known to everyone, then a wise government would know that the top left (Innovate, Innovate) in Figure 21.6 is the best outcome for society. It could, in the case of complementary innovations, provide both firms with sufficient subsidies so that both would find it profitable to make the investment regardless of what the other firm did. Or, more reasonably, it could help the two firms to cooperate in the innovation process, promising not to prosecute them for any anti-competitive prac­tices if coordinated decision making is prohibited by antitrust or other law.

But using public policy to avoid an unfavourable outcome is a greater challenge than our simple model would suggest. There are likely to be more than two potential innovators, and hence many proposed designs for electric bicycles and for recharging systems. The government would have to choose the cooperating firms, and the terms under which the cooperation would occur. In this case, companies have incentives to spend resources to influence government decisions (lobbying). As we shall see in Unit 22, there are many reasons why governments may fail to achieve the socially beneficial outcome in cases like this.

Private exchanges might have a role to play here. If the firms themselves have better information than the government, they might engage in private agreements. This is the equivalent to the bargaining among private economic bodies that occurred in Unit 12 as an alternative to government regulation of the use of chemical weedkillers.

Finally, firms with promising complementary innovations might agree to merge so that, as a single company, the problem of coordinating their innovation decisions would be internal to the firm.

Substitutes and standards

The substitutes in Figure 21.7 present similar challenges for government policy. There may be a great many competing substitute innovations. Sony’s Betamax and JVC’s VHS were not the only entrants in the early stages of the formatting wars. Governments may also lack the relevant information, or may be under the influence of one of the contestants.

As we will see later, sometimes one competitor’s technology wins over the other. Eventually, Betamax, for example, died out and VHS became the universal home videotape standard. Sometimes, companies in an industry apply the same standards, because consistency increases the size of the market and benefits all firms. An example is the way the shipping industry implemented the standard for the size of containers they carry, which allowed trucks and ports to become more efficient, and therefore achieve economies of scale.

Often, however, public sector agencies play an important role in encouraging agreement among all the firms in an industry about technical standards. These are usually international bodies, like the International Telecommunications Union or the European Commission. The EU, for example, helped mobile phone companies to agree on the GSM standard for phone handsets and networks, which enabled all the manufacturers and operators to benefit from a rapidly growing European mobile market, and enabled consumers to benefit from the ease of calling other networks and declining prices.

Exercise 21.4 Complements

  1. List some pairs of innovations that are complements, and some that are substitutes.
  2. In the game in Figure 21.6, what probability of one firm choosing ‘Innovate’ would make it profitable for the other firm to choose ‘Innovate’? Explain your answer. (Hint: Compare the expected payoffs of choosing either option, given that the probability of the other firm choosing ‘Innovate’ is x. What range of probabilities would give a higher expected payoff from choosing ‘Innovate’?)

Exercise 21.5 Substitutes and complements

  1. Go back to Figure 4.16a and consider the game between Bettina and Astrid, in which they choose whether to use two different programming languages, C++ and Java. Describe the similarities and differences in the strategies, payoffs, and optimal outcome of Figure 4.16 and the Sony-JVC game depicted here.
  2. In Figure 21.7, for innovating to be profitable, with what probability should the other firm choose ‘Do not innovate’?

Now suppose that decisions in Figures 21.6 and 21.7 are made sequentially rather than simultaneously. In the case of substitutes (Sony and JVC), imagine that JVC developed its product and put it on the market (or at least convinced Sony that it would definitely do this). In the case of complements (Plugbike and Bikeflex), assume that Plugbike could convince Bikeflex that it will definitely bring the new electric bicycle to the market.

  1. Explain what the outcome in those cases would be if the two firms made their decisions sequentially rather than simultaneously.

Question 21.3 Choose the correct answer(s)

The following matrix shows the payoffs for two firms according to whether they innovate or not. The first number is the payoff for firm A while the second number is for firm B.

A payoff matrix for two firms according to whether they innovate or not.
Fullscreen

Based on this information, which of the following statements is correct?

  • In this game, the two innovations are complements.
  • There are two Nash equilibria in this game: (Innovate, Innovate) and (Do not innovate, Do not innovate).
  • Firm B will definitely choose to innovate because of the potentially high profits from innovation.
  • Firm A will choose to innovate if the probability of Firm B investing is 75% or less.
  • The payoff of one’s innovation is decreased by innovation of the other. Therefore the two are substitutes.
  • The two Nash equilibria in this game are (Innovate, Do not innovate) and (Do not innovate, Innovate).
  • Firm B incurs a large loss if firm A also innovates. It is therefore not certain that firm B would innovate.
  • If the probability that Firm B chooses Innovate is x%, then Firm A gets –x + 1.5(1 – x) by choosing Innovate, compared to –0.5x by choosing Not innovate. Firm A prefers to innovate if –x + 1.5(1 – x) > –0.5x, which is true if x < 0.75.

21.4 Economies of scale and winner-take-all competition

Innovation involves developing new knowledge, and putting it to use. Recall that knowledge is unusual in two ways. It is a public good (what one consumes does not subtract from what is available to others) and its production and use are characterized by extraordinary increasing returns to scale. We discussed knowledge as a public good in Unit 12. In this section, we discuss the two ways in which knowledge-intensive innovation creates economies of scale.

The supply side: First copy costs and economies of scale in production

first copy costs
The fixed costs of the production of a knowledge-intensive good or service.

The first copy of new knowledge is costly to produce, but virtually costless to make available to others. Because first copy costs are large relative to the costs (variable or marginal) of making additional goods available, information production and distribution is different from any other part of the economy.

  • Thriller, by Michael Jackson: This is the best-selling music album in history. It cost $750,000 to produce in 1982 (about twice that amount in 2015 dollars). The marginal cost of producing additional copies is less than $1 for a CD, and almost nothing if it is a download. A CD sells for about $10, and a download for the same amount. The first copy cost of even a modest production by a new band will be at least $10,000, with marginal costs of around $1 for each CD, and zero for a download.
  • Textbooks: To develop a new high-quality textbook in the US costs between $1 million and $2 million, to compensate the writers, designers, editors, and others for their work. This is the first copy cost. The cost of producing and distributing the physical books (printing, warehousing, and delivery included) for a successful text are approximately $12 per book. This is its marginal cost. Students all over the world know that introductory course textbooks typically sell for ten times this amount.
  • Star Wars: The Force Awakens: the production budget for this film, released in 2015, was $200 million. The development cost for the computer game Star Wars: The Old Republic (2011) was between $150 million and $200 million. These figures do not include the marketing and promotion costs, such as advertising, that should be included in the first copy cost, and may be bigger than the production costs. Now that movies are distributed digitally to cinemas, making a film available costs virtually nothing. The marginal costs for movies or games sold on DVD are around the same as for a CD, and when they are sold as digital downloads, they are zero.
  • New drugs: The average first copy cost of a new drug according to a study in the US in 2003 was $403 million. This fact explains the difference in price between drugs that are still under patent, giving the producer a temporary monopoly, and the prices that users pay once the patent has expired so that other producers compete with the originator of the drug. For instance, Omeprazole, a very widely prescribed dyspepsia drug, was patented and launched in 1989, sold under the brand name Prilosec. In the US the patent expired in 2001, and by 2003, 28 tablets of brand-name Prilosec sold for $124, while the equivalent packet of generic Omeprazole cost only $24.12

In Unit 7 we studied how a firm sets prices, and how it decides how much to produce. In Figure 21.8, we show a set of cost curves for a firm producing a knowledge-intensive good. The numbers are hypothetical, and they understate the true size of the first copy cost relative to marginal cost. Even so, the vertical axis is still not drawn to scale so we can read the figure.

  • Total cost: The curve starts at the first copy cost, and then rises very little with increased production.
  • Marginal cost (MC): The curve is low and constant.
  • Average cost (AC): The curve (including economic profits and the first copy costs) falls as quantity increases, as the cost of the first copy is spread over larger units of output.
  • MC < AC: No matter how many units are produced, the marginal cost will always be less than the average.

A firm producing a knowledge-intensive good that wants to make economic profits will have to cover its first copy cost. To do so, the price will have to be at least as high as the average cost curve and therefore higher than marginal cost.

This means the production of knowledge-intensive goods cannot be described by the competitive markets of Unit 8 in which price equals marginal cost (P = MC), but instead by the model of price-setting firms in Unit 7. In Unit 7, we assumed that P > MC because of limited competition. Here it is an unavoidable consequence of first copy costs, and no matter how many competitors there are, price cannot be competed all the way down to marginal cost.

In this diagram, the horizontal axis displays the quantity. The vertical axis displays the cost in dollars. Coordinates are (quantity, cost). A horizontal line passing through point (0, 1) is the marginal cost line). A downward-sloping, convex curve which starts from point (0, 90,000) and does not intersect the marginal cost line is the average cost curve. A straight line passing through points (0, 90,000) and (10, 100,000) is the total cost line. The vertical distance between points (0, 0) and (0, 90,000) is the first copy cost.
Fullscreen

Figure 21.8 A knowledge-intensive good: Marginal, average, and first copy costs.

Earlier in this unit (and in Units 1 and 2), we explain that in the absence of intellectual property rights, competition from followers would eventually eliminate the innovation rents made by first adopters of an improved technology or new product. This is how the diffusion of a new technology happens, and results in lower prices. The same process will take place where first copy costs are important. Other firms will copy the innovator until the economic profits (rents) are eliminated, so that the price being charged offsets the average cost of production, including the first copy cost and the opportunity cost of the capital goods used. But in this situation, the price being charged must be greater than the average cost (due to the first copy costs, as shown in Figure 21.8). Figure 21.9 below illustrates these cases.

  Restricted entry (IPR or other) Unrestricted entry
Declining average costs Economic profits
P > AC > MC
No economic profits
P = AC > MC
Non-declining average costs Economic profits
P > MC ⋛ AC
No economic profits
P = MC = AC

Figure 21.9 The average cost curve, economic profits, and competition.

The demand side: Economies of scale through network effects

network external effects
An external effect of one person’s action on another, occuring because the two are connected in a network. See also: external effect.

The value of many forms of knowledge increases when more people use it. Because the benefits to users increase as the network of users grows, demand-side increasing returns are sometimes called network external effects. The external effect is that when one more person joins the network, all others benefit.

Languages are a good example. Today, more than one billion people are learning English, which is more than three times as many people who speak English as their first language. The demand for English is not due to the intrinsic superiority of the language or because it is easy to learn (as many of you will know), but simply because so many other people, in many parts of the world, speak it. There are many more people who speak Mandarin (Chinese) and Spanish as a first language, and almost as many Hindi and Arabic speakers, but none of these languages is as useful to communicate globally as is English.

Having a particular games console is better when lots of people have the same one, because developers will produce more games for it. Bicycle wheels of 29 inch diameter are more usefull when many people have them, because producer will produce wider variety of rims, tires, tubes or spokes. The more people use public transport, the better it is because buses or trams come more often (if transport agencies keep vehicle occupancy at similar levels). A credit card is more useful when many people have the same card, because lots of shops will accept it as payment.

But have you ever wondered who bought the first telephone, and what they intended to do with it? Or what you could do with the first fax machine?

The technology behind the fax, a device to send images of documents over a telephone line, was first patented by Alexander Bain in 1843—although his image-sending innovation had to use the telegraph, because nobody had invented a telephone yet. A commercial service that could transmit handwritten signatures using the telegraph was available in the 1860s. But the fax remained a niche product until 120 years later when it became so popular that, within the space of 10 years, almost every office installed its own fax machine.

This tells us the first thing we need to know about demand-side economies of scale: there is little incentive to be the first to adopt a technology with this characteristic.

The second thing we need to know is that, if two versions of this type of technology are competing, the one that gains a larger number of adopters at the outset will have an advantage, even if the other one is cheaper or better. To see this, let’s take another look at the video format war between Sony and JVC.

Sony’s Betamax format was superior to JVC’s VHS for its picture and sound quality. But in the early 1980s, Sony made a strategic error by limiting the recording time to 60 minutes. If customers wanted to use their new Sony Betamax to record a feature film, they needed to change the tape in the middle of the recording. By the time Sony had extended its recording length to 120 minutes, there were so many more VHS users that the Betamax format all but disappeared.

winner-take-all competition
Firms entering a market first can often dominate the entire market, at least temporarily.

The video formatting war, and its outcome, is an example of winner-take-all competition, in which economies of scale in production or distribution give the firm with the largest share of the market a commanding competitive edge. Winner-take-all competition does not necessarily select the best.

To see how this works, Figure 21.10 depicts competition based on the Sony and JVC case. The length of the horizontal axis is the number of people purchasing either Sony’s Betamax or JVC’s VHS. We assume that the price of the two products is identical.

To simplify our example, assume that the value of using the product for a new user is approximately the number of individuals currently using the product, n, multiplied by an index for the quality of the product, q. The net benefit of purchasing a good is then equal to the benefit from using the good, qn, minus the price that the consumer pays, p. Our simplifying assumptions then allow us to write the net value of buying the product as Π = qnp. Higher quality products have a higher value of q, so consumers faced with two products with the same number of users and same price will prefer the higher quality good.

The number of individuals buying Betamax is measured from the left to the right, starting at zero and extending potentially all the way to the entire market. The blue line shows the net benefits of using Betamax for consumers. Its equation is ΠB = qBnBp, where the superscript ‘B’ is for Betamax. If everyone buys Betamax, the value to each purchaser is shown in the figure, ΠBmax, which is equal to qBntotalp. If no one else buys Betamax, the value to that first purchaser is negative and equal to the price paid, shown by the intercept on the left-hand vertical axis below the horizontal axis.

In the same figure, the net value of JVC’s product VHS is given by the red line whose equation is ΠV = qVnVp (where the superscript ‘V’ stands for VHS). Because there are only two firms competing, the number buying VHS is just the total size of the market, minus the number buying Betamax.

Let’s assume that the Betamax format is higher quality. Within our model, this means that qB > qV. This implies that if everyone bought Betamax, the net value would be greater than if everyone bought VHS format, that is ΠBmax > ΠVmax. In Figure 21.10, this is illustrated by the fact that the height of the blue Betamax line where it intersects the right-hand axis (everyone using Betamax) is above the intercept of the red VHS line with the left-hand axis (everyone using VHS).

In this diagram, the horizontal axis displays the total number of buyers, which is 10,000. Betamax buyers are measured from the primary vertical axis. VHS buyers are measured from the secondary vertical axis. The primary vertical axis displays the benefit to VHS purchasers, and the secondary vertical axis displays the benefit to Betamax purchasers. An upward-sloping line starting from a negative value on the primary vertical axis is the benefit from choosing Betamax. This negative value corresponds to the benefit from the choice of Betamax if none uses Betamax, denoted minus p. It intersects the secondary vertical axis at point A, which is the benefit from the choice of Betamax if everyone uses Betamax, and is equal to capital pi_superscript-Bmax. A downward-sloping line connects a negative value on the secondary vertical axis with point B on the primary vertical axis, and lower than point A. The negative value on the secondary vertical axis is the benefit from the choice of VHS if none use VHS, and is equal to minus p. Point B represents the benefit from the choice of VHS if everyone uses VHS, and is equal to capital-pi_superscript-Vmax. The two straight lines intersect at point C, which corresponds to 4,000 users of Betamax and 6,000 users of VHS.
Fullscreen

Figure 21.10 The net value of becoming part of a network.

The net benefit of Betamax
: In this diagram, the horizontal axis displays the total number of buyers, which is 10,000. Betamax buyers are measured from the primary vertical axis. VHS buyers are measured from the secondary vertical axis. The primary vertical axis displays the benefit to VHS purchasers, and the secondary vertical axis displays the benefit to Betamax purchasers. An upward-sloping line starting from a negative value on the primary vertical axis is the benefit from choosing Betamax.
Fullscreen

The net benefit of Betamax

The net benefit to a consumer of Betamax is given by the blue line, reading from left to right.

If everyone buys Betamax
: In this diagram, the horizontal axis displays the total number of buyers, which is 10,000. Betamax buyers are measured from the primary vertical axis. VHS buyers are measured from the secondary vertical axis. The primary vertical axis displays the benefit to VHS purchasers, and the secondary vertical axis displays the benefit to Betamax purchasers. An upward-sloping line starting from a negative value on the primary vertical axis is the benefit from choosing Betamax. It intersects the secondary vertical axis at point A, which is the benefit from the choice of Betamax if everyon uses Betamax, and is equal to capital pi_superscript-Bmax.
Fullscreen

If everyone buys Betamax

The net benefit to each purchaser is shown in the figure by ΠBmax, which is equal to qBntotalp. This is the case where Betamax is the winning format and takes all of the market, shown by point A.

If nobody buys Betamax
: In this diagram, the horizontal axis displays the total number of buyers, which is 10,000. Betamax buyers are measured from the primary vertical axis. VHS buyers are measured from the secondary vertical axis. The primary vertical axis displays the benefit to VHS purchasers, and the secondary vertical axis displays the benefit to Betamax purchasers. An upward-sloping line starting from a negative value on the primary vertical axis is the benefit from choosing Betamax. This negative value corresponds to the benefit from the choice of Betamax if none uses Betamax, denoted minus p. It intersects the secondary vertical axis at point A, which is the benefit from the choice of Betamax if everyon uses Betamax, and is equal to capital pi_superscript-Bmax.
Fullscreen

If nobody buys Betamax

The net benefit to a purchaser would be negative and equal to the price paid for it.

The net benefit of VHS
: In this diagram, the horizontal axis displays the total number of buyers, which is 10,000. Betamax buyers are measured from the primary vertical axis. VHS buyers are measured from the secondary vertical axis. The primary vertical axis displays the benefit to VHS purchasers, and the secondary vertical axis displays the benefit to Betamax purchasers. An upward-sloping line starting from a negative value on the primary vertical axis is the benefit from choosing Betamax. This negative value corresponds to the benefit from the choice of Betamax if none uses Betamax, denoted minus p. It intersects the secondary vertical axis at point A, which is the benefit from the choice of Betamax if everyone uses Betamax, and is equal to capital pi_superscript-Bmax. A downward-sloping line connects a negative value on the secondary vertical axis with point B on the primary vertical axis, and lower than point A. The negative value on the secondary vertical axis is the benefit from the choice of VHS if none use VHS, and is equal to minus p. Point B represents the benefit from the choice of VHS if everyone uses VHS, and is equal to capital-pi_superscript-Vmax.
Fullscreen

The net benefit of VHS

The red line gives the net benefit to a consumer of the VHS format. The VHS format is the winner and takes all the market at point B.

For Betamax to break a VHS monopoly
: In this diagram, the horizontal axis displays the total number of buyers, which is 10,000. Betamax buyers are measured from the primary vertical axis. VHS buyers are measured from the secondary vertical axis. The primary vertical axis displays the benefit to VHS purchasers, and the secondary vertical axis displays the benefit to Betamax purchasers. An upward-sloping line starting from a negative value on the primary vertical axis is the benefit from choosing Betamax. This negative value corresponds to the benefit from the choice of Betamax if none uses Betamax, denoted minus p. It intersects the secondary vertical axis at point A, which is the benefit from the choice of Betamax if everyone uses Betamax, and is equal to capital pi_superscript-Bmax. A downward-sloping line connects a negative value on the secondary vertical axis with point B on the primary vertical axis, and lower than point A. The negative value on the secondary vertical axis is the benefit from the choice of VHS if none use VHS, and is equal to minus p. Point B represents the benefit from the choice of VHS if everyone uses VHS, and is equal to capital-pi_superscript-Vmax. The two straight lines intersect at point C, which corresponds to 4,000 users of Betamax and 6,000 users of VHS.
Fullscreen

For Betamax to break a VHS monopoly

For the net benefit of Betamax to be greater than the net benefit of VHS, it would require at least 4,000 buyers to purchase a Betamax recorder, shown in the diagram as all the outcomes to the right of point C.

The first thing to notice is that if at a particular moment everyone is buying VHS (point B), then a new buyer will certainly prefer VHS to Betamax. To see this in the diagram, look at the left-hand side and consider a new buyer. For this person, the value of VHS is high (the intercept with the left-hand axis), whereas the value of Betamax is negative. This is because the new user would have to pay the price of the Betamax recorder, but would not get any benefits because there are no other users, and therefore no video content is provided. This is true even though we have assumed that Betamax costs the same as VHS, and that Betamax is the better quality video cassette.

lock-in
A consequence of the network external effects that create winner-take-all competition. The competitive process results in an outcome that is difficult to change, even if users of the technology consider an alternative innovation superior.

The second lesson from the figure is that even if many consumers (but fewer than 4,000) were buying Betamax, the new consumer would still prefer VHS (the red line is still above the blue line at that point). For Betamax to break the VHS monopoly, it would have to get at least 4,000 buyers. Then Betamax rather than VHS would offer higher value, and could eventually take the entire market (at point A).

So the winner need not be the better alternative. This is sometimes called lock-in.

But this is not the whole story. The history of innovation in the knowledge economy is full of more complicated stories, in which changes are constantly occurring for many reasons.

For example:

  • Browser wars: When the Internet became popular, the market for Internet browsers was dominated by a product called Netscape Navigator. It was displaced by Microsoft Internet Explorer in the ‘browser wars’ of the early 2000s. Internet Explorer, in turn, was later challenged by Mozilla Firefox and Google Chrome.
  • Smartphones: At the beginning of 2009, Android smartphones had a market share of 1.6%, Apple’s iPhones had 10.5%, and the market was dominated by a technology called Symbian, with 48.8% share. At the beginning of 2016, 84.1% of smartphones sold were based on Android, Apple’s smartphones had a share of 14.8%, and Symbian smartphones were no longer being manufactured.
  • Social networks: In June 2006, 80% of people who used a social network used a site called MySpace. By May 2009, more people used Facebook than MySpace.

Question 21.4 Choose the correct answer(s)

Figure 21.8 shows the cost curves for a firm producing a knowledge-intensive good.

The marginal cost is constant at $1 for all output Q. Based on this information, which of the following statements is correct?

  • With positive first copy cost and constant marginal cost, the firm’s average cost will always be above its marginal cost.
  • The firm’s average costs will eventually start increasing, at which point the firm’s production no longer benefits from economies of scale.
  • The government should encourage competition to drive the price down to p = $1.
  • A small-scale bicycle valeting business is a good example of a firm with the cost structure shown in the graph.
  • With positive first copy cost (say F) and constant marginal cost (MC), the average cost (AC) will always be higher than the marginal cost by F/Q, as AC = TC/Q = (F + (MC × Q))/Q = (F/Q) + MC.
  • With positive first copy cost (say F) and constant marginal cost (MC), the average cost (AC) will always be higher than the marginal cost by F/Q, as AC = TC/Q = (F + (MC × Q))/Q = (F/Q) + MC. Now AC > MC for all values of Q, so the AC curve will always be downward sloping. Hence the firm’s production will always exhibit economies of scale.
  • A firm producing a knowledge-intensive good needs to cover its first copy cost and thus must charge a price greater than its marginal cost. It would be making a loss if the price were below its average cost.
  • A small-scale bicycle valeting business would have a very small initial fixed cost–just the cost of bicycle wash equipment–and virtually zero training cost. It is unlikely to benefit from much economies of scale.

21.5 Matching (two-sided) markets

A market is a way of putting together people who might benefit from exchanging a good or service. Often these are potential buyers and sellers of the same commodity, such as milk, and the sides of this market are farmers supplying milk and consumers demanding it. In common usage, a market may also refer to a place such as the Fulton Fish Market that we described in Unit 8, or a place where those selling fresh vegetables, cheese, and baked goods congregate, knowing that they will encounter potential customers. In these markets, buyers do not care about who produced the fish or the milk that they buy, and sellers are similarly not concerned about who is buying, as long as someone buys their products.

Matching (two-sided) markets

People also use the term market to describe a different kind of connection, in which the people on each side of the market care whom they are matched with on the other side. This is what people have in mind when they speak about the ‘marriage market’, for example. Most of us do not get married in the way that we get a carton of milk in the grocery market. The marriage market is about getting married to a person with the combination of characteristics that you find most desirable in a spouse. Markets like these are called matching markets or two-sided markets.

matching market
A market that matches members of two distinct groups of people. Each person in the market would benefit from being connected to the right member of the other group. Also known as: two-sided market.

In our ‘Economist in action’ video, Alvin Roth, an economist who specializes in how markets are designed (and who won the Nobel Prize for his work on the subject in 2012), explains how matching markets function.12 13

We have recently seen a proliferation of online platforms that connect individuals in two groups, starting with the launch of consumer-to-consumer trader eBay in 1995. These platforms make up a general-purpose technology that allows the participants to benefit from being networked together, and so are examples of two-sided markets.

Another example is Airbnb, a service that connects travellers looking for short-term apartment rentals with owners seeking to make money by making their home available while they are not living in it. Airbnb is a platform that puts the group of apartment seekers in touch with the group of apartment owners who would like to offer their apartments for rent. Tinder does the same thing for people who want to find a date for the evening. A service called JOE Network puts employers in contact with people who have recently been awarded PhDs in economics.

The CORE Project is itself a matching market, as it provides a digital platform for researchers, teachers and students in economics to connect in ways that are mutually beneficial, although it is not really a market as the services of the researchers of providing the content in the ebook and the ebook itself are provided without pay.

These matching platforms have become an important topic in economics because of the magnitude of the network connections that are now possible. But while connections on this scale are now technically feasible, there is no mechanism that will reliably bring two-sided markets into existence even if they create gains for the participants on both sides.

At an early stage, these markets—meaning the creation of the platform, or the marketplace, or whatever it is that connects people—face a chicken-and-egg problem. Think about Airbnb: it makes money by charging a commission on each deal that is struck. Unless there are a large number of apartment seekers consulting its website, there is no reason for an apartment owner seeking a rental to offer an apartment for rent. Without apartments to rent, Airbnb will not be able to make money, so there would be no incentive to create the platform in the first place.

A model of a two-sided matching market

strategic complements
For two activities A and B: the more that A is performed, the greater the benefits of performing B, and the more that B is performed the greater the benefits of performing A.

In economics, these two activities—seeking an apartment by going to Airbnb’s web page, and posting one’s apartment on it—are termed strategic complements. This term means that the more of the first (seeking) that occurs, the more benefit there is to someone who does the second (posting); also, the more posting there is, then the more benefit there is to seeking. This is closely related to the network externalities typical of a new innovation that we discussed in the previous section, where the benefit to using Betamax increased with the number of individuals using that video format. However, in this case, the external benefit depends on how many members of the opposite group are on the platform, rather than just how many people are using the platform in total.

Figure 21.11a illustrates the chicken-and-egg problem. We begin with the number of apartments posted on Airbnb. People post their apartment because they believe that many apartment seekers will see the posting and eventually rent the apartment. If there are few people logging onto the Airbnb site (seeking), then few apartment owners will think it’s worth their effort to post their apartment on the site.

The ‘posters’ curve shows hypothetically how many apartments would be posted in response to each possible number of apartment seekers who consult the site. As illustrated in the figure, unless more than 500 apartment seekers are going to the site, no apartment owner will post their home for rent. To see this, look at where the curve labelled ‘posters’ intercepts the horizontal axis. As the number of apartment seekers (those ‘demanding’ apartments) viewing the site rises beyond 500, an increasing number of owners will post their information. But there is a limit to how many people will want to rent out their home temporarily, so the ‘posters’ curve flattens out as we move to the right.

The situation is similar for those seeking to rent an apartment. The number of people checking the Airbnb site depends on how many apartments are posted there. As long as more than a minimum number of apartments are posted on the site (from the figure, more than 200), then some people will look for an apartment there. This is the intercept of the ‘seeker’s’ curve with the vertical axis. The ‘seekers’ curve shows that the more apartments are posted, the more people will look.

In this diagram, the horizontal axis displays the number of apartment seekers, and the vertical axis displays the number of apartments posted. Coordinates are (apartment seekers, apartments posted). An upward-sloping, convex curve starting from (0, 200) is the apartment seekers curve. An upward-sloping, concave curve starting from (500, 0) is the apartment posters curve. Both curves intersect twice, at point A (600, 250), and at point Z (1,800, 700). Point O, the origin, is a Nash equilibrium.
Fullscreen

Figure 21.11a A two-sided matching market: The case of Airbnb.

The number of apartment seekers checking the Airbnb site
: In this diagram, the horizontal axis displays the number of apartment seekers, and the vertical axis displays the number of apartments posted. Coordinates are (apartment seekers, apartments posted). An upward-sloping, convex curve starting from (0, 200) is the apartment seekers curve.
Fullscreen

The number of apartment seekers checking the Airbnb site

This depends on the number of those posting an apartment.

The number of apartments posted by owners
: In this diagram, the horizontal axis displays the number of apartment seekers, and the vertical axis displays the number of apartments posted. Coordinates are (apartment seekers, apartments posted). An upward-sloping, convex curve starting from (0, 200) is the apartment seekers curve. An upward-sloping, concave curve starting from (500, 0) is the apartment posters curve. Both curves intersect twice, at a low number of seekers and low number of apartments posted, and a high number of seekers and high number of apartments posted.
Fullscreen

The number of apartments posted by owners

This depends on the number of apartment seekers checking the Airbnb site.

Point Z
: In this diagram, the horizontal axis displays the number of apartment seekers, and the vertical axis displays the number of apartments posted. Coordinates are (apartment seekers, apartments posted). An upward-sloping, convex curve starting from (0, 200) is the apartment seekers curve. An upward-sloping, concave curve starting from (500, 0) is the apartment posters curve. Both curves intersect twice, at a low number of seekers and low number of apartments posted, and at point Z (1,800, 700).
Fullscreen

Point Z

At Z, the two curves intersect. This point is a Nash equilibrium.

If no apartment seekers are consulting the site
: In this diagram, the horizontal axis displays the number of apartment seekers, and the vertical axis displays the number of apartments posted. Coordinates are (apartment seekers, apartments posted). An upward-sloping, convex curve starting from (0, 200) is the apartment seekers curve. An upward-sloping, concave curve starting from (500, 0) is the apartment posters curve. Both curves intersect twice, at a low number of seekers and low number of apartments posted, and at point Z (1,800, 700). Point O, the origin, is a Nash equilibrium.
Fullscreen

If no apartment seekers are consulting the site

No owners will post their apartment. Nobody doing anything is therefore another Nash equilibrium, as shown by O.

Point A
: In this diagram, the horizontal axis displays the number of apartment seekers, and the vertical axis displays the number of apartments posted. Coordinates are (apartment seekers, apartments posted). An upward-sloping, convex curve starting from (0, 200) is the apartment seekers curve. An upward-sloping, concave curve starting from (500, 0) is the apartment posters curve. Both curves intersect twice, at point A (600, 250), and at point Z (1,800, 700). Point O, the origin, is a Nash equilibrium.
Fullscreen

Point A

At A, the curves also intersect, but the point is not a stable equilibrium.

To see how the Airbnb market works, think about point Z in the figure. Z is a mutually consistent outcome in that:

  • There are 700 apartments posted, so there will be 1,800 apartment seekers.
  • Since there are 1,800 apartment seekers, there will be 700 apartments posted.

This means that the behaviours of the posters and the seekers is mutually consistent at point Z, and so point Z is a Nash equilibrium. If the market is at point Z, with 700 apartments posted and 1,800 apartment seekers, neither the apartment posters or seekers will want to change their behaviour.

But notice that there are two other points that also have this mutual consistency property:

  • There is a Nash equilibrium in which there is no Airbnb: At point O, nobody is posting an apartment on Airbnb so there is no incentive for anyone to look at the site, and because nobody is looking at the site there is no incentive for anyone to post their apartment there. This is the chicken-and-egg problem.
  • Point A is a mutually consistent outcome, with 250 apartments posted and 600 people seeking apartments: It is unlikely to last, however, for reasons discussed below.

To see what happens in the latter case, suppose that the number of apartment seekers unexpectedly dropped from 600 to 450. The best response for the 250 apartment owners who had previously posted their homes would then be to completely pull out of the market. If all the apartment posters drop out of the market, the remaining 450 seekers will also eventually drop out. So, if we enter the blue zone, a ‘vicious cycle’ of both posters and seekers abandoning the market will ensue and the result will be no market at all, which is depicted on the diagram as point O.

unstable equilibrium
An equilibrium such that, if a shock disturbs the equilibrium, there is a subsequent tendency to move even further away from the equilibrium. 
tipping point
An unstable equilibrium at the boundary between two regions characterized by distinct movements in some variable. If the variable takes a value on one side, the variable moves in one direction; on the other, it moves in the other direction. See also: asset price bubble.

This process of adjustment away from an equilibrium is similar to the example you studied in Unit 11, about house prices and the value of durable assets. Because a small move away from point A leads to a cumulative process leading further away from A, we say that point A is unstable. A situation like point A is sometimes described as a tipping point.

Given the chicken-and-egg problem, how could Airbnb ever come into existence? Point Z is a Nash equilibrium, but how could the market ever get there?

If a sufficient number of seekers (greater than 600) somehow showed up on the site then more than 250 owners would post their apartments on the site. Or if by chance 300 owners posted their apartments, then more than 600 seekers would be motivated to check out the Airbnb site.

Figure 21.11b shows that in these cases, a virtuous cycle of both seekers and posters entering the market will take place and the number of both will grow until there are 700 posters and 1,800 seekers.

In this diagram, the horizontal axis displays the number of apartment seekers, and the vertical axis displays the number of apartments posted. Coordinates are (apartment seekers, apartments posted). An upward-sloping, convex curve starting from (0, 200) is the apartment seekers curve. An upward-sloping, concave curve starting from (500, 0) is the apartment posters curve. Both curves intersect twice, at point A (600, 250), and at point Z (1,800, 700). Point O, the origin, is a Nash equilibrium. Point B (876, 300) lies on the apartment seekers curve. Point C lies on the apartment posters curve and corresponds to 876 apartment seekers. The presence of new posters indicated by point C attracts new apartment seekers, shown by a movement along the apartment seekers curve. This process of moving along both of the curves iterates until point Z is achieved.
Fullscreen

Figure 21.11b A two-sided matching market: The case of Airbnb.

Many people seeking apartments
: In this diagram, the horizontal axis displays the number of apartment seekers, and the vertical axis displays the number of apartments posted. Coordinates are (apartment seekers, apartments posted). An upward-sloping, convex curve starting from (0, 200) is the apartment seekers curve. An upward-sloping, concave curve starting from (500, 0) is the apartment posters curve. Both curves intersect twice, at point A (600, 250), and at a high number of seekers and high number of apartments. Point O, the origin, is a Nash equilibrium. Point B (876, 300) lies on the apartment posters curve.
Fullscreen

Many people seeking apartments

Consider the case where there are 876 seekers but only 300 posters, at point B.

New posters join the market
: In this diagram, the horizontal axis displays the number of apartment seekers, and the vertical axis displays the number of apartments posted. Coordinates are (apartment seekers, apartments posted). An upward-sloping, convex curve starting from (0, 200) is the apartment seekers curve. An upward-sloping, concave curve starting from (500, 0) is the apartment posters curve. Both curves intersect twice, at point A (600, 250), and at a high number of seekers and high number of apartments. Point O, the origin, is a Nash equilibrium. Point B (876, 300) lies on the apartment seekers curve. Point C lies on the apartment posters curve and corresponds to 876 apartment seekers.
Fullscreen

New posters join the market

This encourages new posters to list their site (point C) …

New seekers respond
: In this diagram, the horizontal axis displays the number of apartment seekers, and the vertical axis displays the number of apartments posted. Coordinates are (apartment seekers, apartments posted). An upward-sloping, convex curve starting from (0, 200) is the apartment seekers curve. An upward-sloping, concave curve starting from (500, 0) is the apartment posters curve. Both curves intersect twice, at point A (600, 250), and at a high number of seekers and high number of apartments. Point O, the origin, is a Nash equilibrium. Point B (876, 300) lies on the apartment seekers curve. Point C lies on the apartment posters curve and corresponds to 876 apartment seekers. The presence of new posters indicated by point C attracts new apartment seekers, shown by a movement along the apartment seekers curve.
Fullscreen

New seekers respond

This in turn attracts new apartment seekers.

A stable equilibrium
: In this diagram, the horizontal axis displays the number of apartment seekers, and the vertical axis displays the number of apartments posted. Coordinates are (apartment seekers, apartments posted). An upward-sloping, convex curve starting from (0, 200) is the apartment seekers curve. An upward-sloping, concave curve starting from (500, 0) is the apartment posters curve. Both curves intersect twice, at point A (600, 250), and at point Z (1,800, 700). Point O, the origin, is a Nash equilibrium. Point B (876, 300) lies on the apartment seekers curve. Point C lies on the apartment posters curve and corresponds to 876 apartment seekers. The presence of new posters indicated by point C attracts new apartment seekers, shown by a movement along the apartment seekers curve. This process of moving along both of the curves iterates until point Z is achieved.
Fullscreen

A stable equilibrium

The upward spiral leads to point Z, which is a stable Nash equilibrium.

A better outcome
: In this diagram, the horizontal axis displays the number of apartment seekers, and the vertical axis displays the number of apartments posted. Coordinates are (apartment seekers, apartments posted). An upward-sloping, convex curve starting from (0, 200) is the apartment seekers curve. An upward-sloping, concave curve starting from (500, 0) is the apartment posters curve. Both curves intersect twice, at point A (600, 250), and at point Z (1,800, 700). Point O, the origin, is a Nash equilibrium. Point B (876, 300) lies on the apartment seekers curve. Point C lies on the apartment posters curve and corresponds to 876 apartment seekers. The presence of new posters indicated by point C attracts new apartment seekers, shown by a movement along the apartment seekers curve. This process of moving along both of the curves iterates until point Z is achieved.
Fullscreen

A better outcome

Comparing the three equilibria, point Z is the preferred one, better than no market, and better than the unstable equilibrium at point A.

The figure explains why we might end up either with no market at all, or with a functioning market that matches some of the 1,800 seekers with the 700 posters. To see that the second is preferable to the first, think about a particular transaction: all of those posting and seeking are doing so voluntarily, so they must all see a personal benefit in doing it. When one of the seekers is paired with a poster, both seeker and renter benefit (otherwise they would not agree). This is true for every market participant. So, having the market must be better than not having it.

The figure also shows that the market can come into existence and persist if we somehow started out with more than 600 seekers and or 250 posters. But that is a big if.

Market failures in matching markets

The economic policy challenge is to find a way to ensure that someone will create the platforms that produce benefits for participants that are sufficient to justify the cost. This is sometimes done by the public sector playing a role in creating the platform, as it did in the case of the Internet, or physical marketplaces in cities and towns. But in many cases (such as Airbnb, Tinder, and many other private platforms), the existence of a two-sided market is the haphazard result of a forward-looking individual having both the idea and the resources to launch a large, risky project.

For example, to solve the chicken-and-egg startup problem in the Airbnb market, the originator of the platform could have paid the first 250 posters to post their apartments, giving them an incentive to post on the website even when nobody was consulting the site. That could have kicked off the virtuous cycle of additional seekers and posters joining the market.

A common strategy for solving the chicken-and-egg problem is for companies to charge low or zero prices to one group of users, which then attracts the other group. For example, Adobe lets you download its PDF reader for no cost. If many people read documents as PDFs, it incentivizes document creators to pay for Adobe Acrobat, the software used to create PDF files.

While some two-sided markets, such as Wikipedia, are not designed to be money-makers, most are. And some of those who succeeded in creating widely used platforms have gained extraordinary wealth. In 2017, Facebook was valued at $245 billion and Mark Zuckerberg, who founded the company, owned 28.4% of it.

These innovation rents, unlike those associated with a new technical innovation like the spinning jenny studied in Unit 2, may not be competed away because would-be competitors face the very same chicken-and-egg problem that the successful innovators solved.

The problem is similar to the example of the strategic interaction between Plugbike and Bikeflex discussed earlier in this unit. There are probably many potentially mutually beneficial two-sided markets that do not exist (or do not exist yet) because of this chicken-and-egg problem. For instance, there has been little new competition in the credit card industry. It would be difficult to persuade merchants to accept a new type of card if not many shoppers carried it, and it would be difficult to encourage shoppers to carry a card that not many merchants would accept.

A catalogue of policies

The last three sections have introduced three reasons why market competition for profits cannot create an efficient innovation process by itself: external (network) effects, public goods and economies of scale. Public policies can encourage useful innovations and accelerate their diffusion to all users who may benefit. We have already mentioned the possible coordinating role of government-set standards.

In the next three sections we study two other types of policies:

  • Intellectual property rights: These policies support innovation rents accruing to successful innovators.
  • Subsidizing innovation: These policies either directly or indirectly provide basic research and low-cost dissemination of information.

Exercise 21.6 Understanding matching markets

Watch the ‘Economist in action’ video of Alvin Roth. Based on the video, answer the following questions:

  1. How are matching markets different from commodity markets?
  2. Even if a market might be Pareto-improving, why might it not exist? Outline how the New England program helped to resolve a ‘repugnant markets’ problem.
  3. What are some aspects of the relationship between buyers and sellers that could be a source of market failure in matching markets?

Exercise 21.7 Why do curves in the matching markets model slope upwards?

Explain why both curves in the matching markets model, shown in Figure 21.11a, are upward sloping. (Hint: Remember that posting (supplying) apartments and seeking (demanding) apartments are strategic complements.)

Exercise 21.8 Mismatched posters and seekers in a matching market model

Imagine that for some reason there were 1,850 seekers and 750 posters in the matching markets model in Figure 21.11a. Locate this point in the figure. How would posters respond to the number of seekers? How would seekers respond to the number of posters? Which point would the market move to, and why?

Exercise 21.9 Chicken-and-egg

Platforms such as Airbnb, Uber, YouTube, and eBay have successfully overcome the chicken-and-egg problem mentioned above.

  1. Pick one of the platforms mentioned above. What are the gains that this platform offers, and which other markets have they disrupted?
  2. What factors made it possible for this platform to disrupt existing markets?

Question 21.5 Choose the correct answer(s)

Figure 21.11a shows a hypothetical market for Airbnb, a service that connects travellers looking for short-term apartment rentals with owners looking to rent out their home while they are away.

Based on this information, which of the following statements is correct?

  • There will be no posting of apartments when the number of seekers is below 200, while there will be no seeking when the number of apartments posted is below 500.
  • There are three stable Nash equilibria.
  • As long as there are more than 200 seekers and 500 apartments posted, there will always be a positive number of matches.
  • An initial number of 2,000 seekers and 800 posters would result in an equilibrium of 1,800 seekers and 700 apartments posted.
  • The numbers are the other way round: no posters when the seekers are fewer than 500 and no seekers when the posters are fewer than 200.
  • There are indeed three Nash equilibria, at points O, A, and Z. Two are stable (O and Z) while one is unstable (A).
  • For the number of seekers between 500 and 600 and the number of posters between 200 and 250, there will be a downward spiral process where the numbers on both sides of the market are driven down to zero, attaining the zero activity Nash equilibrium (point O).
  • There are initially some seekers and posters who cannot find a match, so they leave the market. Eventually, the market reaches an equilibrium (point Z), where all seekers and posters benefit from being in the market.

21.6 Intellectual property rights

Patent protection may be unnecessary for an innovator if secrecy is possible, or social norms prevent copying. The formula for Coca-Cola has famously remained a secret for 100 years. The company claims it is known by only two executives at any time, who never travel on the same aeroplane. A chef’s signature dish is not a secret, but social norms among chefs would make the costs of copying a recipe without permission extraordinarily high. Comedians rarely steal each other’s jokes for the same reason.

In other cases, an innovation may be known, but barriers to copying can be built into the product itself. Digital watermarking technology allowed some music distributors (briefly) to make recorded music that could not be copied. Seed companies successfully accomplished the same thing by introducing hybrid corn and other varieties that do not reproduce well.

Firms can also rely on superior capabilities that are complementary to a technological product to protect their innovation rents. Such capabilities could be a superior sales force, the ability to bring products to market more quickly, or exclusive contracts with input suppliers.

Secrecy, barriers to copying, or complementary capabilities may not be effective against rivals who manage to invent the same product independently, or who reverse-engineer it by starting with the finished product and working out how it was made.

patent
A right of exclusive ownership of an idea or invention, which lasts for a specified length of time. During this time it effectively allows the owner to be a monopolist or exclusive user.
trademark
A logo, a name, or a registered design typically associated with the right to exclude others from using it to identify their products.
copyright
Ownership rights over the use and distribution of an original work.

Where a novel idea is both codifiable (it can be written down) and non-excludable (imitation cannot be prevented), governments have created laws protecting intellectual property rights. There are many kinds of intellectual property, but the most commonly used are patents, trademarks, and copyright. What they have in common is that they give the holder of the right exclusive use of the thing covered by the right for some designated period of time. In economic terms, the holder of the intellectual property right is made a temporary monopolist.

Intellectual property rights

Codifiable and non-excludable ideas can be protected by the following forms of intellectual property rights in the following ways:

Patents

Patents require the innovator to disclose their idea in a patent application, which is examined by a patent office and subsequently published. If the examiners are convinced the idea is sufficiently new and inventive, they will grant the innovator a patent. In most cases, a patent gives the innovator the right to take any imitator to court for 20 years: this can be extended to 25 years in the case of pharmaceutical patents. Some countries vary the length of patent protection.

Trademarks

A trademark gives the owner of a logo, a name, or a registered design the right to exclude others from using it to identify their products. Trademarks can be extended indefinitely. Patents and trademarks are generally registered at a dedicated office.

Copyright gives the author of an intellectual work such as a book, an opera, or software code the right to exclude others from reproducing, adapting, and selling it. Copyright is generally not registered. The author must make a claim if he or she believes it has been violated. Copyright terms are far longer than those for patents, and have been progressively extended. Copyright applies for a minimum of 25 years and in the US currently for 70 years after the death of the creator. Long copyright terms are controversial, because often the benefits go to people who did not create the work.

How intellectual property rights affect innovation

Until recently, it had been thought that patents encourage the development and use of innovations. Now economists and historians are taking a second look at whether intellectual property rights promote or actually destroy innovation. The answer depends on which of two opposite effects is more important:

  • Creating a monopoly: This has a beneficial effect for the holder of the intellectual property rights, and creates economic profits (innovation rents) which stimulate research and development.
  • Impeding innovation and diffusion of new ideas: These rights limit the ability of others to copy the innovation.

An important historical case is the steam engine, which was so important to the Industrial Revolution. There were several types of steam engine invented during the eighteenth century, but the most successful type was patented in 1769 by James Watt. He was an engineer, and did nothing to commercialize his innovation. In fact, he did not begin production in earnest until six years after he invented it.

The commercial value of the patent was an afterthought for Watt. The businessman Matthew Boulton bought a share in the patent, and persuaded Watt to move to Birmingham (one of the centres of the Industrial Revolution) to develop the new engine he had invented. Boulton also campaigned successfully to extend the period of the patent from 14 to 31 years.

Creative commons licensing

Creative commons logo

The CORE economics text you are reading is available online under what is called a Creative Commons copyright license.

  • It allows anyone to access our curriculum material and use it in non-commercial ways, as long as they credit CORE as the originator.
  • We do this so that as many people as possible can access the work of our contributors at no cost, but not make a profit from it.

Afterwards, Watt and Boulton used the courts vigorously to prevent any other steam engines from being sold, even if they were different to Watt’s design. Among these was Jonathan Hornblower’s rival invention, which was more efficient than the Watt design. Watt and Boulton challenged Hornblower’s patent, eventually winning the case in 1799.

Another superior invention, created by an employee, was blocked when Watt and Boulton succeeded in broadening their patent to cover the new design, even though they had not had any part in its development. Ironically, Watt knew how to make his machine more efficient, but he couldn’t make the improvement. Someone else held the patent.

Under the Watt-Boulton patent, the UK added about 750 horsepower of steam engines per year. In the 30 years after it expired, more than 4,000 horsepower a year of steam engines were installed in England. Fuel efficiency, which had barely improved while the patent was in force, increased by a factor of five between 1810 and 1835.

There is no doubt that patent protection is essential to the process of new knowledge creation in some industries. When the patent on a pharma­ceutical blockbuster drug (a drug with annual sales of more than $1 billion in the US) expires, firms specializing in copying drug formulations and selling generic versions of the drug can enter the market, and the drug’s price decreases as it is exposed to price competition. The patent owner’s profits decrease significantly. Rapid falls in profits demonstrate that the monopolies created by patents can be immensely valuable for the patent owner, but costly for users of the patented innovation.

When the DVD was introduced, it became apparent that the technology would allow consumers to not just own, but also to copy music and films from these disks in high quality. This posed a significant dilemma for the music and film industries that was addressed through new laws making it illegal to subvert digital rights management (DRM) technology, which the film companies used to stop people copying the content without permission. These same laws are now often used when users share content that is copyright protected over the Internet. Today DRM technology helps to protect the companies we now call content providers, who use the Internet as a distribution device—think of a television company that streams sports events live to computers and phones.

Figure 21.12 is a schematic representation of the innovation process. Arrows represent inputs, pointing towards the aspect of innovation that they affect. The figure highlights how the creation of new knowledge always builds on existing knowledge. For instance, Hornblower built on the existing Watt-Boulton design to improve efficiency. As was the case in the early days of the Industrial Revolution, existing patents restrict the ability to build on existing knowledge, and can therefore have a negative effect on innovation. On the other hand, by securing innovation rents for creators, they encourage innovation.

This flowchart shows that patents can have a negative effect because elements of old knowledge are required to create new knowledge, but have a positive effects on innovation investment and creativity to create new knowledge. Once new knowledge is created, patents have a negative effect in promoting its diffusion.
Fullscreen

Figure 21.12 Patents and the production of new knowledge.

Old knowledge helps make new knowledge
: This flowchart shows that patents can have a negative effect because elements of old knowledge are required to create new knowledge.
Fullscreen

Old knowledge helps make new knowledge

Patents slow down this process. As Watt and Boulton found out, patents can impede the use of some aspects of old knowledge that are covered by patents.

Patents encourage innovation
: This flowchart shows that patents can have a negative effect because elements of old knowledge are required to create new knowledge, but have a positive effects on innovation investment and creativity to create new knowledge.
Fullscreen

Patents encourage innovation

The creation of new knowledge gives successful inventors recognition and innovation rents. Watt did not invent the steam engine to profit from the patent he would receive, but other innovators are strongly motivated by the prospect of commercializing their inventions.

Patents slow diffusion
: This flowchart shows that patents can have a negative effect because elements of old knowledge are required to create new knowledge, but have a positive effects on innovation investment and creativity to create new knowledge. Once new knowledge is created, patents have a negative effect in promoting its diffusion.
Fullscreen

Patents slow diffusion

Patents prevent other innovators from realizing the full benefits of new knowledge after it has been created. Watt and Boulton managed to use patents to stop rival inventors from creating their own, perhaps better, steam engines.

When Petra Moser, an economic historian, studied the number and quality of technical inventions shown at mid-nineteenth century technology expositions, she found that countries with patent systems were no more inventive than countries without patents. Patents did, however, affect the kinds of inventive activities in which countries excelled.

When economists disagree Intellectual property rights: Dynamo or drag?

Recall that in one of our ‘Economist in action’ videos, F. M. Scherer argues that patents incentivize R&D in pharmaceutical companies (unlike in many other sectors, he says), so that they continue to develop new blockbuster drugs.

Petra Moser explains that copyright protection for nineteenth-century Italian operas led to more and better operas being written. But she also presents evidence suggesting that intellectual property rights may do more harm than good for the innovation process if they are too broad or too long term.14 15

Exercise 21.10 Thomas Jefferson

Thomas Jefferson (1743–1826), America’s third president, noted the peculiar and wonderful nature of an idea:

Its peculiar character … is that no one possesses the less, because every other possesses the whole of it. He who receives an idea from me, receives instruction himself without lessening mine; as he who lights his taper [candle] at mine, receives light without darkening me. (‘Thomas Jefferson to Isaac McPherson’, Writings, 1813)

Jefferson went on to say something that even then was controversial:

It would be curious, then, if an idea, the fugitive fermentation of an individual brain could … be claimed in exclusive and stable property.

To him, granting to an individual the exclusive right to own and exclude others from the use of an idea just did not make sense, any more than it would make sense for a person to refuse to tell someone what time of day it was.

  1. Rewrite the first part of Jefferson’s quote using the economic terms you learned in this course.
  2. Do you agree with Jefferson’s statement that ideas should not be ‘claimed in exclusive and stable property’? Why, or why not?

Watch our ‘Economist in action’ video, in which Petra Moser discusses copyright protection for nineteenth-century Italian operas.

  1. Outline Petra Moser’s research question, and her approach to answering it.
  2. What were Petra Moser’s findings about patents and copyrights?
  3. What factors should governments consider when determining the effective time period of IPR protection laws such as patents and copyrights?

Exercise 21.12 Intellectual property rights

Why does an extension of copyright terms (for example, an extension of the life of the protection) not change incentives to improve intellectual works (texts and operas) as much as the introduction of copyright itself? In your answer, consider who benefits from extended copyright terms.

Question 21.6 Choose the correct answer(s)

Which of the following statements is correct regarding laws protecting intellectual property rights?

  • A patent is a non-registered right that gives the producer of an intellectual work (such as a book or software code) the right to exclude others from reproducing, adapting and selling it.
  • Copyright is granted if a work is found to be sufficiently new and inventive after an examination by the copyright office.
  • Trademarks give the owners of a registered design the right to exclude others from using it.
  • Economists agree that patents, copyright, and trademarks all promote innovation unambiguously.
  • Patents are registered and are granted after an examination by the patent office. Copyright is the non-registered right to exclude others from reproducing one’s work.
  • This statement is for patents and the patent office. Copyright does not require an examination.
  • This is the definition of trademark.
  • Whilst intellectual property rights exist to protect revenues from a technological innovation, they can also hinder innovation when the new knowledge is based on the old knowledge already protected by intellectual property rights. Economists disagree about the conditions under which intellectual property rights promote or destroy innovation.

21.7 Optimal patents: Balancing the objectives of invention and diffusion

Patents confront us with an economic problem: how best to balance the competing objectives of making good use of existing knowledge, devoting sufficient economic resources and creativity to producing new knowledge, and diffusing the new knowledge that is created. An ‘optimal patent’ is one that best advances the use of knowledge in the economy. Currently, agreements administered by the World Trade Organization, which regulates international trade, may prevent countries from choosing patent length, but given complete freedom of choice, how could a policymaker decide the optimal patent length?

In Figure 21.13, we look first at the decision of an innovator in the upper panel. Work through the analysis in Figure 21.13 to understand the timing of costs and benefits of innovation and who receives them.

There are two diagrams. In diagram 1, the horizontal axis displays time. The vertical axis displays cost and profits per unit of time. A horizontal line above the horizontal axis shows zero economic profits. Cost and profits is a step function that is negative during the period of invention, above the zero economic profit line after successful innovation occurs, and on the zero economic profit line after the invention is copied by others and does not change even when the innovation is copied in the absence of a patent. When the patent expires, it jumps on the zero economic profits line. This downward jump is called the patent cliff. The area between the horizontal axis and the lower portion of the step function is the cost of invention. The area between the upper-most portion of the step function and the zero economic profits line up to when the innovation is copied is innovation rents. The remaining area between the upper-most portion of the step function and the zero economic profits line is reduced rents with no patent. In diagram 2, the horizontal axis displays time. The vertical axis displays benefits to others. There is a benefit to others between the time a successful innovation occurs and the patent expires. This benefit is higher after the patent expires. Benefits to others would be at this high level sooner if the patent did not delay copying.
Fullscreen

Figure 21.13 Costs and rents associated with innovation for the inventor and others.

The innovator incurs costs
: In this diagram, the horizontal axis displays time. The vertical axis displays cost and profits per unit of time. A horizontal line above the horizontal axis shows zero economic profits. Cost and profits is a step function that is negative during the period of invention, above the zero economic profit line after successful innovation occurs, and on the zero economic profit line after the invention is copied by others. The area between the horizontal axis and the lower portion of the step function is the cost of invention.
Fullscreen

The innovator incurs costs

The costs of innovation are shown by the red rectangle.

The innovation is successful
: In this diagram, the horizontal axis displays time. The vertical axis displays cost and profits per unit of time. A horizontal line above the horizontal axis shows zero economic profits. cost and profits is a step function that is negative during the period of invention, above the zero economic profit line after successful innovation occurs, and on the zero economic profit line after the invention is copied by others. The area between the horizontal axis and the lower horizontal line is the cost of invention. Part of the area between the upper-most horizontal line and the zero economic profits line is innovation rents.
Fullscreen

The innovation is successful

The firm makes innovation rents above economic profits. This is the rectangle above the dotted zero economic profits line.

A patent
: In this diagram, the horizontal axis displays time. The vertical axis displays cost and profits per unit of time. A horizontal line above the horizontal axis shows zero economic profits. Cost and profits is a step function that is negative during the period of invention, above the zero economic profit line after successful innovation occurs, and on the zero economic profit line after the invention is copied by others and does not change even when the innovation is copied in the absence of a patent. When the patent expires, it jumps on the zero economic profits line. This downward jump is called the patent cliff. The area between the horizontal axis and the lower portion of the step function is the cost of invention. The area between the upper-most portion of the step function and the zero economic profits line up to when the innovation is copied is innovation rents. The remaining area between the upper-most portion of the step function and the zero economic profits line is reduced rents with no patent.
Fullscreen

A patent

The firm benefits from innovation rents for the life of the patent.

The benefits to others in the economy
: There are two diagrams. In diagram 1, the horizontal axis displays time. The vertical axis displays cost and profits per unit of time. A horizontal line above the horizontal axis shows zero economic profits. Cost and profits is a step function that is negative during the period of invention, above the zero economic profit line after successful innovation occurs, and on the zero economic profit line after the invention is copied by others and does not change even when the innovation is copied in the absence of a patent. When the patent expires, it jumps on the zero economic profits line. This downward jump is called the patent cliff. The area between the horizontal axis and the lower portion of the step function is the cost of invention. The area between the upper-most portion of the step function and the zero economic profits line up to when the innovation is copied is innovation rents. The remaining area between the upper-most portion of the step function and the zero economic profits line is reduced rents with no patent. In diagram 2, the horizontal axis displays time. The vertical axis displays benefits to others. There is a benefit to others between the time a successful innovation occurs and the patent expires.
Fullscreen

The benefits to others in the economy

The lower panel shows the benefits that arise from the innovation. If the innovation did not exist, there would be no benefits to others.

A patent
: There are two diagrams. In diagram 1, the horizontal axis displays time. The vertical axis displays cost and profits per unit of time. A horizontal line above the horizontal axis shows zero economic profits. Cost and profits is a step function that is negative during the period of invention, above the zero economic profit line after successful innovation occurs, and on the zero economic profit line after the invention is copied by others and does not change even when the innovation is copied in the absence of a patent. When the patent expires, it jumps on the zero economic profits line. This downward jump is called the patent cliff. The area between the horizontal axis and the lower portion of the step function is the cost of invention. The area between the upper-most portion of the step function and the zero economic profits line up to when the innovation is copied is innovation rents. The remaining area between the upper-most portion of the step function and the zero economic profits line is reduced rents with no patent. In diagram 2, the horizontal axis displays time. The vertical axis displays benefits to others. There is a benefit to others between the time a successful innovation occurs and the patent expires. This benefit is higher after the patent expires. Benefits to others would be at this high level sooner if the patent did not delay copying.
Fullscreen

A patent

The patent reduces benefits to others, because it delays copying and diffusion.

In the lower panel of Figure 21.13, we include the benefits to others in the economy that arise from the innovation. The term ‘patent cliff’ is from the point of view of the innovator, and refers to the significant decrease in profits when the patent expires. But in the lower panel we see the opposite effect—the benefits of the innovation shoot up when the patent expires, because the innovation is now free to diffuse throughout the economy.

This demonstrates the trade-off. Without the innovation, there are no benefits to others and the likelihood of the innovation increases with longer patents. However, for any given innovation, the benefits are reduced by the duration of the patent. Earlier imitation of the innovation brings benefits to the economy, shown by the dashed rectangle in the lower panel.

From this, we can see that a long patent emphasizes the benefits of rapid innovation, and a short patent emphasizes the benefits of rapid imitation. But we can’t decide by looking at Figure 21.13 how long the optimal patent should be.

The trade-off between the benefits of diffusion and of invention

isototal benefits curve
The combinations of the probability of innovation and the total benefits to society from a firm’s innovation that yield the same total benefits.

Figure 21.14 shows how we can represent the benefits of innovation to society as a whole. On the horizontal axis are shown the total benefits to others in the economy if the firm innovates. This is called B. On the vertical axis we estimate the probability of innovation, called pI. The downward-sloping curves are indifference curves called isototal benefits curves. The total benefits to others from innovation are:

In this diagram, the horizontal axis displays the benefits of diffusion if the firm innovates, denoted B. The vertical axis displays the probability of innovation in case of an invention, denoted p-I.Two downward-sloping, parallel, convex curves are the isotototal benefits curves.The lower curve has utility equals p-I times B, and corresponds to lower total benefits than the upper curve. Points C and D lie on this isototal curve. C corresponds to high diffusion and low invention. D corresponds to low diffusion and high invention.
Fullscreen

Figure 21.14 Isototal benefits curves: The trade-off between the benefits of invention and diffusion.

The isototal benefits curve
: In this diagram, the horizontal axis displays the benefits of diffusion if the firm innovates, denoted B. The vertical axis displays the probability of innovation in case of an invention, denoted p-I. A downward-sloping, convex curve is the isotototal benefits curve, and corresponds to utility equals p-I times B.
Fullscreen

The isototal benefits curve

The downward-sloping curve is an indifference curve, called an isototal benefits curve. Along the curve the total benefits arising from an innovation are equal to pIB and remain constant.

Rectangles that touch the curve
: In this diagram, the horizontal axis displays the benefits of diffusion if the firm innovates, denoted B. The vertical axis displays the probability of innovation in case of an invention, denoted p-I. A downward-sloping, convex curve is the isotototal benefits curve, and corresponds to utility equals p-I times B. Points C and D lie on this isototal benefits curve. C corresponds to high diffusion and low invention. D corresponds to low diffusion and high invention.
Fullscreen

Rectangles that touch the curve

Any rectangle with a corner on the curve has the same area as any other. Points C and D illustrate this.

A preferable curve
: In this diagram, the horizontal axis displays the benefits of diffusion if the firm innovates, denoted B. The vertical axis displays the probability of innovation in case of an invention, denoted p-I.Two downward-sloping, parallel, convex curves are the isotototal benefits curves.The lower curve has utility equals p-I times B, and corresponds to lower total benefits than the upper curve. Points C and D lie on this isototal curve. C corresponds to high diffusion and low invention. D corresponds to low diffusion and high invention.
Fullscreen

A preferable curve

The higher isototal benefits curve is preferable to the curve through C and D.

Feasible invention and diffusion

What are the constraints? What limits the total benefits that will occur if the innovation takes place? This will depend on the length of the patent, because a longer period of patent protection is thought at least initially to increase the probability of innovation, pI, but to reduce the amount of total benefits for others, B, if the innovation occurs because of the delay in copying.

Even when there is no patent, innovation can occur, as shown on the vertical axis of Figure 21.15. In these cases the innovator could capture innovation rents just by being the first in the market, because it takes competitors some time to catch up.

Figure 21.15 shows that as the duration of patents increases (moving to the right along the horizontal axis), so does the probability of innovation because innovation rents are protected for a longer period of time. Beyond a particular length of patent protection, however, the probability of innovation begins to decline because long-term patents will prevent other potential innovators from using protected knowledge or processes to develop an idea.

In this diagram, the horizontal axis displays the duration of patents, and ranges from no patent to permanent monopoly. The vertical axis displays the probability of innovation in case of invention. A concave curve is shown, which does not intersect the vertical axis. When there is no patent, innovation can occur. The upward-sloping portion of the curve shows that by protecting innovation rents, patents may encourage innovation. The downward-sloping portion of the curve shows that too strong patents impede innovation.
Fullscreen

Figure 21.15 Patent duration and probability of innovation.

We can show the feasible set in Figure 21.16, which presents the trade-off between a higher probability of innovation and the total benefits to others if the firm innovates.

In this diagram, the horizontal axis displays the total benefits to others if firm innovates, and ranges from low benefits with infinitely lived patents to high benefits with no patent. The vertical axis displays the probability of innovation in case of invention. A concave curve is shown, which does not intersect the vertical axis. The upward-sloping portion of the curve shows that weakening patents from very strong increases the probability of innovation and benefits to others. The downward-sloping portion of the curve shows that there is a trade-off: weaker patents expand benefits to others but reduce incentives for innovation.
Fullscreen

Figure 21.16 The feasible set: Innovation probability and benefits to others.

Each point on the feasible set is the result of a given patent length, starting at the left-hand side with a patent that never expires. As we move to the right, the duration of a patent declines. There are increasing benefits to others. Initially this increases both the benefits to others should the innovation occur, and (as we saw in Figure 21.15) the probability of innovation. This gives the positively sloped section of the feasible set. However, as we have also seen, at some point there will be a trade-off: a further reduction in patent duration will decrease the probability of innovation, even though it expands the total benefits that would result should the innovation occur. This explains the downward-sloping portion of the frontier of the feasible set.

Optimal patent duration

If we now put the feasible set together with the isototal benefits curves, we can determine the length of the patent that maximizes the expected benefits consistent with the constraints imposed by the trade-off between the incentive for innovation and stimulating diffusion. The highest attainable level of total benefits is shown by the tangency of the isototal benefits curve with the feasible set. This is point A in Figure 21.17.

In this diagram, the horizontal axis displays the total benefits to others if firm innovates, and ranges from low benefits with infinitely lived patents to high benefits with no patent. The vertical axis displays the probability of innovation in case of invention. A concave curve is shown, which does not intersect the vertical axis. Two downward-sloping, parallel, convex curves are isototal benefit curves. The higher curve corresponds to utility equals p-I times B and is tangent to the concave curve at point A, which corresponds to a probability of innovation p-star. Point A shows that given the benefits of innovation to others shown by the horizontal axis, p-star is the optimal probability of innovation. The lower isototal benefits curve intersects the concave curve at point E, with a lower benefit and higher probability of innovation than point A, denoted p-I, and at a higher benefits and lower probability of innovation than point A.
Fullscreen

Figure 21.17 The optimal probability of innovation for society.

Maximizing expected benefits to society
: In this diagram, the horizontal axis displays the total benefits to others if firm innovates, and ranges from low benefits with infinitely lived patents to high benefits with no patent. The vertical axis displays the probability of innovation in case of invention. A concave curve is shown, which does not intersect the vertical axis. Two downward-sloping, parallel, convex curves are isototal benefit curves. The higher curve corresponds to utility equals p-I times B and is tangent to the concave curve. The lower isototal benefits curve intersects the concave curve at a lower benefits and higher probability of innovation, and at a higher benefits and lower probability of innovation.
Fullscreen

Maximizing expected benefits to society

Combining the feasible set with the isototal benefits curves, we can determine the length of the patent that maximizes the expected benefits to society as a whole.

The highest attainable level of total benefits
: In this diagram, the horizontal axis displays the total benefits to others if firm innovates, and ranges from low benefits with infinitely lived patents to high benefits with no patent. The vertical axis displays the probability of innovation in case of invention. A concave curve is shown, which does not intersect the vertical axis. Two downward-sloping, parallel, convex curves are isototal benefit curves. The higher curve corresponds to utility equals p-I times B and is tangent to the concave curve at point A. The lower isototal benefits curve intersects the concave curve at a lower benefits and higher probability of innovation than point A, and at a higher benefits and lower probability of innovation than point A.
Fullscreen

The highest attainable level of total benefits

This is shown by the tangency of the isototal benefits curve with the feasible set at point A.

The optimal probability of innovation
: In this diagram, the horizontal axis displays the total benefits to others if firm innovates, and ranges from low benefits with infinitely lived patents to high benefits with no patent. The vertical axis displays the probability of innovation in case of invention. A concave curve is shown, which does not intersect the vertical axis. Two downward-sloping, parallel, convex curves are isototal benefit curves. The higher curve corresponds to utility equals p-I times B and is tangent to the concave curve at point A, which corresponds to a probability of innovation p-star. Point A shows that given the benefits of innovation to others shown by the horizontal axis, p-star is the optimal probability of innovation. The lower isototal benefits curve intersects the concave curve at a lower benefits and higher probability of innovation than point A, and at a higher benefits and lower probability of innovation than point A.
Fullscreen

The optimal probability of innovation

From the perspective of society as a whole, the optimal probability of innovation is p*.

Higher probability of innovation but lower benefits to society
: In this diagram, the horizontal axis displays the total benefits to others if firm innovates, and ranges from low benefits with infinitely lived patents to high benefits with no patent. The vertical axis displays the probability of innovation in case of invention. A concave curve is shown, which does not intersect the vertical axis. Two downward-sloping, parallel, convex curves are isototal benefit curves. The higher curve corresponds to utility equals p-I times B and is tangent to the concave curve at point A, which corresponds to a probability of innovation p-star. Point A shows that given the benefits of innovation to others shown by the horizontal axis, p-star is the optimal probability of innovation. The lower isototal benefits curve intersects the concave curve at point E, with a lower benefit and higher probability of innovation than point A, denoted p-I, and at a higher benefits and lower probability of innovation than point A.
Fullscreen

Higher probability of innovation but lower benefits to society

At E, with a longer patent than the optimal one at A, innovation is more likely but because of less diffusion, its benefits to society as a whole are lower as shown by the lower isototal benefits curve.

This outcome on its own is not a policy, but it allows us to determine one. We can now go back to Figure 21.15 and ask what patent duration would a policymaker set so that the innovating firm will choose society’s optimal probability of innovation, p*? Figure 21.18 shows the answer.

There are two diagrams. The horizontal axis displays the total benefits to others if firm innovates, and ranges from low benefits with infinitely lived patents to high benefits with no patent. The vertical axis displays the probability of innovation in case of invention. A concave curve is shown, which does not intersect the vertical axis. Two downward-sloping, parallel, convex curves are isototal benefit curves. The higher curve corresponds to utility equals p-I times B and is tangent to the concave curve at point A, which corresponds to a probability of innovation p-star. The lower isobenefit curve intersects the concave curve at point E with a lower benefit and higher probability of innovation than point A, and at higher benefits and a lower probability of innovation than point A. In diagram 2, the horizontal axis displays the duration of patents, and ranges from no patent to permanent monopoly. The vertical axis displays the probability of innovation in case of invention. Coordinates are (duration, probability). A point on the vertical axis shows that with no patent, innovation still occurs with low probability. A concave curve is shown, which does not intersect the vertical axis. The upward-sloping portion of the curve shows that by protecting innovation rents, patents may encourage innovation. The downward-sloping portion of the curve shows that too strong patents impede innovation. Point (d-star, p-star) lies on the upward-sloping portion of the curve, where p-star has the same value as in diagram 1.
Fullscreen

Figure 21.18 The optimal patent duration.

The optimal probability of innovation
: In this diagram, the horizontal axis displays the total benefits to others if firm innovates, and ranges from low benefits with infinitely lived patents to high benefits with no patent. The vertical axis displays the probability of innovation in case of invention. A concave curve is shown, which does not intersect the vertical axis. Two downward-sloping, parallel, convex curves are isototal benefit curves. The higher curve corresponds to utility equals p-I times B and is tangent to the concave curve at point A, which corresponds to a probability of innovation p-star. The lower isobenefit curve intersects the concave curve at point E with a lower benefit and higher probability of innovation than point A, and at higher benefits and a lower probability of innovation than point A.
Fullscreen

The optimal probability of innovation

Given the benefits of innovation to others, we established in Figure 21.17 that p* is the optimal probability of innovation. This can tell us what the duration of the patents should be.

The optimal duration of patents
: There are two diagrams. In diagram 1, the horizontal axis displays the total benefits to others if firm innovates, and ranges from low benefits with infinitely lived patents to high benefits with no patent. The vertical axis displays the probability of innovation in case of invention. A concave curve is shown, which does not intersect the vertical axis. Two downward-sloping, parallel, convex curves are isototal benefit curves. The higher curve corresponds to utility equals p-I times B and is tangent to the concave curve at point A, which corresponds to a probability of innovation p-star. The lower isobenefit curve intersects the concave curve at point E with a lower benefit and higher probability of innovation than point A, and at higher benefits and a lower probability of innovation than point A. In diagram 2, the horizontal axis displays the duration of patents, and ranges from no patent to permanent monopoly. The vertical axis displays the probability of innovation in case of invention. Coordinates are (duration, probability). A concave curve is shown, which does not intersect the vertical axis. The upward-sloping portion of the curve shows that by protecting innovation rents, patents may encourage innovation. The downward-sloping portion of the curve shows that too strong patents impede innovation. Point (d-star, p-star) lies on the upward-sloping portion of the curve, where p-star has the same value as in diagram 1.
Fullscreen

The optimal duration of patents

If we know p*, we can use Figure 21.15 (the right-hand figure here) to determine the optimal duration of patents, d*.

What if there were no patents?
: There are two diagrams. The horizontal axis displays the total benefits to others if firm innovates, and ranges from low benefits with infinitely lived patents to high benefits with no patent. The vertical axis displays the probability of innovation in case of invention. A concave curve is shown, which does not intersect the vertical axis. Two downward-sloping, parallel, convex curves are isototal benefit curves. The higher curve corresponds to utility equals p-I times B and is tangent to the concave curve at point A, which corresponds to a probability of innovation p-star. The lower isobenefit curve intersects the concave curve at point E with a lower benefit and higher probability of innovation than point A, and at higher benefits and a lower probability of innovation than point A. In diagram 2, the horizontal axis displays the duration of patents, and ranges from no patent to permanent monopoly. The vertical axis displays the probability of innovation in case of invention. Coordinates are (duration, probability). A point on the vertical axis shows that with no patent, innovation still occurs with low probability. A concave curve is shown, which does not intersect the vertical axis. The upward-sloping portion of the curve shows that by protecting innovation rents, patents may encourage innovation. The downward-sloping portion of the curve shows that too strong patents impede innovation. Point (d-star, p-star) lies on the upward-sloping portion of the curve, where p-star has the same value as in diagram 1.
Fullscreen

What if there were no patents?

We can see that innovation will still occur, but below the optimal level for society.

Exercise 21.13 Optimal patents

  1. Consider two contrasting technologies. For one, the government would optimally choose a short patent duration. For the other, it would choose a longer patent duration. In each case, draw the feasible set and label the optimal point, as in Figure 21.17. Assume the same isototal benefits curves.
  2. The length of patents and copyrights has increased steadily since the Industrial Revolution. Explain why this may have happened, and discuss whether this could be a good or a bad thing.
  3. How should patent offices react if firms seek to cement patent monopolies by patenting improved versions of the original technology at a later date? (This is a process known as ‘evergreening’—it’s described in the Journal of Health Economics by C. Scott Hemphill and Bhaven N. Sampat.)

Question 21.7 Choose the correct answer(s)

Figure 21.13 depicts the costs and rents associated with innovation for the inventor and others.

Based on this diagram, which of the following statements is correct?

  • When there is no patent, the innovation is copied immediately.
  • At the point that the patent expires, the patent owner ‘falls off a patent cliff’ where they lose innovation rents.
  • There is zero benefit to others from the innovation during the patent period.
  • The innovator’s benefit from the patent outweighs the lost benefits to the others.
  • The diagram shows that there is a period during which innovation is not copied despite an absence of patent, allowing the innovator to recover some (or all) of the cost of innovation.
  • As shown in the figure, profits of the patent owner return to zero economic profits once the patent expires.
  • The diagram shows a small but positive benefit to others during the patent period.
  • In terms of the size of the area, the benefits lost to the others is shown to be much larger than the benefit gained by the innovator.

Question 21.8 Choose the correct answer(s)

The following diagram depicts the probability of innovation as the duration of patents is increased.

The probability of innovation as the duration of patents is increased
Fullscreen

Based on this information, which of the following statements is correct?

  • There is no innovation in the absence of patents.
  • Longer patent duration will always lead to an increase in the probability of innovation.
  • The downward sloping part of the graph demonstrates the trade-off between greater incentive to innovate from higher innovation rent income and the disincentive for potential innovators from using patented knowledge.
  • The optimal duration of patents is where the probability of innovation is maximized.
  • The positive intercept on the vertical axis implies that innovation occurs even in the absence of patents.
  • Evidently not, as the diagram shows. At some point, the longer-term patents will impede potential innovators from using already patented knowledge for further innovation.
  • True. The curve slopes downwards because these two effects work in opposite directions. At points to the right of the maximum, the disincentive to innovation because patents prevent the knowledge being used by other innovators outweighs the positive incentive effect from increased innovation rents.
  • The optimal duration of patents depends on the probability of innovation, as well as the total benefits to others from innovation if it occurs (which is decreasing in the duration of patents). Therefore the optimal duration occurs to the left of the maximum probability point.

21.8 Public funding of basic research, education, and information infrastructure

The pros and cons of various kinds of intellectual property rights are just a part of the problem of designing an effective innovation system. Another important element is the role of the government. Recall, for example, from the introduction of this unit, that in some cases the expected beneficial effects on markets from the spread of mobile phones did not materialize because necessary public infrastructure—mostly roads and means of transport—were lacking. Governmental provision of goods and services, such as the roads that would have allowed Indian farmers to benefit from their new access to price information, are essential to successful diffusion of the benefits of innovation. As we shall see, the origins of the computer, and by extension, the entire information revolution makes the essential role of government clear in the innovation process itself.

Adequate public policies concerning innovation can help in two main ways:

  • Increasing the pace of innovation: This occurs through such interventions as the support of basic research and communications infrastructure, set­ting standards, as well as the design of patents, copyright, and trademarks.
  • Influencing the direction of innovation: This tilts the process towards the production of novel ideas and applications with environmental, learning, medical, or other socially valued applications.

Government-funded research

The roots of the IT revolution can be traced to the building of the first elec­tronic programmable computers after the Second World War, although as with any technology, some elements are older. Charles Babbage first pro­posed a calculating machine called the Difference Engine, in a learned paper published in 1822 (and was funded by the British government to develop it), and his ideas helped Ada Lovelace develop the first computer program.

The British and American governments’ efforts during and after the Second World War pioneered programmable electronic computing in practice. In the US, the early focus was on supporting the development of missile systems and the Manhattan Project to develop the atomic bomb. These projects demanded huge numbers of rapid calculations in ballistics and predicting atomic reactions. US government money supported private entities such as Bell Labs in New Jersey, as well as federal research facilities like Los Alamos.

There was a close partnership between the private sector, government agencies and universities, resulting in the building of the ENIAC machine in 1946 under the auspices of the US Army. It was the first electronic computer, although it could not store programs. Other innovations followed swiftly, such as the development of the transistor by William Shockley at Bell Labs in 1948, as well as the creation of new companies such as Fairchild Semiconductor. American government support for the industry has continued through research funding, including, famously, the creation of the Internet (in 1969) in a project financed by the Defense Advance Research Projects Agency, or DARPA.

In the UK, early progress in computing was focused on the efforts at Bletchley Park, where the mathematician Alan Turing worked, to crack Germany’s Enigma code. The Colossus machine developed there remained a secret until the 1970s, but Bletchley Park scientists and engineers went on to build in 1948 the world’s first postwar stored-program computer with a memory, called Baby, at the University of Manchester, another publicly funded institution. The commercial development of computers followed swiftly, by companies such as Ferranti.

This pattern of government funding of early-stage research, either through government agencies including the military or through universities, followed by commercial applications is common. As well as the computer and electronics industries, the Internet, and the World Wide Web (created by Tim Berners-Lee at the CERN research laboratory funded by a consortium of governments), the modern pharmaceuticals and biotech sectors, and commer­cial applications of new materials, such as graphene, all have roots in publicly financed basic research and early-stage development. Touch screens and the computer mouse were also the result of US government-funded research.16

The MP3 format was created by a small group of researchers at a public research lab in Germany, belonging to the Fraunhofer Gesellschaft. Their patent allows shrinking the size of audio files by a factor of 12, while maintaining sound quality. This innovation made music sharing via the Internet possible and contributed to major upheaval in the global music industry. Commercial firms did not initially adopt it as a standard, and it became widely diffused because the creators responded by distributing encoding software to users for a low price and did not pursue hackers who then made it available for free.

Mariana Mazzucato, an economist who specializes in the causes and impacts of innovation, uses the example of some of the basic digital innovations such as the Internet, GPS and touch screens to argue that the government has an essential role in funding research and start-up technology companies. She sees the government’s role not just as filling in activities the market will not undertake, perhaps because the returns are too far in the future and uncertain, but also as shaping what kind of activities the private sector will do. In her view, strategic investment by the US government helps explain why American companies dominate high-tech industries including digital and biotechnology.

Competitions and prizes

A quite different policy for the support of innovation is to award a prize for the successful development of a solution to a problem that will meet some specifications. The prize-winner is rewarded for the cost of development, rather than with a monopoly over the novel idea or method, and the innovation then goes immediately into the public domain.

For example, in the aftermath of the Deepwater Horizon oil rig disaster, the XPrize Foundation offered $1 million to any team who could significantly improve current technology for the clean-up of oil spills. Within a year, a team had devised a method that quadrupled the industry-standard recovery rate.

A more famous example is the invention by watchmaker John Harrison of the marine chronometer, a device that for the first time allowed the (reasonably) accurate measurement of a vessel’s longitude at sea. Harrison started work on his chronometer in 1730 in response to an offer made in 1714 by the British government of a cash prize (about £2.5 million in 2014 prices) for the invention of a device to measure longitude. Harrison’s approach to the challenge was to build an accurate clock small enough to be seaborne in order that the Greenwich time at which the sun reached its zenith could be determined. This would allow the ship’s position west of Greenwich to be calculated. The problem had attracted some of the best minds of the time, including Isaac Newton’s. Harrison produced many versions, each better than the last, but argued with the government about whether he deserved the prize money. The argument arose because Harrison’s solution to the problem was rather different from that expected by the government. He was awarded a series of smaller sums over the years.

Today, the Longitude Prize is funded by the UK government. Unusually the Longitude Committee, which will award the prize, asked the general public to choose from six challenges, which they could direct the prize money towards.

The public selected the problem of resistance to antibiotics, which we noted in Unit 12, a choice that many experts would support. This is interesting because many people are skeptical that government agencies are good at picking where R&D investments should be directed, in spite of the history of rather good investment decisions in technologies during and after the Second World War.

If you believe that the general public are better than governments at identifying pressing problems, then the Longitude Committee have solved this problem neatly by letting us choose.

Another example of where competitions work well is the creation of prizes for the successful development of drugs for neglected diseases. These drugs treat illnesses that are common in parts of the world in which there is little pharmaceutical innovation because the private market for them is limited by the low incomes of those afflicted with the diseases.17

Exercise 21.14 Government-funded research

  1. What are the arguments for and against direct government investment in the commercial application of new technologies?
  2. Describe ways in which governments could pick technologies in which to invest, so that the process would be more transparent to taxpayers.
  3. Do you think it would be sensible to involve taxpayers in the choices about which technologies in which to invest? Explain your answer.
  4. Which kind of technologies do you think governments should spend more on and which technologies should governments leave to the private sector? Explain your answer.

Question 21.9 Choose the correct answer(s)

Which of the following policies promote efficient innovation processes?

  • prizes for successful innovation to resolve the coordination problem in the innovation of substitute goods
  • subsidizing the supply of inputs to innovation, such as public infrastructure, research and education, to alleviate the coordination problem of complementary innovations
  • establishing a patent system to address high first copy costs of knowledge-intensive innovations
  • promoting low-cost dissemination of information
  • Here the problem is the ‘winner-take-all’ nature of the competitive market of substitute goods (such as Sony’s Betamax video tape standard vs JVC’s VHS). The public sector agencies should instead promote agreement among the participants in the industry, for example deciding on new technical standards.
  • In this situation, firms may be reluctant to invest in one innovation unless other firms are investing in the complementary innovation. Subsidizing inputs lowers the costs of innovation and increases the likelihood that firms will invest.
  • A patent allows the inventor to temporarily profit from his idea, which would make inventors more willing to pay the high up-front costs of innovation.
  • The public goods nature of the knowledge produced means that the innovating firms cannot capture all of the benefits their innovations generate. A patent system would alleviate this problem.

Question 21.10 Choose the correct answer(s)

Which of the following statements are correct regarding public policies for innovation?

  • The government should not invest in innovations whose returns are too far in the future and uncertain.
  • By taking equity stakes in innovation companies, the government would enhance its ability to enforce competition policy.
  • The government could support innovation by setting up a scheme that awards a prize for successful development of a solution to a specific problem.
  • The government can fund early-stage research, through government agencies such as the military or universities, that can then be used for commercial applications.
  • These are exactly the kind of innovations that the government should be investing in, instead of relying on firms.
  • The government may find it more difficult to regulate firms in which it has an equity stake, such as enforcing competition policy or environment standards.
  • Examples are in the text.
  • Examples are in the text.

21.9 Conclusion

The UK and the Netherlands, birthplaces of capitalism and the Industrial Revolution, were not unique in the intelligence and creativity of their peoples. China, arguably, had proven to be an equally, if not more inventive society, in earlier years having first developed paper, printing, gunpowder, the compass, and literally hundreds of other important innovations. Other countries, notably Japan, were adept at the adaptation and spread of novel methods and ideas. But the combined pull of innovation rents and the push of competition to survive that was characteristic of the innovation and diffusion process under capitalism made it a uniquely dynamic economic system that transformed the British and Dutch economies.

Public policy also played an important part. For innovators to take the risk of introducing a new product or production process, it is crucial that their innovation rents not be seized by the government or others. This requires that property rights be protected by a well-functioning legal system as was the case in the UK, the Netherlands, and other countries that experienced the kink in the hockey stick of per capita income early.

More recently, Silicon Valley, the German innovation system, and other successful examples of innovation have been assisted by governments that provide complementary inputs such as physical infrastructure, basic research and public education, guaranteed markets (like those for military goods), and allow the innovator only a temporary monopoly so that competition eventually will reduce prices.18

In a nutshell, it is this combination of private incentives and supportive public policy that explains why capitalism can be such a dynamic economic system. Among the consequences in many countries are the increased living standards as measured by income per capita (documented in Unit 1), as well as the reduction in working hours seen in Unit 3.

But remember that Joseph Schumpeter, the economist who contributed most to our current understanding of innovation (and who you encountered in Unit 16) called the process of technological change ‘creative destruction’.

In this unit, we have stressed the creative part: the development of new processes and products that allow us to produce our livelihoods with pro­gressively less time at work. But in Unit 16 we studied the ways in which the process of technological change also puts people out of work and devalues once respected and well-paid skills. And in Unit 20, you saw that the expansion of production and the substitution of fossil-fuel-based energy for human and other animal energy made possible by technological change has posed challenges to our environment, even as improved technologies hold out the hope that under the right policies, these challenges may be addressed.

Economists can help to design these policies and to evaluate the benefits and costs of ways of promoting beneficial innovations and also addressing the ‘destructive’ aspect of new technologies.

Concepts introduced in Unit 21

Before you move on, review these definitions:

21.10 References

Consult CORE’s Fact checker for a detailed list of sources.

  • Boldrin, Michele, and David K. Levine. 2008. Against Intellectual Monopoly. New York, NY: Cambridge University Press.
  • Boseley, Sarah. 2016. ‘Big Pharma’s Worst Nightmare’. The Guardian, Updated 5 February 2016.
  • DiMasi, Joseph A., Ronald W. Hansen, and Henry G. Grabowski. 2003. ‘The Price of Innovation: New Estimates of Drug Development Costs’. Journal of Health Economics 22 (2): pp. 151–85.
  • Edsall, Thomas B. 2016. ‘Boom or Gloom?’New York Times. Updated 27 January 2016.
  • Engel, Jerome S. 2015. ‘Global Clusters of Innovation: Lessons from Silicon Valley.’  California Management Review  57 (2): pp. 36–65. University of California Press.
  • Gordon, Robert J. 2016. The Rise and Fall of American Growth: The US Standard of Living since the Civil War. Princeton, NJ: Princeton University Press.
  • Hall, Peter A., and David Soskice. 2001.  Varieties of Capitalism: The Institutional Foundations of Comparative Advantage. New York, NY: Oxford University Press.
  • Hemphill, C. Scott, and Bhaven N. Sampat. 2012. ‘Evergreening, Patent Challenges, and Effective Market Life in Pharmaceuticals’. Journal of Health Economics  31 (2): pp. 327–39.
  • Janeway, William H. 2012. Doing Capitalism in the Innovation Economy: Markets, Speculation and the State. Cambridge: Cambridge University Press.
  • Jensen, Robert. 2007. ‘The Digital Provide: Information (Technology), Market Performance, and Welfare in the South Indian Fisheries Sector.’ The Quarterly Journal of Economics 122 (3): pp. 879–924.
  • Kornai, János. 2013. Dynamism, Rivalry, and the Surplus Economy: Two Essays on the Nature of Capitalism. Oxford: Oxford University Press.
  • Koromvokis, Lee. 2016. ‘Are the Best Days of the US Economy Over?’. PBS NewsHour. 28 January 2016.
  • Kremer, Michael, and Rachel Glennerster. 2004. Strong Medicine: Creating Incentives for Pharmaceutical Research on Neglected Diseases. Princeton, NJ: Princeton University Press.
  • Landes, David S. 2000. Revolution in Time. Cambridge, MA: Harvard University Press.
  • Mazzucato, Mariana. 2013. ‘Government – investor, risk-taker, innovator’.
  • Moser, Petra. 2013. ‘Patents and Innovation: Evidence from Economic History’. Journal of Economic Perspectives 27 (1): pp. 23–44.
  • Moser, Petra. 2015. ‘Intellectual Property Rights and Artistic Creativity’. Voxeu.org. Updated 4 November 2015.
  • Mowery, David C., and Timothy Simcoe. 2002. ‘Is the Internet a US Invention?—an Economic and Technological History of Computer Networking’. Research Policy  31 (8–9): pp. 1369–87.
  • Roth, Alvin. 1996. ‘Matching (Two-Sided Matching)’. Stanford University.
  • Rysman, Marc. 2009. ‘The Economics of Two-Sided Markets’. Journal of Economic Perspectives 23 (3): pp. 125–43.
  • Saxenian, AnnaLee. 1996. Regional Advantage: Culture and Competition in Silicon Valley and Route 128. Cambridge, MA: Harvard University Press.
  • Swarns, Rachel L. 2001. ‘Drug Makers Drop South Africa Suit over AIDS Medicine’. New York Times. Updated 20 April 2001.
  • The Economist. 2007. ‘To Do with the Price of Fish’. Updated 10 May 2007.
  • Witt, Stephen. 2015. How Music Got Free: The End of an Industry, the Turn of the Century, and the Patient Zero of Piracy. New York, NY: Viking.
  1. Swarns Rachel L. 2001. ‘Drug Makers Drop South Africa Suit over AIDS Medicine’. New York Times. Updated 20 April 2001. 

  2. Sarah Boseley. 2016. ‘Big Pharma’s Worst Nightmare’. The Guardian, Updated 5 February 2016. 

  3. ‘To Do with the Price of Fish’. The Economist. Updated 10 May 2007. 

  4. Robert Jensen. 2007. ‘The Digital Provide: Information (Technology), Market Performance, and Welfare in the South Indian Fisheries Sector’. The Quarterly Journal of Economics 122 (3): pp. 879–924. 

  5. Peter A. Hall, and David Soskice. 2001. Varieties of Capitalism: The Institutional Foundations of Comparative Advantage. New York, NY: Oxford University Press. 

  6. Stephen Witt. 2015. How Music Got Free: The End of an Industry, the Turn of the Century, and the Patient Zero of Piracy. New York, NY: Viking. 

  7. David C. Mowery and Timothy Simcoe. 2002. ‘Is the Internet a US Invention? An Economic and Technological History of Computer Networking’.  Research Policy 31 (8–9): pp. 1369–87. 

  8. Jerome S. Engel. 2015. ‘Global Clusters of Innovation: Lessons from Silicon Valley’. California Management Review  57 (2). University of California Press: pp. 36–65. 

  9. AnnaLee Saxenian. 1996. Regional Advantage: Culture and Competition in Silicon Valley and Route 128. Cambridge, MA: Harvard University Press. 

  10. Michele Boldrin and David K. Levine. 2008. Against Intellectual Monopoly. New York, NY: Cambridge University Press. 

  11. János Kornai. 2013. Dynamism, Rivalry, and the Surplus Economy: Two Essays on the Nature of Capitalism. Oxford: Oxford University Press. 

  12. Marc Rysman. 2009. ‘The Economics of Two-Sided Markets’. Journal of Economic Perspectives 23 (3): pp. 125–43.  2

  13. Alvin Roth. 1996. ‘Matching (Two-Sided Matching)’. Stanford University. 

  14. Petra Moser. 2013. ‘Patents and Innovation: Evidence from Economic History’. Journal of Economic Perspectives  27 (1): pp. 23–44. 

  15. Petra Moser. 2015. ‘Intellectual Property Rights and Artistic Creativity’. VoxEU.org. Updated November 4 2015. 

  16. William H. Janeway. 2012. Doing Capitalism in the Innovation Economy: Markets, Speculation and the State. Cambridge: Cambridge University Press. 

  17. Michael Kremer and Rachel Glennerster. 2004. Strong Medicine: Creating Incentives for Pharmaceutical Research on Neglected Diseases. Princeton, NJ: Princeton University Press. 

  18. David S. Landes. 2000. Revolution in Time. Cambridge, MA: Harvard University Press.