Internalization theory is an economic framework in international business, originated by Peter J. Buckley and Mark Casson in 1976, that explains the emergence and operations of multinational enterprises (MNEs) as mechanisms for internalizing markets for intermediate products—especially intangible, knowledge-intensive assets—to overcome failures in external markets caused by high transaction costs, opportunism risks, and imperfect intellectual property enforcement.[1] The theory posits that firms opt for foreign direct investment (FDI) over alternatives like licensing or exporting when the benefits of hierarchical control exceed the costs of market transactions, particularly in sectors where proprietary knowledge is vulnerable to dissipation or underpricing in arm's-length exchanges.[1] Drawing from Ronald Coase's transaction cost economics, it emphasizes causal mechanisms rooted in governance efficiency, where MNEs coordinate international production through internal managerial hierarchies rather than fragmented external contracts.[2]Developed amid 1970s observations of FDI concentration in knowledge-driven industries and bilateral investment flows between developed economies, the theory provides a parsimonious, generalizable explanation for MNE growth without relying on nation-specific or industry-unique factors alone.[1] Key propositions include the superiority of internalization for safeguarding firm-specific advantages, its contingency on environmental factors like weak property rights, and its applicability to diverse MNE strategies such as market-seeking or efficiency-seeking expansions.[3] Over time, extensions have incorporated entrepreneurial coordination, industry-level dynamics, and integrations with global value chains, enhancing its relevance to contemporary phenomena like emerging market MNEs, though debates persist on its static assumptions and empirical testability amid dynamic capabilities.[1][4]Internalization theory remains a cornerstone of international business scholarship, underpinning analyses of FDI patterns and firm boundaries while privileging transaction-level causal realism over ad hoc explanations.[5]
Historical Development
Foundational Ideas and Early Influences
The foundational ideas of internalization theory trace back to Ronald Coase's 1937 article "The Nature of the Firm," which explained firm emergence as a mechanism to mitigate transaction costs—such as discovery, negotiation, and enforcement expenses—that arise in market-mediated exchanges. Coase reasoned that absent these frictions, economic coordination would occur solely via prices, but in reality, firms internalize activities to avoid repeated market transactions, expanding until internal organizational costs equate with external market alternatives. This framework established a causal basis for delineating firm boundaries through cost comparisons, influencing subsequent analyses of why certain exchanges occur within hierarchies rather than markets.[6][7]Stephen Hymer's 1960 dissertation, "The International Operations of National Firms: A Study of Direct Foreign Investment," extended these principles to international contexts by rejecting neoclassical explanations of FDI as mere portfolio flows under perfect competition. Hymer asserted that FDI requires persistent market imperfections, enabling firms to deploy control-intensive advantages—like proprietary technology or managerial expertise—directly abroad, as arm's-length mechanisms such as licensing would erode these edges through diffusion or opportunism. His analysis shifted focus from factor endowments to industrial organization dynamics, positing that multinationals exploit structural barriers to entry and monopoly rents via ownership and control, rather than risk dissipation in fragmented markets.[8][9]Post-World War II economic recovery amplified these theoretical tensions, as U.S. direct investment in Europe surged from negligible levels in the 1940s to over $7 billion by 1960, defying predictions from trade models that emphasized exporting or passive investment. Economists observed that multinational firms, confronting imperfect markets for intangible assets like knowledge, increasingly favored internal governance to preserve competitive edges against hazards such as partner hold-up or unintended knowledge leakage. This pattern of hierarchical expansion across borders, documented in rising FDI stocks amid stable trade volumes, highlighted the inadequacy of arm's-length alternatives and prompted causal inquiries into why firms internalized cross-border flows of intermediates, predating formalized internalization models.[10][11]
Buckley and Casson Formulation (1976)
In their 1976 book The Future of the Multinational Enterprise, Peter J. Buckley and Mark Casson formalized internalization theory as a framework explaining the existence and growth of multinational enterprises (MNEs) through the preference for internal markets over imperfect external ones for intermediate products.[12] The core proposition posits that firms internalize the production and transfer of intermediates—particularly intangible assets such as knowledge, technology, and proprietary information—when external transaction costs exceed the benefits of arm's-length exchanges, thereby reducing risks like opportunism, contract enforcement failures, and unintended knowledge dissipation.[13] This approach frames foreign direct investment (FDI) as a mechanism to exploit firm-specific advantages internally across borders, rather than relying on exporting or licensing, which expose firms to market failures in valuing and protecting such assets.[14]Buckley and Casson distinguished between final product markets, which they characterized as generally competitive and efficient due to observable prices and standardized goods, and intermediate product markets, which suffer from inherent imperfections such as uncertainty in assessing the quality and applicability of R&D outputs, asymmetric information between buyers and sellers, and the public-good nature of knowledge that incentivizes free-riding or imitation.[3] In these imperfect intermediate markets, externalization via licensing or subcontracting incurs high monitoring costs and risks of intellectual property theft, prompting firms to integrate vertically through FDI to safeguard and efficiently deploy their core competencies.[15] This causal logic underscores internalization as a rational response to transaction-specific frictions, where the MNE emerges as an organizational form minimizing these costs while optimizing location-specific opportunities.[13]The initial model developed by Buckley and Casson predicted greater FDI prevalence in industries characterized by high R&D intensity, where firm-specific advantages in innovation are pronounced and difficult to commoditize externally.[16] Empirical proxies for such advantages included expenditures on research and development as a percentage of sales, correlating positively with multinationality due to the need for internalized knowledge flows to maintain competitive edges in oligopolistic sectors.[17] Their framework thus anticipated that MNEs would concentrate in knowledge-intensive fields like pharmaceuticals and electronics, where external markets fail to capture the full value of proprietary technologies, contrasting with low-tech industries favoring arm's-length trade.[18] This formulation laid the groundwork for viewing the MNE as a governance structure prioritizing internal hierarchies for efficiency in cross-border intermediate exchanges.[15]
Evolution Through the 1980s and 1990s
In 1981, Alan Rugman advanced internalization theory through his book Inside the Multinationals: The Economics of Internal Markets, building on Buckley and Casson's foundational model by stressing the centrality of knowledge-based firm-specific advantages in fostering hierarchical control within multinational enterprises (MNEs).[19] Rugman posited that MNEs internalize transactions to preserve the value of intangible assets, such as proprietary technology and managerial expertise, which are prone to leakage in external markets like licensing. This extension emphasized how internal markets enable efficient coordination across borders, reducing opportunism and information asymmetries that plague intermediate product exchanges.[20]Concurrently, John H. Dunning incorporated internalization as the "I" component in his 1980 eclectic paradigm (OLI framework), arguing that FDI arises when firms leverage ownership-specific advantages (e.g., proprietary capabilities), location-bound opportunities, and gains from internalizing cross-border intermediate markets over alternatives like exporting or arm's-length contracts.[21] Dunning's synthesis positioned internalization not as a standalone explanation but as complementary to ownership and location factors, enabling predictions about the extent and pattern of international production; for instance, empirical tests in his work showed that internalization advantages correlate with higher FDI propensity in knowledge-intensive industries.[22]Jean-François Hennart's 1982 A Theory of the Multinational Enterprise further matured the theory by delineating how MNEs achieve superior efficiency through vertical integration of intermediate goods markets, where external contracting fails due to hold-up risks and specificity of assets.[23] Hennart distinguished pure internalization (e.g., wholly owned subsidiaries) from hybrid forms, providing a nuanced view of governance choices. Initial empirical validations in the 1980s included Rugman's tests linking R&D expenditures—a proxy for knowledge assets—to multinationality, revealing that firms with higher R&D intensity exhibit greater internalization via FDI over licensing.[24]By the 1990s, research increasingly addressed dynamic internalization processes, where persistent or emerging market imperfections—such as evolving intellectual property enforcement or technological spillovers—prompt sequential FDI waves, as MNEs progressively internalize additional value-chain stages to safeguard advantages.[10] This literature highlighted causal mechanisms like feedback loops between initial internalization successes and subsequent investments, explaining patterns in industry-specific FDI surges during periods of rapid innovation.[25] Such developments underscored internalization theory's robustness in accounting for temporal shifts in MNE boundaries without relying on static assumptions.[11]
Core Concepts and Mechanisms
Imperfections in Intermediate Product Markets
Internalization theory identifies imperfections in markets for intermediate products as the core driver compelling firms to internalize transactions, particularly across borders, to bypass inefficient external exchanges. Intermediate products consist of flows linking production stages, encompassing both tangible components like semi-processed inputs and intangible elements such as proprietary knowledge and managerial coordination services.[3][26] These markets fail to function effectively due to inherent structural flaws that elevate transaction costs beyond those achievable through internal hierarchies.Knowledge flows represent a primary type of intermediate product prone to market failure, as tacit, firm-specific information resists precise valuation and contractual specification. Asymmetric information pervades these exchanges, with originators holding superior insight into the knowledge's scope, durability, and potential applications, while recipients face incentives to underbid or misrepresent their capabilities.[26]Opportunism further compounds inefficiencies, as external partners may exploit incomplete contracts through hold-up tactics—such as post-transfer renegotiation, unauthorized dissemination, or suboptimal utilization—thereby diluting returns and inducing underinvestment in research and development when licensing or spot markets are pursued.[3][4]Managerial coordination services, involving oversight and integration across activities, encounter analogous imperfections, including unreliable enforcement of service-level agreements and risks of shirking in decentralized arrangements. External markets thus exhibit thinness or outright absence for these intermediates, as bounded rationality limits comprehensive contracting amid uncertainty and self-interest.[26]Internalization addresses these by substituting market prices with hierarchical authority, which curtails agency costs through monitoring, incentive alignment, and residual control rights, enabling sustained flows without dissipation or expropriation.[3][4]
Internalization as a Response to Knowledge Flows
Internalization theory posits that firms generate competitive advantages through investments in research and development, producing firm-specific knowledge that is often tacit in nature, meaning it is difficult to codify and transfer via contracts without significant risk of dissipation or unauthorized appropriation.[3] This knowledge, encompassing skills, processes, and heuristics not fully captured in patents or explicit instructions, requires direct oversight to ensure effective cross-border application, as external markets for such intermediates suffer from high transaction costs due to information asymmetries and moral hazard.[27] Firms thus prefer internalization—establishing wholly owned subsidiaries via foreign direct investment (FDI)—to retain hierarchical control, enabling the supervised replication of tacit elements that licensing or arm's-length arrangements cannot reliably safeguard.[3]The causal mechanism begins with R&D yielding proprietary advantages, but externalization exposes these to piracy, where licensees may underinvest in complementary efforts or divert knowledge to competitors, eroding returns; alternatively, suboptimal use arises from the licensee's inability to fully absorb or apply the tacit components without ongoing firm involvement.[28] Internalization mitigates this by substituting market governance with internal organization, where equity ownership facilitates monitoring, incentive alignment, and iterative knowledge flows through personnel transfers and integrated operations, preserving the advantage's value across borders.[11] This approach is particularly pronounced in knowledge-intensive sectors, where the public good characteristics of knowledge amplify leakage risks in fragmented external transactions.[29]In the pharmaceutical industry, for instance, firms internalize production and distribution of complex drugs where tacit knowledge in formulation scaling and quality control cannot be adequately conveyed through licensing, as evidenced by preferences for FDI over technology transfer agreements that risk diluting proprietary processes amid weak intellectual property enforcement abroad.[30] Similarly, in technology sectors like software and electronics, internalization via subsidiaries protects tacit design interfaces and integration know-how from misappropriation, avoiding the inefficiencies of licensing where recipients fail to replicate full functionality without embedded firm expertise.[31] These patterns underscore how internalization sustains returns on intangible assets by countering the inherent vulnerabilities of knowledge markets.[32]
Comparison with Exporting and Licensing Alternatives
Internalization theory posits that exporting final products is viable for standardized goods with low transport costs relative to value, as it avoids the need for foreign production facilities while leveraging economies of scale in the home market. However, exporting becomes inefficient for firm-specific advantages involving tacit knowledge or customized intermediates, where high transportation costs, trade barriers, and the difficulty of adapting products to local conditions erode competitive rents. In such cases, local production via FDI enables better integration of intermediate markets, reducing adaptation expenses and protecting knowledge flows that cannot be effectively "exported" without internalization.[28][33]Licensing, by contrast, facilitates rapid market entry with minimal capital commitment from the licensor, relying on royalties to capture value from technology transfer. Yet, this mode is undermined by agency problems, including licensee opportunism—such as underreporting sales or reverse-engineering to avoid ongoing payments—and the resulting dissipation of the licensor's monopoly rents as licensees compete against the originator. Buckley and Casson formalize this in their analysis of weak intellectual property regimes, where post-licensing imitation leads to rent erosion, making licensing's net present value inferior to FDI, particularly for knowledge-intensive assets. Empirical patterns in high-technology sectors reinforce this, with multinational enterprises favoring FDI to retain control and maximize returns over licensing's fragmented revenue streams.[11][34]The choice of FDI over these alternatives hinges on a threshold condition: internalization proceeds when the efficiency gains from internal governance—such as reduced transaction hazards and full rent appropriation—outweigh the higher setup and operational costs of foreign subsidiaries. This decision is efficiency-driven, prioritizing superior returns from protecting imperfect intermediate markets rather than motives of market power or dominance. Exporting and licensing serve as lower-commitment options in low-hazard environments or for non-proprietary advantages, but fail to deliver comparable value where market failures in knowledgeexchange are pronounced.[28][35]
Theoretical Foundations and Relations
Links to Transaction Cost Economics
Internalization theory and transaction cost economics (TCE) both originate from Ronald Coase's 1937 insight that firms emerge to economize on the costs of market transactions, favoring internal hierarchies when external markets incur high frictions.[36] In TCE, as formalized by Oliver Williamson in works such as Markets and Hierarchies (1975), firms choose governance structures to mitigate opportunism arising from asset specificity, uncertainty, and bounded rationality, preferring vertical integration or other hierarchical forms over arm's-length contracts when these factors elevate transaction costs relative to internal production costs.[37] Internalization theory adopts a parallel economizing logic, positing that multinational enterprises (MNEs) internalize cross-border activities to reduce costs associated with imperfect intermediate markets, particularly those involving intangible assets like knowledge.[3]A core synergy lies in their mutual emphasis on hierarchies as superior to markets under conditions of high asset specificity and uncertainty; for instance, both frameworks predict that specialized investments vulnerable to hold-up problems—such as those in R&D or customized production—prompt internalization to safeguard returns and ensure efficient coordination.[38] However, internalization theory distinguishes itself by focusing on international imperfections in markets for knowledge flows, where exporting or licensing alternatives falter due to difficulties in monitoring and appropriating proprietary technology across borders, thus driving foreign direct investment (FDI) to link R&D and production stages internally.[39] In contrast, TCE more broadly addresses governance between successive production stages, often in domestic or non-knowledge-intensive contexts, without the explicit international dimension or emphasis on intangible asset transfer that internalization prioritizes.[36]Empirical studies from the 1980s, including firm-level analyses of MNE entry modes, demonstrate overlap in predictive power, with bounded rationality and specificity assumptions from both theories explaining patterns of vertical integration in international operations; for example, data on U.S. multinationals showed that high uncertainty in host markets correlated with greater use of wholly owned subsidiaries over joint ventures, aligning TCE's governance predictions with internalization's FDI rationale.[37][40] These findings underscore how internalization extends TCE's domestic logic to global settings, where cross-border knowledge transactions amplify the need for hierarchical control.[41]
Integration with Ownership-Location-Internalization (OLI) Paradigm
The Ownership-Location-Internalization (OLI) paradigm posits that foreign direct investment (FDI) arises when firms possess three simultaneous advantages: ownership-specific (O) advantages, such as proprietary knowledge or managerial capabilities that confer competitiveness over indigenous firms; location-specific (L) advantages, including host-country attributes like resource endowments, infrastructure, or market access that favor production abroad; and internalization-specific (I) advantages, which render intra-firm transactions superior to arm's-length market exchanges for exploiting O advantages in L contexts.[42] These elements collectively determine not only the propensity for international production but also the preference for FDI over alternatives like exporting or licensing.[3]Internalization theory complements the OLI framework by furnishing the micro-foundations for I advantages, emphasizing that imperfections in external markets—particularly for intangible, knowledge-intensive assets—incur high transaction costs from risks like dissipation, moral hazard, or hold-up, thereby incentivizing firms to internalize control hierarchies.[43] Given O advantages motivating outward expansion and L factors selecting host sites, internalization theory rigorously explains the modal choice of FDI as the efficient governance structure for cross-border asset transfer, conditional on those prerequisites, rather than treating mode selection as residual or ad hoc.[3] This conditional logic preserves the distinct roles of O and L in initiating international commitment while grounding I in verifiable market failures, enhancing the paradigm's predictive coherence without subsumption.[44]Dunning's 1988 restatement of the eclectic paradigm explicitly incorporated Buckley and Casson's internalization theory as the basis for I, crediting it for elucidating why multinational enterprises favor integrated networks over fragmented contracts to safeguard and deploy O advantages amid L opportunities.[42][10] This synthesis underscores internalization's operational role in actualizing FDI, as internal markets minimize dissipative losses in knowledge flows—evident in empirical patterns where high R&D-intensive industries exhibit elevated FDI shares over licensing—while OLI's broader envelope accommodates diverse FDI motives without diluting the causal primacy of governanceefficiency in mode decisions.[18]
Firm Boundaries and Multinational Enterprise Structure
In internalization theory, firm boundaries are established at the point where the advantages of internalizing imperfect external markets for intermediate products—particularly proprietary knowledge and technology—exceed the costs of hierarchical coordination and bureaucratic inefficiencies. These advantages arise from avoiding transaction-specific risks such as opportunism, information asymmetry, and dissipation of intangible assets that plague arm's-length exchanges, while costs include monitoring, agency problems, and reduced flexibility in diversified operations. For multinational enterprises (MNEs), this delineation is critical due to heightened cross-border market failures, including weak intellectual property enforcement and cultural distances, which favor internal governance over exporting or licensing to define optimal scope.[1]Vertical integration decisions within MNEs prioritize internalization when upstream activities like research and development generate knowledge that must flow securely to downstream production without hold-up risks in external markets, ensuring innovations are exploited without leakage or misappropriation. Horizontal integration, conversely, internalizes the replication of firm-specific capabilities across host countries, bypassing licensing contracts that could erode control over proprietary processes amid varying local conditions. Both forms reflect a profit-maximizing calculus refined to shareholder value, where environmental contingencies such as market size and rivalry influence the tradeoff between integration benefits and the dis-economies of scale in sprawling internal structures.[45]MNEs manifest as networked hierarchies that internalize strategic resource flows—knowledge, capital, and managerial directives—across subsidiaries, with headquarters serving as a central intelligence to orchestrate global operations and mitigate sub-unit opportunism through equity stakes that align incentives with firm-wide goals. Equity control is essential for governing knowledge-intensive interactions, as partial ownership risks free-riding or withholding by subsidiaries, particularly in jurisdictions with lax contractenforcement, thereby preserving the integrity of internalized advantages. This structure inherently balances centralized oversight for core competencies against devolved autonomy in peripheral activities, embodying the theory's emphasis on adaptive governance to sustain competitive edges in diversified, border-spanning enterprises.[1]
Empirical Evidence and Validation
Key Empirical Studies and Datasets
One of the earliest empirical validations inspired by Buckley and Casson (1976) involved analyses of UK multinational enterprises using data from the 1980s, where higher R&D intensity—measured as R&D expenditures relative to sales—was associated with a greater propensity for FDI rather than licensing, as firms sought to protect proprietary knowledge from dissipation in external markets.[46] Similar patterns emerged in U.S. manufacturing data from the 1980 Census of Population, which linked elevated R&D intensity (proxied by the proportion of scientists and engineers in the workforce) in knowledge-intensive industries to higher FDI inflows, demonstrating the theory's explanatory power for internalizing intangible assets over exporting or contractual arrangements.[47]Cross-national evidence from the 1990s onward, drawing on UNCTAD's FDI datasets and sectoral breakdowns, confirmed internalization dynamics in knowledge-intensive sectors such as electronics, where FDI shares exceeded those in low-knowledge sectors by factors of 2-3 times, reflecting the causal role of imperfect markets for intermediate products like technology transfers.[48] These datasets, covering over 150 countries and spanning annual inflows from 1990 to 2000, highlighted that firms in high-R&D industries (e.g., semiconductors) favored wholly-owned subsidiaries to safeguard firm-specific advantages, with regression analyses showing R&D/sales ratios as significant predictors of FDI intensity.[49]Firm-level panel regressions have further tested the theory's predictive power, often using dynamic models to control for endogeneity via instruments such as lagged patent stocks, which proxy for accumulated knowledge assets. For instance, studies of emerging-market multinationals from 2000-2015 found that higher R&D intensity positively moderates the relationship between FDI expansion and performance gains from internalized orchestration of capabilities, with patent-intensive firms exhibiting 15-20% higher returns on foreign assets compared to low-patent peers.[50] These approaches, applied to datasets like Orbis and Compustat, reveal causal links wherein knowledge internalization drives FDI decisions, robust to controls for firm size, host-country factors, and time-varying shocks.[51]
Tests of Predictive Power in FDI Decisions
Firm-level empirical studies have validated the predictive power of internalization theory in distinguishing equity-based FDI from contractual alternatives, particularly when firms possess knowledge-intensive assets vulnerable to opportunistic behavior. Analysis of U.S. multinationals' foreign expansion modes demonstrated that gains from FDI exceed those from non-FDI options when internal markets reduce transaction costs associated with proprietarytechnology transfers, with firm-specific R&D intensity serving as a key predictor of internalization via wholly-owned subsidiaries.[52] Similarly, transaction cost proxies, including asset specificity and uncertainty in host markets, outperform capability-based explanations in predicting full-ownership entry modes over joint ventures or licensing in samples of European and U.S. firms entering developing economies during the 1990s.[53]In high-technology sectors, internalization theory accurately forecasts a preference for equity FDI to safeguard intangible assets, as evidenced by firm-level regressions showing that higher parent-subsidiary R&D expenditure ratios correlate with reduced reliance on external licensing and increased use of internal hierarchies for knowledge flows. Tests using data from manufacturing affiliates confirm that relative R&D investments explain variations in mode choice, with firms exhibiting above-average knowledge intensity internalizing operations in over two-thirds of cases to avoid dissipation risks.[54] This pattern holds across industries, where firm-specific advantages in technology prompt internalization even beyond high-tech domains, affirming the theory's efficiency-driven predictions.[55]Augmented gravity models of bilateral FDI further support comparative statics from internalization theory, incorporating variables for market opacity and institutional quality to show that greater informational asymmetries elevate FDI probabilities relative to arm's-length trade. In emerging markets, where weak intellectual propertyenforcement heightens external transaction hazards, knowledge-intensive multinationals disproportionately select wholly-owned equity modes, with empirical patterns indicating amplified internalization as IP regimes weaken, consistent with theory's emphasis on control for efficiency.[56] Firm surveys and affiliate data from transition and developing economies reinforce this, linking low enforcement environments to higher equity commitments by technology owners.[57]
Challenges in Measurement and Causality
Empirical tests of internalization theory frequently employ proxies such as research and development (R&D) intensity—measured as R&D expenditures relative to sales—or patent counts to approximate the value of firm-specific knowledge assets that motivate internalization over external transactions.[17] These metrics capture codified elements of intellectual property but systematically underrepresent tacit knowledge, which internalization theory posits as particularly vulnerable to opportunistic dissipation in markets due to its non-transferable nature, thereby introducing measurement error that attenuates estimated effects on foreign direct investment (FDI) decisions.[58]Establishing causality faces obstacles from unobserved firm heterogeneity, including idiosyncratic managerial capabilities or proprietary processes not reflected in standard datasets, which correlate with both knowledge asset accumulation and the propensity to internalize, leading to endogeneity bias in regressions linking asset specificity to FDI entry modes.[59] Standard FDI flow data exacerbate this by recording only parent-subsidiary transfers, omitting reinvested earnings or intra-firm reallocations that constitute much of sustained internalization activity.[59]Datasets drawn from sources like the Bureau van Dijk Orbis or national FDI registries exhibit selection bias, as they disproportionately include surviving multinational enterprises (MNEs), overrepresenting instances where internalization succeeded in generating superior performance while undercounting failures or aborted attempts, which inflates apparent predictive power. To mitigate this, researchers apply hazard rate models that incorporate entry and exit dynamics, estimating survival probabilities conditional on internalization choices to correct for truncation.Advancements in causal inference include instrumental variable (IV) strategies leveraging exogenous policy shocks, such as bilateral investment treaty ratifications or host-country regulatory liberalizations, which provide quasi-random variation in the costs of externalization without directly affecting firm-level knowledge assets.[60] For example, studies exploiting tax policy reforms as instruments have isolated internalization effects on FDI inflows, yielding more credible estimates of how knowledge intensity drives equity-based entry over arm's-length alternatives.[60]
Refinements and Extensions
Industry-Level Applications
Internalization theory, rooted in Coasean analysis of firm boundaries, extends beyond individual firms to explain aggregate industry dynamics, where competitive pressures favor firms that internalize intermediate markets to minimize transaction costs in knowledge-intensive sectors. At the industry level, this manifests in evolutionary processes where fragmented markets, characterized by high uncertainty in proprietary knowledge transfer, select for internalizing strategies over arm's-length licensing or exporting. Firms succeeding in internalization gain advantages in coordinating global value chains, leading to market consolidation as non-internalizers face higher costs or opportunistic risks, resulting in mergers, acquisitions, or exits. This dynamic aligns with historical patterns in industries like pharmaceuticals, where high sunk costs in R&D and trust deficits in licensing propelled waves of M&A activity, reducing the number of independent players and fostering oligopolistic structures.[61]Empirical evidence from sectoral FDI data supports this, showing correlations between internalization advantages and concentrated industry structures. In the pharmaceutical sector, for instance, internalization of intangible assets like patents and formulations correlated with defensive FDI and M&A surges, as firms bypassed licensing booms due to elevated transaction hazards from intellectual property leakage and asymmetric information; notable examples include the 2000 merger forming GlaxoSmithKline, which consolidated capabilities amid rising R&D costs exceeding $1 billion per drug by the early 2000s. Similarly, in automotive components, modular design innovations from the 1980s onward—coupled with tariff reductions post-1945—drove global production shifts via FDI, favoring internalizers who integrated supply chains to exploit economies of scale, leading to supplier consolidation from thousands of fragmented players to a handful of dominant multinationals by the 2010s. Buckley and Hashai (2016) outline an agenda for this extension, arguing that industry recipes—combinations of firm-specific and sector-wide internalization—explain such waves over sporadic licensing expansions.[61][62]These patterns underscore how internalization at the industry level predicts transitions from competitive fragmentation to oligopoly, particularly in sectors with immobile, knowledge-based assets, as global competition filters technologies based on profitability metrics rather than inherent superiority. Sectoral analyses reveal that FDI inflows concentrate in internalizing industries, with oligopolistic concentration ratios rising as MNEs preempt rivals through hierarchical control, evidenced by post-1990s data showing pharmaceuticals' Herfindahl-Hirschman Index climbing amid FDI dominance over licensing deals. This aggregation challenges purely firm-centric views, highlighting systemic selection where industry evolution reinforces internalization as a dominant governance mode.[61]
Adaptations for Services and Non-Knowledge Assets
Internalization theory has been adapted to service industries, where firms internalize foreign direct investment (FDI) to coordinate relational assets such as customer relationships and distribution networks, which are prone to dissipation through external markets. In banking, multinational enterprises (MNEs) prefer FDI over arm's-length contracts to safeguard proprietary relational knowledge embedded in client trust and risk assessment processes, thereby mitigating opportunism and moral hazard in cross-border operations.[3] Similarly, in franchising, while franchising serves as a quasi-internalization mode for peripheral activities, core brand protection and operational coordination often necessitate FDI to internalize high transaction costs associated with monitoring franchisee compliance and preserving system-wide reputation.[3] These adaptations extend the theory beyond pure knowledge intangibles to emphasize governance of network effects and relational contracting in services.[63]For non-knowledge assets, internalization theory incorporates transaction cost logic to explain vertical integration in extractive industries, where firms counter hold-up risks from asset-specific investments in natural resources. In sectors like mining and oil, MNEs internalize upstream and downstream stages to secure supply chains against supplier opportunism, as external contracting exposes firms to renegotiation hazards amid site-specific infrastructure commitments.[64] This rationale aligns with the theory's core prediction that internalization advantages arise when market failures, such as incomplete contracts over physical assets, exceed hierarchical costs.Post-2000 empirical studies validate these adaptations, demonstrating internalization theory's explanatory power for service FDI, which constituted approximately 60% of global FDI inflows by the mid-2010s. Analyses of datasets from UNCTAD and OECD reveal that service MNEs, including those in finance and distribution, select FDI modes consistent with internalization predictions, prioritizing equity control to manage relational and branding assets over licensing amid imperfect intermediate markets.[65] These findings underscore the theory's robustness across asset types, though measurement challenges persist in distinguishing relational from tangible asset governance.[3]
Incorporation of Dynamic Capabilities
Internalization theory has been extended through integration with dynamic capabilities, a concept rooted in evolutionary economics that emphasizes firms' abilities to sense opportunities, seize them via investment decisions, and reconfigure assets in response to environmental changes. David Teece's 2014 framework posits that multinational enterprises (MNEs) leverage internalization to safeguard and cultivate these capabilities, particularly in turbulent global markets where external markets may fail to protect proprietary knowledge or routines effectively.[66] By choosing hierarchical governance over arm's-length transactions, firms preserve the tacit routines essential for ongoing adaptation, aligning with causal mechanisms where internalized operations foster cumulative learning and competency evolution rather than static efficiency gains.[67]This incorporation views repeated internalization decisions as iterative processes that build location-bound advantages, enabling MNEs to adapt firm-specific competencies to heterogeneous international contexts. For instance, subsidiaries established through foreign direct investment (FDI) allow firms to refine sensing capabilities by embedding local knowledge acquisition routines, which in turn enhance seizing through targeted resource reconfiguration. Empirical extensions pre-2020, such as those linking dynamic capabilities to MNE growth, demonstrate that internalization cycles causally strengthen these adaptive processes, as firms internalize knowledge flows to mitigate opportunism risks while amplifying evolutionary fitness in diverse locations.[68] Such dynamics contrast with purely transactional views, highlighting how internalization accumulates competencies over time, with evidence from MNE case studies showing improved reconfiguration speed post-internalization.[66]Scholars have further linked this to hybrid governance forms, treating alliance portfolios as quasi-internalization mechanisms that approximate dynamic capability development without full ownership. In volatile environments, alliances enable partial internalization of external technologies and routines, serving as a bridge to build sensing and seizing competencies while testing reconfiguration viability. Pre-2020 research on alliance activity underscores its role as a dynamic capability proxy, where competent firms use portfolios to internalize technological spillovers selectively, enhancing overall adaptive resilience in line with evolutionary principles.[69] This quasi-internalization approach causally supports competency building by allowing firms to experiment with location-specific advantages iteratively, though it requires strong orchestration capabilities to avoid governance hazards inherent in partial hierarchies.[70]
Controversies and Criticisms
Assumptions About Transaction Cost Knowledge
Internalization theory rests on the premise that firms possess adequate knowledge to foresee and evaluate the differential transaction costs of internalizing intermediate product markets—such as proprietary knowledge—through foreign direct investment (FDI) rather than external modes like licensing or arm's-length trade. This assumption implies that managers can rationally discern when market imperfections, including opportunism and asymmetric information, render external transactions inefficient, prompting hierarchical governance to safeguard firm-specific advantages.[3]A key criticism draws from bounded rationality, positing that cognitive constraints limit firms' ability to comprehensively anticipate future contingencies, leading to incomplete cost assessments and potential overvaluation of internalization benefits. Under this view, decision-makers may underestimate internal coordination costs or overestimate control gains due to heuristic biases and information overload, as highlighted in behavioral extensions of transaction cost frameworks. Such limitations suggest that FDI choices often reflect satisficing rather than optimizing behavior, risking persistent errors in governance alignment..pdf)[71]Proponents defend the assumption by emphasizing the theory's empirical track record in forecasting FDI prevalence in knowledge-intensive sectors, where repeated market exposures foster experiential learning sufficient for effective cost comparisons. Studies confirm that licensing arrangements frequently underperform in preserving intangible assets, with dissipation risks validating internalization even under bounded foresight, as firms adapt through iterative decision-making in oligopolistic settings.[3]Evidence of failed FDI, including elevated divestment rates in the 1990s—where foreign-controlled entities exhibited failure proportions exceeding 70% within five years in certain U.S. contexts—underscores gaps in predictive accuracy, attributable to unforeseen environmental shifts or misjudged asset specificity. These outcomes indicate bounded knowledge imposes real constraints but do not undermine the theory, as surviving internalizations often yield superior long-term value, aligning with probabilistic rather than deterministic foresight.[72]
Limitations in Explaining Non-FDI Internationalization
Internalization theory posits that firms prefer foreign direct investment (FDI) to externalize intermediate markets like exporting or licensing when imperfections—such as opportunism risks in knowledge transactions—make arm's-length exchanges inefficient, yet it offers less explanatory power for scenarios where firms pursue internationalization via lower-commitment non-FDI modes, including direct exports, contractual alliances, or born-global exporting strategies without hierarchical control.[63] This gap arises particularly in low-imperfection markets or high psychic distance contexts, where transaction costs do not strongly favor full internalization, leading firms to favor exports to test markets incrementally rather than committing via FDI.[56] Critics note that the theory's focus on ownership-specific advantages driving hierarchical governance undervalues alternative modes that mitigate risks without equity stakes, such as relational contracting in alliances.[73]Empirical studies underscore this limitation, revealing that non-FDI modes constitute a significant share of internationalization activity, challenging the theory's universality in predicting FDI dominance. For example, analysis of European firms shows that only 13.3% of internationalized entities primarily serve foreign markets through FDI, while indirect exporting— a non-internalized mode—accounts for 5.5%, with direct exports and other low-commitment approaches comprising the majority of initial foreign engagement.[74]Productivity data further supports prevalence of exports: direct exporters exhibit about 40% higher productivity than non-internationalized firms, yet many do not progress to FDI, indicating non-FDI paths suffice for value capture in imperfect but navigable markets.[75]While some extensions view hybrids like joint ventures as partial internalization—retaining control over key assets without full ownership—these do not fully resolve the theory's underemphasis on pure non-FDI choices, often requiring integration with complementary frameworks such as the Uppsala model, which attributes exporting to psychic distance barriers that delay commitment escalation.[63][76] This complementarity highlights internalization theory's strength in ownership decisions but weakness in sequencing low-commitment entries, as evidenced by born-global firms that export knowledge-intensive products globally from inception without prior FDI.[77]
Debates on Applicability in Digital and Networked Economies
Critics argue that the rise of digital platforms and technologies such as APIs and cloud computing has diminished traditional market imperfections in knowledge transfer, thereby undermining the rationale for internalization through FDI in favor of externalization and platformorchestration.[78] For instance, in the platform economy, firms like social media companies leverage network effects and modular interfaces to coordinate value creation without hierarchical control, reviving debates on whether internalization theory adequately explains reduced transaction costs in digital contexts.[32] A 2023 study posits that digitalization variably erodes these imperfections across industries, potentially favoring non-equity modes of internationalization over FDI for tech MNEs.[79]Proponents counter that core assets in digital economies, including tacit knowledge embedded in AI algorithms and proprietary data sets, remain difficult to transfer externally without opportunistic risks, sustaining the incentives for internalization.[32]Empirical evidence supports this, as born-digital firms increasingly pursue FDI to access complementary local resources and mitigate scalability limits in culturally distant markets, with a positive association between geographic distance and FDI incidence observed in datasets from 2010–2020.[80] Global FDI inflows into digital-intensive sectors, such as information and communication technology, rose by 12% annually from 2015 to 2022, contradicting claims of theoretical obsolescence.Extensions incorporating social network theory position these frameworks as complementary to internalization, rather than replacements, by emphasizing how relational ties enhance knowledge governance in networked settings without supplanting firm boundaries.[81] A 2022 review of 210 studies from 2010–2022 highlights that social networks facilitate rapid internationalization for digital firms but interact with internalization motives, such as protecting intangible assets, in hybrid models.[81] Overall, while digital trends challenge static applications, internalization theory adapts by integrating dynamic elements like platform externalities, maintaining its relevance for explaining FDI persistence amid technological disruption.[78]
Applications and Implications
Strategic Decision-Making in MNEs
Internalization theory informs multinational enterprises' (MNEs) entry mode decisions by emphasizing the internalization of markets for firm-specific intermediate products, particularly knowledge-intensive assets, when external transaction costs—such as opportunism, imitation risks, or thin markets—exceed the costs of hierarchical governance. Firms opt for foreign direct investment (FDI) over licensing or exporting when proprietary advantages are core to competitive positioning and external markets fail to price risks adequately, as in cases of tacit knowledge difficult to contract or enforce. Conversely, divestiture or arm's-length arrangements suit non-core assets where marketefficiency prevails, preserving capital for high-value internalization. This mode selection framework, rooted in comparing internalization benefits against alternatives, drives MNEs toward configurations that minimize total costs across global operations.[13]FDI yields advantages in asset protection and value chain coordination but incurs drawbacks like operational rigidity and exposure to host-country uncertainties, contrasting with licensing's lower commitment yet higher leakage risks. In the upstream oil sector, major firms internalized exploration and production to circumvent market imperfections in crude supply contracts, evidenced by integrated refineries maintaining 81.6% utilization rates in 1950 compared to 54.3% for non-integrated peers, reflecting reduced variability from secured, specific investments such as pipelines. Standard Oil of Indiana, for example, pursued backward integration from 1919 onward after 1918 shortages limited refinery input to 18,000 barrels per day against a 55,000-barrel need, achieving 40% self-sufficiency by 1925 through acquisitions like Midwest Refining, which lowered dependency on volatile spot markets and opportunistic suppliers. These cases demonstrate verifiable efficiency from internalization, prioritizing causal reductions in hold-up and coordination costs over external dependencies.[82]Decision-making under internalization theory exhibits skepticism toward claims of broad managerial discretion, instead demanding evidence of quantifiable savings, such as avoided licensing fees in weak intellectual property regimes or pre-emptive control against rivals. MNEs thus calibrate wholly-owned subsidiaries for high-internalization scenarios, like technology transfers requiring tacit skills, where empirical patterns show preferences for full ownership to internalize complementary assets amid technological opportunities. This efficiency-oriented lens subordinates rent-seeking motives to first-principles cost comparisons, ensuring strategic choices align with sustainable competitive edges rather than unverified expansions.[13][83]
Policy Considerations for Host and Home Governments
Host governments shape multinational enterprise (MNE) entry decisions under internalization theory by affecting transaction costs associated with knowledge governance. Weak intellectual property (IP) protection incentivizes FDI over arm's-length licensing, as firms internalize proprietary assets to mitigate dissipation risks in external markets; empirical analyses confirm that strengthening IP regimes shifts technology transfer toward licensing while reducing FDI reliance on internalization.[84][85] Conversely, regulatory mandates such as forced joint ventures (JVs) with local partners elevate agency costs and expropriation hazards, deterring greenfield FDI; evidence from cross-country studies shows forced JVs correlate with lower efficiency and reduced investment volumes compared to wholly-owned operations.[86][87]Home governments influence outbound MNE strategies through fiscal policies that can distort optimal internalization. Tax incentives favoring licensing exports over FDI subsidiaries may encourage premature externalization of firm-specific advantages, heightening knowledge leakage without enhancing national welfare; theoretical models grounded in internalization predict that such distortions lead to suboptimal global allocation of production.[88] Empirical work on profit taxation and entry modes indicates that neutral or coordinated tax systems better support FDI-driven internalization, avoiding protectionist biases that prioritize short-term licensing revenues over long-term technological control.[89]Proponents of interventionist policies, including technology transfer mandates, argue they facilitate host-country catch-up by compelling spillovers, yet internalization theory highlights disincentives for innovation when proprietary assets face coerced diffusion. In China's pre-2020 framework, JV requirements and implicit transfer demands yielded limited advanced technology inflows, as MNEs scaled back R&D commitments to protect core competencies; post-reform easing of mandates correlated with revived FDI and larger innovation scopes, underscoring how such policies erode originator incentives and overall efficiency.[90][91] Internalization theory thus advocates for host and home policies minimizing distortions—such as robust yet non-expropriatory IP enforcement and mode-neutral taxation—to leverage market mechanisms for efficient cross-border knowledge deployment.
Influence on Broader International Business Research
Internalization theory has profoundly shaped international business (IB) scholarship by providing a parsimonious explanation for the existence and boundaries of multinational enterprises (MNEs), emphasizing the internalization of imperfect external markets for knowledge and intermediate goods to minimize transaction costs. Recent retrospectives affirm its enduring relevance, positioning it as a general theory of the MNE that integrates economic principles with strategic imperatives, thereby bridging microeconomic analysis and firm-level strategy in IB research.[3][92] For instance, extensions in the early 2020s have integrated internalization with global value chain (GVC) governance, highlighting how MNEs optimize location, internalization degrees, and relational management beyond traditional models, particularly in fragmented production networks.[93][94]The theory's robustness is evident in its predictive power regarding organizational forms and entry modes, with empirical studies validating its capacity to forecast MNE configurations across diverse contexts, from emerging markets to service industries.[95] Adaptations for the digital economy further demonstrate its adaptability, as digital service MNEs leverage internalization to manage data and platform-specific advantages, where low marginal costs reinforce hierarchical governance over arm's-length transactions despite reduced geographic barriers.[32] Integrations with family firm dynamics have also extended its scope, showing how socioemotional wealth considerations influence internalization choices in GVCs, diverging from profit-maximizing assumptions in conventional models.[96] These developments underscore achievements in explaining coopetition, diffused innovation, and virtual teams in modern MNEs.[97]Despite these advances, gaps persist, particularly in linking internalization to sustainability imperatives, where the theory's focus on efficiency gains from market imperfections has underexplored how MNEs internalize environmental externalities or align transaction cost minimization with long-term ecological resilience.[98] While robust in predicting economic behaviors, further theoretical refinements are needed to incorporate non-market pressures like regulatory sustainability mandates, ensuring broader applicability amid evolving global challenges.[3]