{"id":545656,"date":"2025-11-03T06:01:28","date_gmt":"2025-11-03T06:01:28","guid":{"rendered":"https:\/\/www.europesays.com\/uk\/545656\/"},"modified":"2025-11-03T06:01:28","modified_gmt":"2025-11-03T06:01:28","slug":"fractal-clusters-and-urban-scaling-shape-spatial-inequality-in-u-s-patenting","status":"publish","type":"post","link":"https:\/\/www.europesays.com\/uk\/545656\/","title":{"rendered":"Fractal clusters and urban scaling shape spatial inequality in U.S. patenting"},"content":{"rendered":"<p>Patent database<\/p>\n<p>A comprehensive database was developed to analyze the spatial and temporal distribution of patent activity within a subset of contemporary U.S. patents spanning 1905 to 2024. We identified eight high-impact innovation sectors (plug-in cars, fracking, end-user multimedia applications, video games, chip architectures, machine learning, biotechnology enzymes, and mobile devices) that play pivotal roles in driving technological progress and economic growth within the United States. The United States Patent and Trademark Office (USPTO) maintains a high-quality data source called PatentsView<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 39\" title=\"USPTO. PatentsView. &#010;                  https:\/\/www.patentsview.org\/download\/&#010;                  &#010;                 (2019).\" href=\"http:\/\/www.nature.com\/articles\/s44260-025-00054-y#ref-CR39\" id=\"ref-link-section-d455046205e1580\" target=\"_blank\" rel=\"noopener\">39<\/a>, which we used to compile the database. Older patents (prior to 1976) were retrieved by manually downloading records from the USPTO\u2019s public database (June 2024). This effort produced a unified, georeferenced patent dataset capturing contemporary technological trends across key innovative U.S. 
sectors, providing a valuable basis for both quantitative and spatial analyses of patent productivity over the past six decades. The resultant patent database comprises 139,387 USPTO patents.<\/p>\n<p>For each patent, we collected detailed information, including the title, brief description, country of origin, assignee, names of inventors<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 40\" title=\"Monath, N., McCallum, A., Wick, M., Sullivan, J. &amp; Kobren, A. Discriminative hierarchical coreference for inventor disambiguation. In PatentsView Inventor Disambiguation Technical Workshop, U.S. Patent and Trademark Office (USPTO) (Alexandria, Virginia, 2015).\" href=\"http:\/\/www.nature.com\/articles\/s44260-025-00054-y#ref-CR40\" id=\"ref-link-section-d455046205e1587\" target=\"_blank\" rel=\"noopener\">40<\/a>, and geographic coordinates (latitude and longitude) of the first inventor\u2019s location. To categorize patents by technology type, we first identified Cooperative Patent Classification (CPC) codes that correspond to each sector. CPC codes serve as a standardized method of classifying inventions based on their technical characteristics and applications. We assigned patents to technology sectors by searching the entire patent database for these CPC codes, which represent high-value innovation sectors. Each patent was categorized into a sector if it contained at least one relevant CPC code in its documentation. In the following, we describe each technological sector, its CPC code criteria, and its relevance in the U.S. innovation landscape.<\/p>\n<p>The End-User Applications (24,165 patents with CPC code: H04N 21\/47) sector comprises patents for end-user multimedia applications characterized by their interactivity, service structure, and diverse functionalities. 
Relevant patents cover a wide range of services, from local multimedia applications to high-bandwidth uplink technologies, showcasing the growing influence of digital media in consumer technology.<\/p>\n<p>The Plug-In Electric Cars (9,320 patents with CPC code: Y02T 90\/14) sector encompasses patents related to plug-in electric vehicle technologies, which contribute to the development of sustainable transportation. These patents reflect innovation within the electric vehicle industry.<\/p>\n<p>The Fracking (4,609 patents with CPC code: E21B 43\/16) sector covers patents for enhanced hydrocarbon recovery methods, specifically hydraulic fracturing, used to mobilize hydrocarbons in reservoirs. Innovations in fracking technologies have transformed U.S. energy production and reshaped the nation\u2019s role in global energy markets.<\/p>\n<p>The Machine Learning (21,462 patents with CPC code: G06N 20\/00) sector includes patents related to machine learning technologies, which equip machines with adaptive capabilities based on experiential data. These patents range from algorithms to physical machine implementations and reflect the transformative impact of artificial intelligence on diverse industries.<\/p>\n<p>The Video Games (20,490 patents with CPC code: A63F 13\/00) sector comprises patents encompassing a wide range of video game technologies, including hardware accessories, software innovations such as virtual camera animation, network-based game features, and cutting-edge game content generation, shaping the landscape of the video game industry and driving technological advancements. The dataset captures the technological complexity of modern electronic entertainment.<\/p>\n<p>The Biotech Enzymes (51,853 patents with CPC code: C12Q) sector includes patents for processes involving enzymes, nucleic acids, and microorganisms, with applications in diagnostics and bioengineering. 
These patents reflect advances in healthcare, genomics, and environmental biotechnology, representing areas of life sciences innovation.<\/p>\n<p>The Mobile Devices (2,108 patents with CPC code: H04M 1\/0202) sector captures patents for portable telephones, including mobile phones and cordless handsets, specifically focusing on structural features of mobile devices. The growth of this sector illustrates the evolution of mobile communication and connectivity technologies.<\/p>\n<p>The Chip Architecture (5,380 patents with CPC code: G06T 1\/20) sector includes patents for graphics processors, including GPUs, pipelines, and architectures for image data processing. Patents in this sector are central to advancements in computer graphics, parallel processing, and the increasing performance demands of visual data applications.<\/p>\n<p>Database reconstruction pipeline<\/p>\n<p>The study reconstructs the patent database using a customized pipeline that combines multiple data sources and methods to deliver a complete georeferenced patent dataset. This reconstruction was designed to enable robust data collection, systematic processing, and integration of patent metadata (see Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s44260-025-00054-y#Fig7\" target=\"_blank\" rel=\"noopener\">7<\/a>a). First, by consolidating data from multiple sources, it ensures comprehensive coverage and minimizes biases inherent in individual datasets. Second, separating records based on temporal and classification differences allows for better handling of data heterogeneity, such as the pre-1976 and post-1976 patent formats. Lastly, the inclusion of georeferenced inventor locations supports the spatial multiscale analyses that are central to this study.<\/p>\n<p><b id=\"Fig7\" class=\"c-article-section__figure-caption\" data-test=\"figure-caption-text\">Fig. 
7: Construction of the patent database.<\/b><a class=\"c-article-section__figure-link\" data-test=\"img-link\" data-track=\"click\" data-track-label=\"image\" data-track-action=\"view figure\" href=\"https:\/\/www.nature.com\/articles\/s44260-025-00054-y\/figures\/7\" rel=\"nofollow noopener\" target=\"_blank\"><img decoding=\"async\" aria-describedby=\"Fig7\" src=\"https:\/\/www.europesays.com\/uk\/wp-content\/uploads\/2025\/11\/44260_2025_54_Fig7_HTML.png\" alt=\"figure 7\" loading=\"lazy\" width=\"685\" height=\"271\"\/><\/a><\/p>\n<p><b>a<\/b> This diagram summarizes the multi-stage reconstruction process used to build a spatially and sectorally resolved dataset of U.S. patents. The pipeline processes more than 4 million patents and extracts approximately 140,000 records across 8 technological domains. Two distinct workflows are implemented: one for pre-1976 patents (left side), which rely on scanned image records and OCR-based extraction, and another for post-1976 patents (right side), based on structured digital records. Both workflows include sector-specific keyword filtering, geolocation of assignees and inventors, and deduplication procedures (see \u201cMethods\u201d). <b>b<\/b> Annual counts of granted patents in the United States (1905\u20132024) are plotted on a logarithmic scale to emphasize long-term growth dynamics and visualize year-to-year variation across multiple orders of magnitude. The red dashed line indicates the year 1972, marking the threshold after which 99% of the total cumulative patents in the dataset were issued. The plot highlights the sharp rise in patenting activity over recent decades, which accounts for the majority of the dataset and motivates the focus on post-1970 dynamics in several analyses.<\/p>\n<p>The process begins with data collection from three main platforms: (1) PatentsView, (2) Dataverse, and (3) <a href=\"http:\/\/patents.google.com\" target=\"_blank\" rel=\"noopener\">patents.google.com<\/a>. 
Each of these sources contributes distinct datasets that complement one another. For instance, Google Patents provides raw patent records divided into pre-1976 and post-1976 periods to account for differences in archival formats. PatentsView adds enriched metadata fields, while Dataverse contributes supplementary datasets such as classifications and regional identifiers. Semi-automated Python scripts were used to interact with these platforms, querying and downloading records based on patent identifiers.<\/p>\n<p>Once the data is retrieved, parsing routines extract relevant information from the HTML files obtained from <a href=\"http:\/\/patents.google.com\" target=\"_blank\" rel=\"noopener\">patents.google.com<\/a>. This structured data includes fields such as patent titles, inventor locations, and classification codes, which are consolidated into an intermediate database labeled db1.<\/p>\n<p>In the next stage, the database is refined by integrating additional fields from PatentsView and Dataverse, such as CPC subgroup classifications (see above), geospatial coordinates, and temporal data. This step enhances data granularity and completeness, resulting in an updated database, db2, that contains both the parsed information and the enriched metadata.<\/p>\n<p>Finally, the fully processed dataset is exported into a standardized CSV file, labeled full.csv, for use in subsequent analyses. This final export contains all fields necessary for the study, ensuring compatibility with statistical and geospatial analysis tools. The CSV format facilitates its use in a wide range of applications, from hierarchical clustering and spatio-temporal pattern analysis to inequality measurements.<\/p>\n<p>Our analysis spans the time interval from 1905 to 2024; however, the bulk of the patent publications are concentrated in the last 52 years. As shown in Fig. 
<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s44260-025-00054-y#Fig7\" target=\"_blank\" rel=\"noopener\">7<\/a>b, 99% of the patents in the dataset were issued from 1972 onward.<\/p>\n<p>Fitting Zipf\u2019s law<\/p>\n<p>To investigate the heterogeneous distribution of patent activity across U.S. innovation sectors, we use Zipf\u2019s law, a principle commonly used to analyze ranked distributions. Zipf\u2019s law describes a situation where the frequency f(r) of an item decays as a power of its rank r, expressed as<\/p>\n<p>$$f(r) \\sim {r}^{-\\gamma },$$<\/p>\n<p>\n                    (2)\n                <\/p>\n<p>where \u03b3 is the exponent characterizing the distribution. By plotting patent counts against rank on a log-log scale, we can assess whether patenting activity follows Zipf\u2019s law across different geographic locations. For each sector, we ranked locations by patent count and fitted a power-law model using maximum likelihood estimation as implemented in the Python powerlaw package (version 1.5)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 41\" title=\"Alstott, J., Bullmore, E. &amp; Plenz, D. powerlaw: a python package for analysis of heavy-tailed distributions. PLoS ONE 9, e85777 (2014).\" href=\"http:\/\/www.nature.com\/articles\/s44260-025-00054-y#ref-CR41\" id=\"ref-link-section-d455046205e1771\" target=\"_blank\" rel=\"noopener\">41<\/a> (based on the work of Clauset et al.<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 42\" title=\"Clauset, A., Shalizi, C. R. &amp; Newman, M. E. Power-law distributions in empirical data. SIAM Rev. 
51, 661&#x2013;703 (2009).\" href=\"http:\/\/www.nature.com\/articles\/s44260-025-00054-y#ref-CR42\" id=\"ref-link-section-d455046205e1775\" target=\"_blank\" rel=\"noopener\">42<\/a>), to evaluate if the observed rank-frequency distribution adheres to Zipf\u2019s law or whether it can be best described by other distributions.<\/p>\n<p>Analysis of size-conditional productivity<\/p>\n<p>Temporal analysis of patenting activity reveals a two-phase pattern in the geographic diffusion of innovation. The initial phase is characterized by rapid spatial expansion, with each new patent frequently introducing a previously untapped location into the innovation network. This \u201cboom\u201d phase spans the emergence of the first 40\u201380 patents, regardless of technological domain (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s44260-025-00054-y#Fig2\" target=\"_blank\" rel=\"noopener\">2<\/a>b), and is typically concentrated in established innovation hubs such as Silicon Valley (machine learning, software<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 43\" title=\"L&#xE9;cuyer, C. Making Silicon Valley: Innovation and the Growth of High Tech, 1930-1970 (MIT Press, 2006).\" href=\"http:\/\/www.nature.com\/articles\/s44260-025-00054-y#ref-CR43\" id=\"ref-link-section-d455046205e1790\" target=\"_blank\" rel=\"noopener\">43<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 44\" title=\"Engel, J. S. Global clusters of innovation: lessons from Silicon Valley. Calif. Manag. Rev. 
57, 36&#x2013;65 (2015).\" href=\"http:\/\/www.nature.com\/articles\/s44260-025-00054-y#ref-CR44\" id=\"ref-link-section-d455046205e1793\" target=\"_blank\" rel=\"noopener\">44<\/a>), Houston (fracking<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 45\" title=\"Gold, R. The boom: How Fracking Ignited the American Energy Revolution and Changed the World (Simon and Schuster, 2014).\" href=\"http:\/\/www.nature.com\/articles\/s44260-025-00054-y#ref-CR45\" id=\"ref-link-section-d455046205e1797\" target=\"_blank\" rel=\"noopener\">45<\/a>), and Boston (biotechnology<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 46\" title=\"Powell, W. W., Koput, K. W., Bowie, J. I. &amp; Smith-Doerr, L. The spatial clustering of science and capital: accounting for biotech firm-venture capital relationships. Reg. Stud. 36, 291&#x2013;305 (2002).\" href=\"http:\/\/www.nature.com\/articles\/s44260-025-00054-y#ref-CR46\" id=\"ref-link-section-d455046205e1801\" target=\"_blank\" rel=\"noopener\">46<\/a>).<\/p>\n<p>As patenting activity increases, the number of new entrants gradually decreases. Secondary hubs, such as Austin (electric vehicles<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 47\" title=\"Klier, T. &amp; Rubenstein, J. M. The emerging geography of electric vehicle production in North America: revolution or evolution? Econ. 
Perspect., 4, &#010;                  https:\/\/doi.org\/10.21033\/ep-2024-4&#010;                  &#010;                 (2024).\" href=\"http:\/\/www.nature.com\/articles\/s44260-025-00054-y#ref-CR47\" id=\"ref-link-section-d455046205e1808\" target=\"_blank\" rel=\"noopener\">47<\/a>, gaming<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 48\" title=\"Towse, R. &amp; Handka, C. Handbook on the Digital Creative Economy (Edward Elgar Publishing, 2013).\" href=\"http:\/\/www.nature.com\/articles\/s44260-025-00054-y#ref-CR48\" id=\"ref-link-section-d455046205e1812\" target=\"_blank\" rel=\"noopener\">48<\/a>) and Raleigh (biotechnology<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 49\" title=\"Koo, J., Bae, J. &amp; Kim, D. What does it take to become a biotech hot spot? Environ. Plan. C 27, 665&#x2013;683 (2009).\" href=\"http:\/\/www.nature.com\/articles\/s44260-025-00054-y#ref-CR49\" id=\"ref-link-section-d455046205e1816\" target=\"_blank\" rel=\"noopener\">49<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 50\" title=\"Feldman, M. The locational dynamics of the us biotech industry: knowledge externalities and the anchor hypothesis. Ind. Innov. 10, 311&#x2013;329 (2003).\" href=\"http:\/\/www.nature.com\/articles\/s44260-025-00054-y#ref-CR50\" id=\"ref-link-section-d455046205e1819\" target=\"_blank\" rel=\"noopener\">50<\/a>), gradually enter the network, but do not displace the dominance of primary centers. The consistent temporal pattern shows that general principles, in combination with sector-specific dynamics<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 51\" title=\"Hughes, J. D. 
A reality check on the shale revolution. Nature 494, 307&#x2013;308 (2013).\" href=\"http:\/\/www.nature.com\/articles\/s44260-025-00054-y#ref-CR51\" id=\"ref-link-section-d455046205e1823\" target=\"_blank\" rel=\"noopener\">51<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 52\" title=\"Gertler, M. S. &amp; Levitte, Y. M. Local nodes in global networks: the geography of knowledge flows in biotechnology innovation. Ind. Innov. 12, 487&#x2013;507 (2005).\" href=\"http:\/\/www.nature.com\/articles\/s44260-025-00054-y#ref-CR52\" id=\"ref-link-section-d455046205e1826\" target=\"_blank\" rel=\"noopener\">52<\/a>, shape the geography of innovation over time.<\/p>\n<p>To quantify temporal dynamics, we analyze the size-conditional productivity of locations and the rate at which new locations enter the innovation network. We divide the patenting history into discrete time intervals, each containing 1\/10th of the total patenting sequence, and measure patenting rates across these periods. Using a sliding window approach, we define a \u201cpresent\u201d bin containing 300 patents while considering all preceding patenting activity as the \u201cpast.\u201d<\/p>\n<p>For each location of size i (i.e., locations that had already produced i patents before the bin), we compute the normalized patenting productivity as:<\/p>\n<p>$${P}_{i}=\\frac{\\Delta {N}_{i}}{\\Delta N}$$<\/p>\n<p>where \u0394Ni represents the increase in patent counts for locations of size i, and \u0394N is the total number of patents in the interval. By removing P0 data (productivity associated with locations that had never produced a patent before), we characterize the cumulative advantage effect (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s44260-025-00054-y#Fig2\" target=\"_blank\" rel=\"noopener\">2<\/a>a). 
Fitting a power-law relation to the empirical productivity values yields an estimated exponent \u03b3\u2009=\u20090.962\u2009\u00b1\u20090.021, supporting the hypothesis that preferential attachment governs the growth of established innovation hubs.<\/p>\n<p>In contrast, the entrant rate is quantified by measuring the fraction of patents associated with first-time entrants. Using sliding bins of 300 patents (with 25-patent step sizes), we track entrant rates over time across all eight technological sectors, obtaining an average trajectory.<\/p>\n<p>To better understand the mechanisms driving these empirical trends, we consider two limiting cases in a stochastic innovation model: (1) Scenario I (Constant Innovation Rate) assumes a fixed rate of new locations entering the innovation landscape. Under this scenario, diversity would grow linearly with the total number of patents, implying that patenting activity remains widely distributed without strong hub entrenchment; (2) Scenario II (Resource-Limited Expansion) assumes that the system is bounded by an initial diversity pool, meaning that as patenting activity grows, new entrant opportunities diminish exponentially due to resource constraints and hub dominance.<\/p>\n<p>To formalize these dynamics, we implement a stochastic model that simulates the evolution of the innovation landscape. The system begins with a pool of locations that have yet to produce patents. At each iteration, a location is selected based on cumulative advantage dynamics, meaning that locations with a strong history of innovation are more likely to continue patenting. When a patent is assigned, the location either already exists in the innovation network (reinforcing its status as a hub) or is a completely new entrant. Depending on the scenario, new locations are either introduced at a fixed rate (Scenario I) or drawn from a finite, resource-limited pool (Scenario II). 
Neither of these idealized cases fully captures the empirical patterns, as seen in Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s44260-025-00054-y#Fig2\" target=\"_blank\" rel=\"noopener\">2<\/a>b. The observed entrant rate declines sublinearly, following a power-law slowdown. This suggests that while innovation hubs consolidate their dominance over time, geographic expansion continues at a diminishing rate rather than abruptly ceasing<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 53\" title=\"Maskell, P., Bathelt, H. &amp; Malmberg, A. Building global knowledge pipelines: the role of temporary clusters. Eur. Plan. Stud. 14, 997&#x2013;1013 (2006).\" href=\"http:\/\/www.nature.com\/articles\/s44260-025-00054-y#ref-CR53\" id=\"ref-link-section-d455046205e1941\" target=\"_blank\" rel=\"noopener\">53<\/a>.<\/p>\n<p>To examine how entrant dynamics influence the overall diversity of the innovation landscape, we track the temporal evolution of location diversity. If the system were purely resource-limited, diversity would saturate due to rapid entrant decay. Conversely, a constant-rate model would predict a steady increase in diversity. By tracking entrant rates and diversity growth under these conditions, we observe two contrasting behaviors. First, if new locations continue to emerge at a constant rate, diversity grows linearly with total patent output, and innovation remains widely distributed rather than being concentrated in hubs. And second, if location entry is resource-limited, diversity plateaus over time, and patenting activity becomes highly concentrated in a small number of dominant hubs.<\/p>\n<p>Neither of these idealized cases fully captures empirical patterns, which instead exhibit a power-law slowdown in entrant rates and a long-term, sublinear increase in diversity (Fig. 
<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s44260-025-00054-y#Fig2\" target=\"_blank\" rel=\"noopener\">2<\/a>c). This suggests that while hub entrenchment is non-negligible, geographic expansion persists over long time scales, albeit at a diminishing rate.<\/p>\n<p>Multiscale graph analysis<\/p>\n<p>We develop a multiscale spatial framework that integrates geometric graph theory, percolation analysis, and fractal scaling to characterize the spatial organization of patenting activity. This approach quantifies how connected components of innovation emerge and how their structural properties vary with spatial resolution. Given a set of geolocated patents \\({\\mathcal{P}}={\\{({x}_{i},{y}_{i})\\}}_{i = 1}^{N}\\), we deduplicate it to obtain L unique coordinates \\({\\mathcal{L}}={\\{({x}_{j},{y}_{j})\\}}_{j = 1}^{L}\\), each with an associated weight Nj denoting the number of patents at that location. For each spatial scale R, we construct a geometric graph GR\u2009=\u2009(V, ER) where nodes represent unique locations and undirected edges connect nodes within distance R:<\/p>\n<p>$${E}_{R}=\\{(i,j)\\in V\\times V| \\parallel ({x}_{i},{y}_{i})-({x}_{j},{y}_{j}){\\parallel }_{2}\\le R\\}.$$<\/p>\n<p>We compute the set of connected components \\({Q}_{1},{Q}_{2},\\ldots ,{Q}_{{n}_{R}}\\) of GR, where each Qi contains all nodes linked through paths of edges not exceeding distance R. We define the \u201cfragmentation function&#8221; as the number of connected components normalized by the number of nodes:<\/p>\n<p>$$f(R)=\\frac{{N}_{clusters}}{L}$$<\/p>\n<p>\n                    (3)\n                <\/p>\n<p>where Nclusters is the number of connected components and L\u2009=\u2009\u2223V\u2223 is the total number of nodes. 
This function tracks how the geometric graph fragmentation decreases as spatial proximity increases.<\/p>\n<p>Empirical data exhibit a power-law decay in the fragmentation function:<\/p>\n<p>$$f(R) \\sim {R}^{-\\beta },$$<\/p>\n<p>\n                    (4)\n                <\/p>\n<p>indicating a scale-invariant, fractal-like structure in the spatial distribution of patenting<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 36\" title=\"Plaszczynski, S., Nakamura, G., Deroulers, C., Grammaticos, B. &amp; Badoual, M. Levy geometric graphs. Phys. Rev. E 105, 054151 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s44260-025-00054-y#ref-CR36\" id=\"ref-link-section-d455046205e2532\" target=\"_blank\" rel=\"noopener\">36<\/a>. This behavior contrasts with conventional spatial null models, where f(R) typically decays exponentially or exhibits a sharp percolation transition<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 54\" title=\"Stauffer, D. &amp; Aharony, A. Introduction to percolation theory (Taylor &amp; Francis, 2018).\" href=\"http:\/\/www.nature.com\/articles\/s44260-025-00054-y#ref-CR54\" id=\"ref-link-section-d455046205e2542\" target=\"_blank\" rel=\"noopener\">54<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 55\" title=\"Dorogovtsev, S. N., Goltsev, A. V. &amp; Mendes, J. F. Critical phenomena in complex networks. Rev. Mod. Phys. 80, 1275&#x2013;1335 (2008).\" href=\"http:\/\/www.nature.com\/articles\/s44260-025-00054-y#ref-CR55\" id=\"ref-link-section-d455046205e2545\" target=\"_blank\" rel=\"noopener\">55<\/a>. 
In the fractal regime\u2013where the above scaling law holds\u2013a single dominant connected component does not emerge, and patenting remains distributed across a hierarchy of spatial components. Although fractal patterns have been documented in urban geography<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 30\" title=\"Batty, M. &amp; Longley, P. A. Fractal cities: a geometry of form and function (Academic Press, 1994).\" href=\"http:\/\/www.nature.com\/articles\/s44260-025-00054-y#ref-CR30\" id=\"ref-link-section-d455046205e2550\" target=\"_blank\" rel=\"noopener\">30<\/a>, the scaling observed in patenting networks emerges independently of population distribution, as demonstrated by the failure of population-based null models to reproduce the empirical (\u03b2) exponent (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s44260-025-00054-y#Fig3\" target=\"_blank\" rel=\"noopener\">3<\/a>c\u2013e).<\/p>\n<p>To capture the full history of spatial aggregation, we track the merging of connected components across a range of radius thresholds and encode this information in a clustering tree. Initially, each node forms a singleton component. To efficiently identify spatial neighbors at each scale, we insert the points in \\({\\mathcal{L}}\\) into a quadtree spatial index<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 56\" title=\"Finkel, R. A. &amp; Bentley, J. L. Quad trees a data structure for retrieval on composite keys. Acta Inform. 4, 1&#x2013;9 (1974).\" href=\"http:\/\/www.nature.com\/articles\/s44260-025-00054-y#ref-CR56\" id=\"ref-link-section-d455046205e2581\" target=\"_blank\" rel=\"noopener\">56<\/a>. 
As R increases, overlapping connected components are merged, and these events are recorded as branches in a hierarchical tree. Each node in the clustering tree corresponds to a connected component, and merging events generate parent nodes from their child components. This structure preserves the complete multiscale aggregation history, enabling structural analyses of spatial organization. Figure <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s44260-025-00054-y#Fig4\" target=\"_blank\" rel=\"noopener\">4<\/a>a illustrates an example clustering tree whose hierarchical architecture is consistent across spatial scales. To assess the topological self-similarity of clustering trees, we adopt the shape tree analysis proposed by Herrada et al.<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 32\" title=\"Herrada, E. A. et al. Universal scaling in the branching of the tree of life. PLoS ONE 3, e2757 (2008).\" href=\"http:\/\/www.nature.com\/articles\/s44260-025-00054-y#ref-CR32\" id=\"ref-link-section-d455046205e2591\" target=\"_blank\" rel=\"noopener\">32<\/a>. For each region i, we define a subtree Si as the union of the connected component rooted at i and all its descendant regions. The size Ai is the total number of patents within Si, and the cumulative branch size Ci is:<\/p>\n<p>$${C}_{i}=\\sum _{j\\in {S}_{i}}{A}_{j}.$$<\/p>\n<p>\n                    (5)\n                <\/p>\n<p>This measure reflects the extent and asymmetry of spatial aggregation. Balanced trees minimize Ci, whereas more asymmetric or chain-like trees yield larger Ci values. Herrada et al. demonstrated that for many empirical trees, Ci and Ai satisfy an allometric scaling law:<\/p>\n<p>$$C \\sim {A}^{\\tau },$$<\/p>\n<p>\n                    (6)\n                <\/p>\n<p>with a universal exponent \u03c4\u2009\u2248\u20091.44. 
We observe a similar exponent in the spatial organization of patenting (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s44260-025-00054-y#Fig4\" target=\"_blank\" rel=\"noopener\">4<\/a>b), suggesting that the clustering trees obey scaling constraints similar to those found in biological or information networks. To further assess scale invariance, we examine the complementary cumulative distribution functions (CCDFs) of A and C:<\/p>\n<p>$$F(A) \\sim {A}^{1-{\\tau }_{A}},\\quad F(C) \\sim {C}^{1-{\\tau }_{C}}.$$<\/p>\n<p>\n                    (7)\n                <\/p>\n<p>Both functions exhibit power-law tails (Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s44260-025-00054-y#Fig4\" target=\"_blank\" rel=\"noopener\">4<\/a>c), which supports the fractal nature of empirical clustering trees.<\/p>\n<p>Coefficient of variation across scales<\/p>\n<p>To explore whether spatial inequality in patenting exhibits scale dependence, we measure the coefficient of variation CV(R), the ratio of the standard deviation to the mean of patent counts, across different aggregation scales R. Using the same geometric graph framework described above, we compute the connected components \\({Q}_{1},{Q}_{2},\\ldots ,{Q}_{{n}_{R}}\\) at each spatial scale. 
The total patent count of component Qi is:<\/p>\n<p>$${W}_{i}=\\sum _{{v}_{j}\\in {Q}_{i}}{N}_{j}.$$<\/p>\n<p>From the set {Wi}, we compute:<\/p>\n<p>$${\\mu }_{R}=\\frac{1}{{n}_{R}}\\mathop{\\sum }\\limits_{i=1}^{{n}_{R}}{W}_{i},$$<\/p>\n<p>$${\\sigma }_{R}=\\sqrt{\\frac{1}{{n}_{R}}\\mathop{\\sum }\\limits_{i=1}^{{n}_{R}}{({W}_{i}-{\\mu }_{R})}^{2}},$$<\/p>\n<p>and the coefficient of variation:<\/p>\n<p>$$\\,\\text{CV}\\,(R)=\\frac{{\\sigma }_{R}}{{\\mu }_{R}}.$$<\/p>\n<p>\n                    (8)\n                <\/p>\n<p>To assess robustness, we perform bootstrapping with 100 resampled datasets. The resulting confidence intervals (yellow curves in Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s44260-025-00054-y#Fig5\" target=\"_blank\" rel=\"noopener\">5<\/a>, top row) confirm that the observed peak in inequality is not an artifact of sampling variability, but rather a persistent feature of the spatial distribution of innovation. Our Python implementation automatically calculates the multiscale coefficient of variation CV(R) from a set of geographical locations.<\/p>\n<p>A stochastic model of innovation dynamics<\/p>\n<p>To investigate the emergence of spatial patterns in U.S. patenting, we develop a minimal stochastic model that reproduces key empirical regularities described in Results. The model simulates the evolution of innovation activity through two fundamental processes: cumulative advantage and non-local dispersal.<\/p>\n<p>The simulation unfolds over a geographic domain corresponding to the continental United States. It is initialized with the locations of the 80 most populous U.S. cities (population\u2009&gt;\u2009500,000), which act as potential starting points for innovation. At each time step, a new patent is added to the system. 
With probability 1\u2009\u2212\u2009\u03bd(N), where \u03bd(N) is the entry rate defined below, the patent is assigned to an existing location i drawn from the current set \\({\\mathcal{L}}\\) of active locations. The selection probability is given by a reinforcement mechanism:<\/p>\n<p>$${\\pi }_{i}=\\frac{p{N}_{i}+1}{{\\sum }_{j}(p{N}_{j}+1)},$$<\/p>\n<p>\n                    (9)\n                <\/p>\n<p>where Ni is the cumulative number of patents at location i, and p controls the strength of cumulative advantage. This rule ensures that locations with a history of patenting activity are more likely to continue growing, reinforcing existing hubs over time<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 12\" title=\"Barab&#xE1;si, A.-L. &amp; Albert, R. Emergence of scaling in random networks. Science 286, 509&#x2013;512 (1999).\" href=\"http:\/\/www.nature.com\/articles\/s44260-025-00054-y#ref-CR12\" id=\"ref-link-section-d455046205e3569\" target=\"_blank\" rel=\"noopener\">12<\/a>.<\/p>\n<p>With probability \u03bd(N), a new location is introduced instead. The rate of new location entry decays exponentially as the system matures:<\/p>\n<p>$$\\nu (N)={\\nu }_{0}{e}^{-\\lambda N},$$<\/p>\n<p>\n                    (10)\n                <\/p>\n<p>where N is the total number of patents added so far, \u03bd0 is the initial rate of entry, and \u03bb controls the rate of decay. This reflects the empirically observed decline in new geographic entrants as existing hubs accumulate more patents (see Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s44260-025-00054-y#Fig2\" target=\"_blank\" rel=\"noopener\">2<\/a>c).<\/p>\n<p>When a new location is added, its spatial coordinates are determined by a L\u00e9vy flight originating from the most recent location. 
L\u00e9vy flights combine frequent short steps with occasional long jumps and are widely used to model exploratory dynamics in spatially embedded systems<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 57\" title=\"Sims, D. W. et al. Scaling laws of marine predator search behaviour. Nature 451, 1098&#x2013;1102 (2008).\" href=\"http:\/\/www.nature.com\/articles\/s44260-025-00054-y#ref-CR57\" id=\"ref-link-section-d455046205e3668\" target=\"_blank\" rel=\"noopener\">57<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 58\" title=\"Reynolds, A. M. &amp; Rhodes, C. J. The l&#xE9;vy flight paradigm: random search patterns and mechanisms. Ecology 90, 877&#x2013;887 (2009).\" href=\"http:\/\/www.nature.com\/articles\/s44260-025-00054-y#ref-CR58\" id=\"ref-link-section-d455046205e3671\" target=\"_blank\" rel=\"noopener\">58<\/a>. Distances are sampled from a L\u00e9vy distribution (via scipy.stats.levy, with scale s\u2009=\u20090.1), and angles are drawn uniformly from [0, 2\u03c0). If the resulting location falls outside U.S. borders (defined by Co, using Natural Earth 110\u2009m landmass shapefiles<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 59\" title=\"Kelso, N. V. &amp; Patterson, T. Natural earth vector. Cartogr. Perspect. 64, 45&#x2013;50 (2009).\" href=\"http:\/\/www.nature.com\/articles\/s44260-025-00054-y#ref-CR59\" id=\"ref-link-section-d455046205e3691\" target=\"_blank\" rel=\"noopener\">59<\/a>), the jump is repeated until a valid point is found.<\/p>\n<p>The simulation proceeds for L iterations, producing a synthetic history of patenting locations and timestamps. 
This output is evaluated using the spatial methods introduced above, including rank distributions, fractal clustering, and the coefficient of variation CV(R). By systematically varying parameters p and \u03bd0, we map a continuous space of innovation regimes, from highly concentrated to widely dispersed systems (see Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s44260-025-00054-y#Fig6\" target=\"_blank\" rel=\"noopener\">6<\/a>b).<\/p>\n<p>This model can be interpreted as a spatial urn process with reinforcement and exploration<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 25\" title=\"Duran-Nebreda, S., O&#x2019;Brien, M. J., Bentley, R. A. &amp; Valverde, S. Dilution of expertise in the rise and fall of collective innovation. Humanit. Soc. Sci. Commun. 9, 1&#x2013;10 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s44260-025-00054-y#ref-CR25\" id=\"ref-link-section-d455046205e3718\" target=\"_blank\" rel=\"noopener\">25<\/a>. The pseudocode below summarizes the model\u2019s implementation and variable definitions.<\/p>\n<p>                  Algorithm 1<\/p>\n<p><b>Stochastic model of patenting with reinforcement and L\u00e9vy dispersal<\/b><\/p>\n<p><img decoding=\"async\" class=\"c-article-section__figure--1-border-image\" alt=\"\" width=\"703\" src=\"https:\/\/www.europesays.com\/uk\/wp-content\/uploads\/2025\/11\/44260_2025_54_Figa_HTML.png\"\/><\/p>\n<p>Analytical estimation of the inequality horizon<\/p>\n<p>This section presents a macroscopic model for estimating the peak of spatial inequality in U.S. patenting activity, termed the inequality horizon (see above). 
The inequality horizon corresponds to the aggregation scale R* at which the coefficient of variation (CV) across patenting locations reaches its maximum.<\/p>\n<p>The coefficient of variation, denoted CV(R), is defined as the ratio of the standard deviation \u03c3R to the mean \u03bcR of aggregated patent counts within clusters defined by a spatial radius R around each location. Modeling the behavior of \u03bcR and \u03c3R as functions of R is thus essential for understanding spatial inequality.<\/p>\n<p>In the framework of L\u00e9vy geometric graphs (LGG)<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 36\" title=\"Plaszczynski, S., Nakamura, G., Deroulers, C., Grammaticos, B. &amp; Badoual, M. Levy geometric graphs. Phys. Rev. E 105, 054151 (2022).\" href=\"http:\/\/www.nature.com\/articles\/s44260-025-00054-y#ref-CR36\" id=\"ref-link-section-d455046205e3788\" target=\"_blank\" rel=\"noopener\">36<\/a>, \u03bcR corresponds to the expected size of the connected component a node belongs to, given a connection radius R. For small R, most nodes remain isolated, and \u03bcR follows a power-law scaling:<\/p>\n<p>$${\\mu }_{R} \\sim {R}^{a},$$<\/p>\n<p>where a is a local aggregation exponent that quantifies the early-stage growth of clusters. Higher values of a indicate faster local connectivity, typically arising from dense short-range interactions. The value of a is influenced by the L\u00e9vy exponent s in the step-length distribution \\(P(l) \\sim {l}^{-s}\\), where smaller s implies more spatial dispersion and suppressed local aggregation.<\/p>\n<p>Unlike standard LGGs, the innovation dynamics modeled here include a reinforcement mechanism through node revisitation. 
At each time step, with probability 1\u2009\u2212\u2009\u03bd, the process returns to an existing node (Eq. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"equation anchor\" href=\"http:\/\/www.nature.com\/articles\/s44260-025-00054-y#Equ10\" target=\"_blank\" rel=\"noopener\">10<\/a>), selected preferentially by accumulated patenting activity (Eq. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"equation anchor\" href=\"http:\/\/www.nature.com\/articles\/s44260-025-00054-y#Equ9\" target=\"_blank\" rel=\"noopener\">9<\/a>). This accelerates the growth of clusters relative to memoryless models, increasing the observed exponent a.<\/p>\n<p>As R approaches a critical scale Rc, a percolation-like transition occurs, and \u03bcR diverges as:<\/p>\n<p>$${\\mu }_{R} \\sim | {R}_{c}-R{| }^{-b},$$<\/p>\n<p>where b is a divergence exponent characterizing the sharpness of the connectivity transition. Due to reinforcement, Rc tends to be lower than in standard LGGs, and the divergence can be steeper, potentially resembling explosive percolation<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 60\" title=\"Tian, L. &amp; Shi, D.-N. The nature of explosive percolation phase transition. Phys. Lett. A 376, 286&#x2013;289 (2012).\" href=\"http:\/\/www.nature.com\/articles\/s44260-025-00054-y#ref-CR60\" id=\"ref-link-section-d455046205e3992\" target=\"_blank\" rel=\"noopener\">60<\/a>.<\/p>\n<p>To describe the full range of R\u2009&lt;\u2009Rc, we propose the following unified scaling ansatz:<\/p>\n<p>$${\\mu }_{R}=c\\,{R}^{a}| {R}_{c}-R{| }^{-b},$$<\/p>\n<p>\n                    (11)\n                <\/p>\n<p>where c is a scale-dependent prefactor reflecting the overall density of innovation nodes. This expression captures both the early growth regime and the divergence near criticality. 
Figure <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s44260-025-00054-y#Fig6\" target=\"_blank\" rel=\"noopener\">6<\/a>g shows a least-squares fit of Eq. (<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"equation anchor\" href=\"http:\/\/www.nature.com\/articles\/s44260-025-00054-y#Equ11\" target=\"_blank\" rel=\"noopener\">11<\/a>) to the simulation data.<\/p>\n<p>Empirical analysis reveals that the standard deviation \u03c3R scales with the mean cluster size \u03bcR following a power-law relation:<\/p>\n<p>$${\\sigma }_{R} \\sim {\\mu }_{R}^{\\delta },$$<\/p>\n<p>\n                    (12)\n                <\/p>\n<p>where \u03b4 is the Taylor exponent, a measure of spatial heterogeneity. This form is equivalent to the classical Taylor\u2019s law stated in terms of variance: \\({\\rm{Var}} \\sim {\\mu }^{2\\delta }\\). We adopt the standard deviation form for clarity and analytic convenience. In this formulation, \u03b4\u2009=\u20091\/2 corresponds to Poisson fluctuations; \u03b4\u2009&gt;\u20091\/2 indicates clustering; and \u03b4\u2009&lt;\u20091\/2 indicates fluctuations more regular than Poisson. We find \u03b4\u2009\u2248\u20093\/4, suggesting nontrivial correlations.<\/p>\n<p>While Eq. (<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"equation anchor\" href=\"http:\/\/www.nature.com\/articles\/s44260-025-00054-y#Equ12\" target=\"_blank\" rel=\"noopener\">12<\/a>) holds asymptotically, empirical deviations occur at low \u03bcR due to sampling noise and finite-size effects. Figure <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s44260-025-00054-y#Fig6\" target=\"_blank\" rel=\"noopener\">6<\/a>g shows that \u03c3R initially grows slowly before transitioning to a power-law regime. 
To account for this behavior, we adopt a composite form:<\/p>\n<p>$${\\sigma }_{R} \\sim {\\mu }_{R}^{\\delta }{\\mathcal{M}}({\\mu }_{R}),$$<\/p>\n<p>\n                    (13)\n                <\/p>\n<p>where<\/p>\n<p>$${\\mathcal{M}}({\\mu }_{R})=\\frac{1}{1+\\exp [-k({\\mu }_{R}-{\\mu }_{0})]}$$<\/p>\n<p>\n                    (14)\n                <\/p>\n<p>is a logistic modulation term that suppresses variability for small cluster sizes (see inset in Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s44260-025-00054-y#Fig6\" target=\"_blank\" rel=\"noopener\">6<\/a>f). Here, \u03bc0 sets the inflection point of the transition, and k controls its steepness.<\/p>\n<p>This analytical framework applies for aggregation scales \\(R &gt; {R}_{\\min }\\approx 0.0{3}^{\\circ }\\) (approximately 3.3\u2009km), above which spatial heterogeneity becomes scale-dependent. Below this threshold, empirical data show that CV(R) remains approximately constant (see Fig. <a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s44260-025-00054-y#Fig5\" target=\"_blank\" rel=\"noopener\">5<\/a>), indicating a scale-invariant regime not captured by our model. This plateau suggests the presence of statistically homogeneous urban cores, consistent with longstanding urban design theories that define compact, walkable neighborhoods with diameters between 3 and 4\u2009km<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 33\" title=\"Perry, C. The neighbourhood unit. 25&#x2013;44 (Routledge\/Thoemmes, 1998).\" href=\"http:\/\/www.nature.com\/articles\/s44260-025-00054-y#ref-CR33\" id=\"ref-link-section-d455046205e4488\" target=\"_blank\" rel=\"noopener\">33<\/a>. 
These stable substructures likely reflect deeply rooted spatial organization in historical city planning and may serve as foundational units in the broader innovation landscape.<\/p>\n<p>To obtain an analytic prediction for the inequality horizon, we approximate the logistic term with a rational function:<\/p>\n<p>$${\\mathcal{M}}({\\mu }_{R}) \\sim \\frac{{\\mu }_{R}}{{\\mu }_{R}+{\\mu }_{0}}.$$<\/p>\n<p>Substituting this into Eq. (<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"equation anchor\" href=\"http:\/\/www.nature.com\/articles\/s44260-025-00054-y#Equ13\" target=\"_blank\" rel=\"noopener\">13<\/a>) and dividing by \u03bcR gives an explicit expression for the coefficient of variation:<\/p>\n<p>$${CV}(R)=\\kappa \\left(\\frac{{\\mu }_{R}^{\\delta }}{{\\mu }_{R}+{\\mu }_{0}}\\right),$$<\/p>\n<p>\n                    (15)\n                <\/p>\n<p>where \u03ba is a constant of proportionality.<\/p>\n<p>To find the scale R* that maximizes CV(R), we treat \u03bcR\u2009=\u2009\u03bc as the independent variable and set the derivative to zero:<\/p>\n<p>$$\\frac{d\\,{CV}}{d\\mu }=\\kappa \\frac{d}{d\\mu }\\left(\\frac{{\\mu }^{\\delta }}{\\mu +{\\mu }_{0}}\\right)=0.$$<\/p>\n<p>Applying the quotient rule:<\/p>\n<p>$$\\frac{d\\,{CV}}{d\\mu }=\\kappa \\frac{\\delta {\\mu }^{\\delta -1}(\\mu +{\\mu }_{0})-{\\mu }^{\\delta }}{{(\\mu +{\\mu }_{0})}^{2}}.$$<\/p>\n<p>Setting the numerator to zero and dividing both sides by \\({\\mu }^{\\delta -1}\\):<\/p>\n<p>$$\\delta (\\mu +{\\mu }_{0})-\\mu =0.$$<\/p>\n<p>Solving for \u03bc gives the critical mean cluster size:<\/p>\n<p>$${\\mu }^{* }={\\mu }_{0}\\left(\\frac{\\delta }{1-\\delta }\\right).$$<\/p>\n<p>Substituting \u03bcR for clarity:<\/p>\n<p>$${\\mu }_{R}^{* }={\\mu }_{0}\\left(\\frac{\\delta }{1-\\delta }\\right).$$<\/p>\n<p>\n                    (16)\n                <\/p>\n<p>This condition holds only if \u03b4\u2009&lt;\u20091. For \u03b4\u2009\u2265\u20091, spatial heterogeneity grows faster than mean aggregation, 
and CV(R) increases monotonically toward the percolation threshold.<\/p>\n<p>Given the parametric form of \u03bcR in Eq. (<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"equation anchor\" href=\"http:\/\/www.nature.com\/articles\/s44260-025-00054-y#Equ11\" target=\"_blank\" rel=\"noopener\">11<\/a>), we can numerically invert \\({\\mu }_{R}^{* }\\) to find the corresponding inequality horizon R*. The parameters a, b, c, Rc, \u03b4, \u03bc0, and \u03ba are fit to empirical data using nonlinear regression. Our Python code implements this procedure and provides theoretical estimates of the inequality horizon for any spatially distributed dataset of innovation activity.<\/p>\n","protected":false},"excerpt":{"rendered":"Patent database A comprehensive database was developed to analyze the spatial and temporal distribution of patent activity within&hellip;\n","protected":false},"author":2,"featured_media":545657,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[5311],"tags":[175350,175351,38987,11705,70125,56061,22100,53,49,978,659],"class_list":{"0":"post-545656","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-united-states","8":"tag-applications-of-graph-theory-and-complex-networks","9":"tag-applications-of-nonlinear-dynamics-and-chaos-theory","10":"tag-complex-networks","11":"tag-complex-systems","12":"tag-complexity","13":"tag-interdisciplinary-studies","14":"tag-mathematical-models-of-cognitive-processes-and-neural-networks","15":"tag-technology","16":"tag-united-states","17":"tag-us","18":"tag-usa"},"share_on_mastodon":{"url":"https:\/\/pubeurope.com\/@uk\/115484248828578827","error":""},"_links":{"self":[{"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/posts\/545656","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"
https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/comments?post=545656"}],"version-history":[{"count":0,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/posts\/545656\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/media\/545657"}],"wp:attachment":[{"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/media?parent=545656"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/categories?post=545656"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/tags?post=545656"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}