Data science (DS) plays a transformative role in advancing scientific discovery and informing policy by enabling the systematic analysis of large and complex data sets. In the health sector, it supports the development of predictive models, optimizes intervention strategies, and informs evidence-based decision-making, particularly in resource-constrained settings. Data science can be used to develop tools for diagnosis, treatment, and prevention of diseases that disproportionately overburden Africa to improve health outcomes. Harnessing the full potential of data science (DS) capacity is challenging because there is a lack of financial investment in DS in Africa. There are compelling reasons to channel more resources into African DS infrastructure. An opportunity exists to intensify investments to train and retain data scientists in Africa, build and scale-up data science (DS) infrastructure, and coordinate efforts between governments to sustain DS initiatives. To achieve these outcomes, sustained funding and investment for DS is needed. This commentary discusses the critical role of DS in addressing health challenges in Africa, emphasizing the need for greater investment in DS infrastructure, talent development, and research funding. It highlights the shortage of DS professionals across the continent, and the limited financial resources allocated to DS. The article advocates for strategic investments through collaboration between governments, the private sector, and international partners, while exploring successful funding models and the necessity for improved infrastructure to support DS growth across Africa.

Current status of Data Science research in Africa

A recent analysis of the global distribution of individuals trained in DS identified 11,400 professionals in North America, Europe, and Asia1. IIn contrast, Africa had a limited number of data scientists, with South Africa having a few hundred1. Furthermore, information on the distribution of trained DS beyond South Africa is limited. This information gap underscores the urgency of documenting and mapping current DS capacity, sources and funding distributions. The African Union published the Digital Transformation Strategy for Africa (2020–2030), which aims for “an integrated and inclusive digital society and economy”2. However, there is a scarcity of local financial resources resulting in a dependency on donor funding, limited funding through government initiatives, and private foundations spearheading key investments. With the exception of South Africa, there is a considerable variation in research investment between most African governments and a failure to meet the agreed-upon 1% GDP target for expenditure on scientific research2. The limited overall research expenditure hinders scientific progress, especially in emerging fields including DS, despite its recognition by the African Union as a critical area of investment.

We explored funding channeled to Africa for DS compared to other continents reported by the National Institutes of Health (NIH) World Reporter (Figs. 1 and 2)3. The NIH is one of the major science funders in Africa. Whilst acknowledging that the NIH data does not include all funding for DS in Africa, it highlights that gaps in investment in DS exist. NIH World Reporter shows that about 70% of the funding was from the NIH and 30% from other funders. Only 1.54% of this funding was allocated to Africa for DS, to ~100 unique projects, mainly in South Africa, Kenya, and Nigeria (Fig. 2).

Fig. 1: Annual trends and total investments in funding of data science in Africa.figure 1

Main graph shows data sourced from the NIH World Report (https://worldreport.nih.gov/) which indicates that total global investment from 2017 to 2022 amounted to $2,683.81 million. The (A) insert shows the relative share of global funding allocated to different regions, with just 1.54% invested in data science in Africa. The (B) insert shows the share provided by each funding organization listed to fund DS in Africa. Abbreviations: BMGF Bill Melinda Gates Foundation, CIHR Canadian Institutes of Health Research, EC European Commission, EDCTP European & Developing Countries Clinical Trials Partnership, MRC Medical Research Council, KHIDI Korea Health Industry Development Institute, NHMRC National Health and Medical Research Council, NIH National Institute for Health, Pasteur Pasteur Institute, Wellcome Wellcome Trust.

Fig. 2: Distribution of funded projects by agencies across Africa.figure 2

The size of each circle indicates the number of funded projects in a particular location. Abbreviations: BMGF Bill Melinda Gates Foundation,EC European Commission, EDCTP European & Developing Countries Clinical Trials Partnership, MRC Medical Research Council, NIH National Institute for Health, Pasteur Pasteur Institute, Wellcome Wellcome Trust. The map was generated using Microsoft Power BI software.

The NIH allocate millions of dollars through initiatives such as Human Hereditary and Health in Africa (H3Africa) and Data Science for Health Discovery and Innovation in Africa (DS-I Africa) to support the emerging field of DS on the African continent. Additional funding efforts exist from non-governmental organizations (NGOs), venture capitalists, and non-profit organizations (Table 1).

Table 1 African DS Funding Efforts

Moreover, selected African governments also continue to play a pivotal role in advancing DS within and outside their borders. Academic institutions and educational programs are beginning to receive funding to facilitate collaborative endeavors between academia and data scientists based elsewhere, such as those in industry and private organizations, supported by funders and local governments. Despite these positive developments, reducing gaps in funding is imperative. Robust financial support from diverse sources (e.g., private philanthropy, private sector, and public institutions) is essential to unlock its full potential and drive positive change across Africa4. Exploring diverse funding models is critical for sustained development of DS. Funding from additional sources helps to ensure programs are less vulnerable to changes in funding priorities and attracts various levels of resources, expertise, and perspectives to address challenges in DS. For example, the Developing Excellence in Leadership, Training, and Science in Africa (DELTAS Africa)5 program uses the hub-and-spoke model. This model is designed to create a hub of resources and expertise that act as a focal point for training and innovation. Smaller or less resourced partners can benefit from this hub, enabling collaboration, enhancing capacities and creating opportunities for shared growth. This approach could also be utilized for funding DS. Another approach is the Grand Challenges model, designed to stimulate innovation to solve key global health problems through DS6. Both models share a common strength: they empower African leadership in DS by using established frameworks. However, additional models, such as the Paradigmatic Principal-Agent Model (which emphasizes performance-based accountability), Sector-Differential Model (tailoring support to the unique needs of sectors like health or agriculture), Embedded Principal-Agent Model (which promotes co-design and shared ownership), and Multiple Principal-Agents Model7 (fostering coordination among diverse funders and implementers), could further expand DS capacity in Africa. These models are particularly well suited to the African context as they promote context-specific solutions, foster multi-sectoral collaboration, and mitigate risks associated with reliance on single-source funding, thereby contributing to long-term sustainability of DS initiatives. When effectively nurtured, these models have the potential to significantly boost DS output by enhancing both capacity and training.

Aligning Funders’ Strategies and Priorities

The demand for human and technological resources is escalating, and strategic investments in DS can advance healthcare systems and enhance the well-being of the population. Creating and maintaining comprehensive datasets, biobanks, and electronic health record systems that depict the health status of populations is pivotal. Such data infrastructure empowers nations to better inform healthcare policies and interventions, fostering more effective and targeted approaches8. Within the global landscape, data-driven methodologies enable prediction, management, and mitigation of the impact of epidemics and disease outbreaks, contributing to public health resilience9,10,11. Investing in building talent capacity, innovation and retaining talented researchers in Africa is imperative for elevating DS capabilities in health research. These investments will also be beneficial beyond the healthcare sector, catalyzing innovation and fueling economic growth. This ripple effect will lead to a more sustainable and self-reliant DS ecosystem, fostering a landscape in which Africa can harness the full potential of DS.

Impactful DS initiatives require substantial investments in technology, infrastructure, and talent. However, tight budgets within African governments and the need to prioritize other critical needs present significant obstacles to developing DS on the continent. It has been argued that digital transformation in Africa will be central to maintaining sustainable growth over the next few decades2. Infrastructure improvement has emerged as a crucial factor in bridging the DS gap. High-speed internet and cutting-edge computing technology are the foundation for successful DS endeavors12,13. Lack of reliable internet connections and modern computing facilities across some African countries impedes data collection, analysis, and dissemination14. Substantial disparities in internet access exist across and within African regions, with Southern Africa reporting 60% access compared to only 25% in Central Africa15. Investing in high-speed internet access and establishing well-equipped data centers can significantly enhance the DS landscape. This infrastructure development would empower researchers and data scientists to work more efficiently, resulting in better data-driven solutions and informed decision-making. Aligning government infrastructure development with funders priorities is essential for advancing and sustaining DS in Africa16. DS progress cannot occur without the necessary infrastructure, making it essential to synchronize government investments with funders’ strategic goals. Similar collaborative partnerships and long-term strategic planning have been successfully employed to tackle communicable diseases17. This approach is equally vital for DS. Moreover, donor support for infrastructure development is feasible when embedded within a broader vision for DS that promotes transparency, accountability and long-term sustainability.

Overcoming challenges in financing and nurturing data science initiatives

The shortage of skilled data scientists in Africa poses a significant challenge. While numerous DS programs are available globally, accessibility challenges such as geographical proximity, travel constraints, and limited Internet availability hinder many Africans from benefiting from these opportunities18. There is a compelling need to fund programs that can be developed and expanded within Africa, which requires additional sponsors and partners for their execution. Education and training programs in Africa have recently improved by offering DS skills and training. This surge in development is characterized by the establishment of fellowship programs within higher learning institutions and collaborative initiatives with advanced DS institutions19. Educational institutions should collaborate with industry experts to design relevant, up-to-date curriculum aligned with industry demands. Scholarship programs and internships can be pivotal in developing local talent, making DS a more accessible career choice for Africans. This approach promotes skill development and strengthens workforces in other STEM-related fields. In addition, there remains a need to retain talent in African institutions. Investments also need to fund positions post-training so that the skill sets developed can support the growth of local capacity building.

As DS continues to exert its transformative influence on our world, strategically allocating research, innovation, and educational resources in this dynamic field is an indispensable catalyst for progress. While existing global funding models have undoubtedly encouraged the discipline in high-income countries (HIC), there is currently insufficient research, training, and investment to meet the multifaceted demands and opportunities presented by DS in diverse settings, especially in low- and middle-income countries (LMICs).

The importance of strategic investments in data science for African development

A robust and sustained commitment to adequate funding is imperative to unlock the transformative power of DS fully. Governments, international organizations, and entities within the private sector each play a pivotal role in providing the financial support necessary for meaningful advances. Investing in DS equips countries with tools to address their unique challenges effectively. It is a gateway to economic growth and enhanced global competitiveness. In addition to their immediate health applications, DS initiatives have the potential to spur innovation, cultivate new industries, and position nations at the forefront of a rapidly evolving digital landscape. As we navigate the data-driven future, strategic and substantial investments in DS are not merely an option but required to foster a more resilient, equitable, and prosperous global society.