Geospatial Genomics in Cassava: Integrating GIS, Remote Sensing, and Genomic Tools for Disease-resistant Variety Development in Africa

Vandi Amara; Alusaine Edward Samura; Prince Emmanuel Norman; James Kargbo; Suffian Mansaray

doi:doi:10.11648/j.ijgg.20261402.14

Review Article |

| Peer-Reviewed

Geospatial Genomics in Cassava: Integrating GIS, Remote Sensing, and Genomic Tools for Disease-resistant Variety Development in Africa

Vandi Amara^*

, Alusaine Edward Samura

, Prince Emmanuel Norman

, James Kargbo

, Suffian Mansaray

Published in International Journal of Genetics and Genomics (Volume 14, Issue 2)

Received: 15 March 2026 Accepted: 26 March 2026 Published: 19 May 2026

Views: Downloads:

Download PDF

Share This Article

Twitter
Linked In
Facebook

Abstract

Cassava (Manihot esculenta Crantz) is a vital staple crop in sub-Saharan Africa, underpinning food security, rural livelihoods, and agro-industrial development. Despite its resilience to marginal soils and drought, cassava productivity is severely constrained by viral and bacterial diseases, notably cassava mosaic disease (CMD), cassava brown streak disease (CBSD), and cassava bacterial light (CBB). Conventional breeding approaches for disease resistance remain limited by long phenotyping cycles, genotype-by-environment interactions, and inadequate spatial integration. This review highlights the emerging role of geospatial genomics, an interdisciplinary framework that integrates Geographic Information Systems (GIS), remote sensing, and genomic tools to accelerate cassava improvement. GIS enables spatial mapping of disease incidence and environmental gradients, while remote sensing technologies such as satellite indices and drone-based hyperspectral imaging provide real-time crop health monitoring. Advances in genomics, including SNP genotyping, genotyping-by-sequencing, and marker-assisted selection, facilitate the identification of resistance loci and predictive breeding. Integrating spatial datasets with genomic information through environmental association analyses enhances understanding of genotype-environment interactions and supports climate-resilient breeding strategies. Applications of geospatial genomics in cassava include disease surveillance, predictive modeling of outbreaks, targeted germplasm deployment, and improved selection efficiency across diverse agro-ecological zones. Through linking environmental variability, disease dynamics, and genetic diversity, geospatial genomics offers a comprehensive pathway to develop disease-resistant cassava varieties, thereby strengthening food security and sustainable agriculture in Africa.

Published in	International Journal of Genetics and Genomics (Volume 14, Issue 2)
DOI	10.11648/j.ijgg.20261402.14
Page(s)	76-98
Creative Commons	This is an Open Access article, distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution and reproduction in any medium or format, provided the original work is properly cited.
Copyright	Copyright © The Author(s), 2026. Published by Science Publishing Group

Keywords

1. Introduction

Cassava (Manihot esculenta Crantz) is one of the most important staple crops in sub-Saharan Africa, playing a central role in food security, rural livelihoods, and agricultural economies across the continent. The crop is widely cultivated by smallholder farmers due to its adaptability to marginal soils, tolerance to drought, and ability to produce reasonable yields under low-input conditions. These characteristics make cassava particularly important in regions facing climate variability and food insecurity

[1, 2]

. Cassava also serves as a strategic food reserve because its roots can remain in the soil for extended periods and be harvested when needed, thereby providing a reliable source of food during periods of scarcity

[3]

. Cassava is a major source of dietary energy for millions of people in Africa. Globally, more than 800 million people depend on cassava as a staple food, with the majority residing in sub-Saharan Africa

[2]

. The storage roots are rich in carbohydrates, particularly starch, making them an important source of calories in rural diets. In several African countries, cassava makes a substantial contribution to daily caloric intake and serves as a staple food, processed into various traditional products such as gari, fufu, lafun, and tapioca

[4, 5]

. Beyond the roots, cassava leaves are also consumed as a vegetable in many African communities and are valued for their relatively high protein, vitamin, and mineral content compared with the roots. The leaves provide essential micronutrients, including vitamin A, iron, and calcium, thereby contributing to improved nutritional outcomes in cassava-based diets

[6]

Beyond its importance for household consumption, cassava has become an increasingly valuable industrial crop. Its starch is widely used in producing flour, sweeteners, adhesives, bioethanol, textiles, and pharmaceuticals. Cassava is also a critical ingredient for livestock feed and the processed food sector, driving value addition and supporting rural economic growth

[2, 7]

. As a result, cassava is now seen not just as a subsistence crop but as a strategic commodity for agro-industrial advancement in many African countries. Africa leads global cassava production, accounting for over 60% of the world’s output, with annual production exceeding 200 million tonnes

[2]

. Nigeria is the top producer, with more than 60 million tonnes per year, followed by the Democratic Republic of Congo, Ghana, Tanzania, Mozambique, and Angola all of which depend heavily on cassava for food security and rural incomes

[2, 8]

. Favorable agro-ecological conditions and widespread adoption in smallholder systems explain these countries’ dominance in cassava production. However, average yields in Africa (8-12 t/ha) are well below the crop’s biological potential, which can surpass 40 t/ha with improved varieties and optimal management

[1, 9]

. This yield gap is mainly caused by biotic stresses such as cassava mosaic and brown streak diseases along with poor soils, lack of quality planting material, and suboptimal agronomic practices. Given cassava’s resilience and multiple uses, it remains foundational to agriculture and food security across Africa. Boosting productivity and disease resistance through modern breeding, genomics, and geospatial tools is critical for building sustainable food systems throughout the continent.

Download: Download full-size image

Figure 1. Conceptual framework showing the integration of GIS, remote sensing, and genomic data for disease-resistant and climate-resilient cassava breeding.

2. Objectives of the Review

This review aims to synthesize current knowledge on the emerging role of geospatial technologies and genomic tools in cassava improvement, particularly for disease surveillance and resistance breeding in Africa. Cassava production across the continent is increasingly threatened by viral and bacterial diseases, climate variability, and limited access to improved varieties. Integrating spatial technologies such as Geographic Information Systems (GIS) and remote sensing with modern genomic tools provides new opportunities to enhance disease monitoring and accelerate the development of resistant cassava varieties

[10, 11]

. One of the key objectives of this review is to examine the role of GIS and remote sensing technologies in monitoring and mapping cassava diseases across different agro-ecological environments. GIS enables the spatial analysis and visualization of disease distribution, allowing researchers to identify hotspots of infection and assess the influence of environmental factors such as climate, soil conditions, and cropping systems on disease spread. Spatial epidemiological studies have shown that GIS-based analyses can effectively map the distribution of cassava mosaic disease and cassava brown streak disease across Africa, thereby supporting early warning systems and targeted management strategies

[9, 10]

Remote sensing technologies further strengthen disease surveillance by providing large-scale and real-time information on crop health. Satellite-based vegetation indices such as the Normalized Difference Vegetation Index (NDVI) and Enhanced Vegetation Index (EVI) can detect plant stress caused by pathogens before visible symptoms appear. Advances in drone-based imaging and hyperspectral sensors have also enabled high-resolution monitoring of crop conditions and early detection of disease outbreaks in agricultural landscapes

[12, 13]

. These technologies provide valuable tools for improving plant disease monitoring and supporting precision agriculture systems. Another important objective of this review is to explore the contribution of genomic tools to cassava disease resistance breeding. Recent advances in plant genomics have significantly improved the ability of breeders to identify genetic loci associated with important traits such as disease resistance, yield stability, and environmental adaptation. High-throughput sequencing technologies, including single nucleotide polymorphism (SNP) genotyping and genotyping-by-sequencing (GBS), have enabled the characterization of genetic diversity within cassava populations and the identification of markers linked to resistance genes

[14]

. These genomic tools facilitate modern breeding approaches such as marker-assisted selection (MAS) and genomic selection (GS). Marker-assisted selection allows breeders to screen breeding populations for specific resistance genes, thereby accelerating the selection process. Similarly, genomic selection uses genome-wide marker data to predict the breeding value of genotypes, which significantly reduces the time required to develop improved cassava varieties

[11]

. The integration of genomic technologies into cassava breeding programs has therefore become an important strategy for developing varieties resistant to major diseases affecting cassava production in Africa.

The final objective of this review is to highlight strategies for integrating spatial technologies with genomic data to enhance cassava improvement programs in Africa. Combining environmental datasets such as climate variables, soil characteristics, and disease distribution patterns with genomic information enables researchers to identify genotype-environment relationships and adaptive genetic variation across landscapes. This integrated approach, often referred to as landscape genomics or geospatial genomics, allows researchers to better understand how environmental conditions influence the distribution of beneficial alleles related to disease resistance and stress tolerance

[15]

. Such integration can improve the identification of suitable breeding environments, guide the deployment of improved cassava varieties, and support predictive modeling of disease risk under future climate scenarios. As cassava is cultivated across highly diverse agro-ecological zones in Africa, the use of geospatial genomics can enhance breeding efficiency by ensuring that newly developed varieties are adapted to specific environmental conditions and disease pressures

[9]

. Ultimately, integrating GIS, remote sensing, and genomic tools provides a comprehensive framework for accelerating cassava improvement and strengthening food security across the African continent.

3. Major Cassava Diseases Affecting Productivity

Cassava production in Africa is severely constrained by several diseases, among which viral and bacterial diseases are the most destructive. Among these, Cassava Mosaic Disease (CMD), Cassava Brown Streak Disease (CBSD), and Cassava Bacterial Blight (CBB) are considered the most economically important diseases affecting cassava productivity across the continent. These diseases significantly reduce yield, compromise root quality, and threaten food security in many cassava-growing regions

[10, 16]

. Cassava Mosaic Disease (CMD) is one of the most devastating diseases affecting cassava production in Africa. The disease is caused by a complex of cassava mosaic begomoviruses (CMBs) belonging to the family Geminiviridae. The most common viral species associated with CMD include African cassava mosaic virus (ACMV) and East African cassava mosaic virus (EACMV) and their variants

[10]

. CMD is primarily transmitted by the whitefly vector Bemisia tabaci as well as through infected stem cuttings used as planting material. The disease is characterized by typical symptoms such as leaf mosaic patterns, chlorosis, leaf distortion, and stunted plant growth, which ultimately reduce photosynthetic capacity and root development

[17, 18]

. The disease is widespread across all cassava-growing regions of Africa and remains a major constraint to cassava productivity. Severe infections can cause significant yield losses, sometimes exceeding 80%, depending on the cassava genotype, virus strain, and environmental conditions. Co-infection by ACMV and EACMV has been reported to cause crop losses of up to 82% in susceptible varieties

[10, 19]

Cassava Brown Streak Disease (CBSD) is another major viral disease that threatens cassava production, particularly in eastern and central Africa. The disease is caused by cassava brown streak viruses (CBSVs) belonging to the genus Ipomovirus in the family Potyviridae

[20]

. CBSD is transmitted by the whitefly Bemisia tabaci and through infected planting materials. Unlike CMD, CBSD symptoms may appear mild on leaves but cause severe damage to cassava roots. Infected plants often exhibit leaf chlorosis, brown streaks on stems, and necrotic lesions in storage roots. Root necrosis renders cassava roots unmarketable and unsuitable for consumption, leading to significant economic losses

[16, 19]

. Yield losses associated with CBSD can reach up to 70% in heavily infected fields. In addition to reducing yield, the disease significantly affects root quality, making it a major threat to both food security and cassava-based agro-industries in affected regions

[16]

. Cassava Bacterial Blight (CBB) is a destructive bacterial disease caused by Xanthomonas axonopodis pv. manihotis. The disease occurs widely in tropical cassava-growing regions and has been reported across Africa, Latin America, and Asia

[21, 22]

. CBB is spread primarily through infected planting materials, contaminated tools, and rain splash, which facilitates bacterial dispersal between plants. The disease is characterized by water-soaked leaf lesions, angular leaf spots, wilting, leaf blight, and dieback of stems. Under severe infection, defoliation and plant death may occur, significantly reducing root yield and plant vigor

[22]

. Yield losses caused by CBB vary widely depending on environmental conditions and cultivar susceptibility but can exceed 20–30% in susceptible varieties under favorable conditions for disease development. The disease remains a persistent constraint to cassava productivity, particularly in regions with high rainfall and humidity that favor bacterial spread

[22]

4. Limitations of Conventional Breeding Approaches for Cassava Disease Resistance

Despite significant progress in cassava improvement, conventional breeding approaches face several constraints that limit their efficiency in developing disease-resistant varieties. Traditional cassava breeding relies heavily on phenotypic selection and multi-location field evaluations, which are often slow, resource-intensive, and influenced by environmental factors. These limitations have prompted the exploration of modern approaches such as genomic selection, high-throughput phenotyping, and geospatial analysis to accelerate breeding programs

[15, 23]

5. Time-consuming Phenotyping

One of the major limitations of conventional cassava breeding is the long duration required for phenotypic evaluation of traits, including disease resistance. Cassava has a relatively long growth cycle, typically ranging from 12 to 18 months, which significantly slows the breeding process because breeders must wait for plants to reach maturity before evaluating root and disease-related traits

[23, 24]

. Furthermore, conventional breeding programs depend heavily on field-based phenotypic selection, which involves evaluating large breeding populations across several years and locations. This process is labor-intensive, costly, and often limited in throughput, restricting the number of genotypes that can be effectively assessed

[25]

. In many breeding programs, selection cycles based solely on phenotypic data can take four to six years or more, largely due to the need for repeated field trials and vegetative propagation of planting materials

[26]

. These extended evaluation cycles delay the release of improved varieties and reduce the overall rate of genetic gain in cassava breeding programs.

6. Environmental Variability Affecting Disease Expression

Another major limitation of conventional breeding is the strong influence of environmental factors on the expression of disease resistance traits. Disease severity in cassava is often affected by environmental conditions such as temperature, rainfall, soil fertility, and vector populations. As a result, phenotypic expressions of resistance traits can vary significantly across different locations and seasons

[27, 28]

. This phenomenon, known as genotype-by-environment interaction (G×E), complicates the identification of stable disease-resistant genotypes. Studies have shown that environmental effects can contribute substantially to phenotypic variation in cassava disease resistance traits, often resulting in low to moderate heritability and reduced selection accuracy

[29]

. Because of these environmental influences, breeders must conduct multi-environment trials to reliably evaluate resistance, which increases both the cost and duration of breeding programs. In some cases, environmental variation can mask genetic resistance, making it difficult to identify truly resistant genotypes.

7. Limited Spatial Integration in Breeding Programs

Traditional cassava breeding programs typically rely on localized field trials and experimental stations, which limit the integration of spatial information related to disease distribution, climate variability, and agro-ecological conditions

[30]

. Consequently, breeding strategies may not fully capture the geographic variability of disease pressures across cassava-growing regions. For example, disease outbreaks such as cassava mosaic disease and cassava brown streak disease often exhibit strong spatial patterns linked to climate, vector distribution, and cropping systems, which are not always considered in conventional breeding frameworks

[31]

. Integrating geospatial tools such as Geographic Information Systems (GIS), remote sensing, and spatial epidemiological models can improve the understanding of disease dynamics and support more targeted breeding strategies. Without such integration, conventional breeding approaches may fail to address location-specific disease pressures and environmental constraints.

8. Emergence of Geospatial Genomics in Crop Improvement

The rapid advancement of genomics, spatial analysis, and environmental monitoring technologies has led to the emergence of geospatial genomics, an interdisciplinary approach that integrates genomic information with spatially explicit environmental and geographic data. This approach has become increasingly important in crop science because plant performance and adaptation are strongly influenced by spatial environmental heterogeneity. Linking genetic variation with environmental variables across landscapes, geospatial genomics provides a powerful framework for understanding crop adaptation, identifying disease-resistant genotypes, and guiding climate-resilient breeding strategies

[32, 33]

. Geospatial genomics, sometimes referred to as landscape genomics or spatial genomics, focuses on identifying associations between genetic variation and environmental gradients across geographic landscapes. The approach combines tools from population genetics, geographic information systems (GIS), and environmental modeling to investigate how natural or artificial selection shapes genetic diversity in crops and their wild relatives

[32, 34]

. In crop improvement programs, geospatial genomics enables researchers to detect adaptive genetic loci associated with environmental conditions such as temperature, rainfall, soil characteristics, and disease pressure. These insights are particularly valuable for crops like cassava that are grown across highly diverse agro-ecological zones in Africa. By identifying alleles linked to adaptation or resistance traits, breeders can accelerate the development of varieties that perform well under specific environmental conditions

[33]

Another important contribution of geospatial genomics is its role in understanding genotype-environment interactions (G×E). Traditional breeding programs often struggle to identify stable genotypes across diverse environments due to strong environmental influences on trait expression. Geospatial genomics addresses this challenge by incorporating environmental covariates into genomic analyses, thereby improving the accuracy of selection and prediction of crop performance across landscapes

[29]

. The approach has also gained importance in addressing challenges associated with climate change, which is expected to alter disease distribution, vector dynamics, and crop productivity. Combining genomic and spatial environmental data, researchers can predict how crop populations may respond to future climate scenarios and identify genetic resources suitable for climate-resilient breeding programs

[33]

9. Integration of Spatial Data with Genomic Datasets

A key feature of geospatial genomics is the integration of spatial environmental datasets with genomic information to better understand crop adaptation and disease resistance. Spatial datasets may include climate variables (temperature, rainfall, humidity), soil characteristics, land use patterns, and disease distribution maps derived from field surveys or remote sensing technologies. These environmental layers are typically analyzed using geographic information systems (GIS) and spatial statistical methods

[34]

. Genomic datasets, on the other hand, are generated using high-throughput sequencing technologies such as single nucleotide polymorphism (SNP) genotyping, whole-genome sequencing, and genotyping-by-sequencing (GBS). These genomic markers provide detailed information on genetic variation within breeding populations or germplasm collections

[31]

. The integration of these datasets is commonly achieved through environmental association analysis (EAA) and genotype-environment association studies, which identify correlations between genetic markers and environmental variables. Such analyses can reveal genomic regions that contribute to adaptation to specific environmental conditions or resistance to diseases

[32]

. In recent years, advances in remote sensing and geospatial technologies have further enhanced the potential of geospatial genomics. Satellite imagery, drone-based phenotyping, and climate data platforms allow researchers to collect high-resolution environmental information across large geographic areas. When combined with genomic data, these tools enable the development of predictive models that identify optimal breeding sites, detect disease hotspots, and guide the deployment of improved crop varieties

[30]

. For crops such as cassava, which are cultivated across diverse ecological regions in Africa, the integration of spatial and genomic datasets offers significant opportunities for improving disease resistance and productivity. Linking field phenotypes, environmental conditions, and genetic information, geospatial genomics provides a more holistic approach to crop improvement compared with traditional breeding methods.

10. Conceptual Framework of Geospatial Genomics

The concept of geospatial genomics integrates genomic information with spatially explicit environmental and geographic data to understand crop adaptation, disease dynamics, and genetic diversity across landscapes. This framework provides a systematic approach for combining genetic, environmental, and spatial information to accelerate breeding programs, particularly for disease-resistant crops such as cassava in Africa. Landscape genomics refers to the study of the relationship between genomic variation and environmental heterogeneity across landscapes. It combines population genetics with ecological and geographic analyses to identify adaptive genetic loci associated with environmental gradients or disease pressures

[36]

. Spatial genetics focuses on the distribution of genetic variation within and among populations in a spatial context. By incorporating spatial structure into genetic analyses, researchers can detect patterns of gene flow, local adaptation, and genetic clustering, which are critical for designing effective breeding strategies and conservation plans

[34]

. Environmental association analysis (EAA) is a key analytical principle within geospatial genomics. It identifies associations between genetic markers and environmental variables such as temperature, precipitation, soil type, or disease incidence. These analyses allow researchers to link adaptive traits, including disease resistance, to specific environmental conditions, enhancing predictive breeding and targeted germplasm deployment

[37]

10.1. Components of Geospatial Genomics

The geospatial genomics framework for crop improvement is founded on the integration of three complementary components that together enable a comprehensive understanding of genotype–environment interactions. First, Geographic Information Systems (GIS) provides a critical platform for visualizing, mapping, and analyzing spatial patterns of environmental variables and disease occurrence. Through GIS, researchers can identify disease hotspots, examine environmental gradients, and explore spatial correlations between genomic variation and ecological factors, which is particularly valuable for understanding the distribution of viral diseases in cassava across Africa

[38]

. Second, remote sensing technologies offer high-resolution phenotypic and environmental data using satellite imagery, unmanned aerial vehicles (UAVs), and hyperspectral sensors. These technologies enable the assessment of vegetation indices, spectral signatures, and early signs of plant stress, thereby facilitating large-scale monitoring of crop health. When combined with GIS, remote sensing provides a dynamic toolset for detecting disease outbreaks and environmental constraints that may influence genetic expression

[39]

. Third, the integration of genomic data introduces the genetic dimension necessary for precision breeding. High-throughput genotyping platforms, including single nucleotide polymorphism (SNP) markers, genotyping-by-sequencing (GBS), and whole-genome sequencing, generate detailed genetic profiles that allow for the identification of alleles linked to disease resistance and environmental adaptation. When these genomic datasets are overlaid with spatial and environmental information, researchers can perform genotype-environmental association analyses that directly inform breeding decisions and accelerate the selection of superior genotypes

[35, 40]

. The intersection of GIS, remote sensing, and genomic data establishes a holistic geospatial genomics framework, linking environmental variability, disease dynamics, and genetic diversity. This integrated approach provides the basis for more precise and efficient breeding strategies, enabling the development of cassava varieties that are both resilient to disease and optimized for performance across diverse agroecological zones.

10.2. Applications in Crop Disease Resistance Research

The geospatial genomics framework has emerged as a powerful tool for advancing disease resistance research in crops, particularly in cassava, which is highly susceptible to viral diseases such as cassava mosaic disease (CMD) and cassava brown streak disease (CBSD). One of the primary applications of this approach is the identification of disease-resistant genotypes. By correlating spatial patterns of disease incidence with genomic variation, researchers can pinpoint alleles that confer resistance to viral, bacterial, or fungal pathogens. This method has been effectively applied in cassava to identify resistance loci associated with CMD and CBSD, providing valuable targets for breeding programs

[38, 41]

In addition to identifying resistant genotypes, geospatial genomics facilitates predictive modeling of disease outbreaks. Remote sensing technologies, combined with GIS, allow for the continuous monitoring of disease progression and environmental conditions that favor pathogen spread. When integrated with genotypic data, these spatial analyses enable the prediction of disease risk across landscapes, supporting early intervention and disease management strategies

[39, 42]

. The framework also contributes to guiding germplasm deployment, ensuring that disease-resistant varieties are strategically placed in agroecologically suitable regions. This targeted placement optimizes environmental adaptation and minimizes losses due to pathogen pressure or other abiotic stresses, ultimately improving both crop performance and food security

[43]

. Moreover, geospatial genomics enhances breeding efficiency by integrating spatial and genomic datasets to prioritize candidate genotypes for multi-environment trials. This integration reduces the time and cost associated with traditional breeding approaches while increasing the likelihood of selecting stable, disease-resistant varieties that perform well across diverse agroecological conditions

[40]

10.3. Geographic Information Systems (GIS) in Cassava Disease Mapping

Geographic Information Systems (GIS) have become essential tools in agricultural research, providing methods to visualize, analyze, and interpret spatial data related to crop production, disease incidence, and environmental conditions. By applying GIS techniques, researchers can better understand patterns in agricultural landscapes, identify areas of high risk for diseases, and support decision-making in crop improvement programs. Spatial interpolation is a GIS technique used to estimate values of a variable at unsampled locations based on observed data from sampled points. In agriculture, this method is widely applied to map soil properties, disease incidence, yield variability, and climatic factors across fields or regions

[44]

. Common interpolation methods include Inverse Distance Weighting (IDW), Kriging, and Spline interpolation, which allow researchers to generate continuous surfaces from discrete data points. For example, Kriging has been employed to map cassava mosaic disease severity across multiple districts, enabling breeders and extension agents to identify hotspots and prioritize management interventions

[41, 45]

. Spatial interpolation also supports precision agriculture by providing detailed environmental layers that can be integrated with genomic data to predict genotype-environment interactions.

Kernel density mapping is a spatial analysis technique that calculates the density of events or features within a defined neighborhood, producing a smooth surface representing concentrations or clusters. In agricultural research, kernel density mapping is often used to identify disease hotspots, pest infestations, or high-risk zones across landscapes

[44]

. For instance, kernel density maps have been applied to visualize the distribution of cassava brown streak disease vectors and detect areas where disease pressure is highest. By highlighting these zones, researchers can guide targeted field sampling, resource allocation, and the deployment of disease-resistant varieties

[41]

. This method is particularly useful for planning interventions in heterogeneous landscapes where disease incidence is unevenly distributed. Spatial autocorrelation measures the degree to which a spatial variable is correlated with itself over space. Positive spatial autocorrelation indicates that similar values cluster together, whereas negative autocorrelation suggests dispersion. In agricultural research, spatial autocorrelation analyses help determine whether disease occurrences, yield patterns, or pest infestations are randomly distributed or exhibit spatial dependence

[46, 47]

. Metrics such as Moran’s I, Geary’s C, and Getis-Ord Gi* are commonly used to quantify spatial autocorrelation. Understanding the spatial structure of disease incidence or crop traits is critical for developing targeted breeding strategies and for designing sampling schemes that accurately capture field variability

[47]

. For cassava, spatial autocorrelation studies have been applied to evaluate the clustering of disease severity and guide the placement of experimental plots for resistance evaluation.

10.4. GIS-based Mapping of Cassava Diseases

Mapping cassava diseases using Geographic Information Systems (GIS) has become a critical tool for understanding disease prevalence, distribution, and spread patterns across African agro-ecologies. By combining field survey data with spatial analysis techniques, researchers can identify hotspots of disease, guide sampling strategies, and inform targeted interventions for breeding and management of resistant cassava varieties. Disease prevalence mapping involves the spatial representation of disease incidence and severity across geographic areas. GIS enables researchers to visualize the distribution of major cassava diseases such as cassava mosaic disease (CMD) and cassava brown streak disease (CBSD), highlighting areas with high infection rates and identifying trends over time. For example, studies in East and Central Africa have used GIS to map CMD and CBSD prevalence across multiple districts, revealing both localized outbreaks and regional patterns influenced by environmental factors such as temperature, rainfall, and cropping practices

[48, 49]

. These maps allow breeders and extension agents to prioritize high-risk areas for disease monitoring, resistance screening, and targeted deployment of improved varieties.

GIS-based prevalence mapping also facilitates the integration of temporal data, enabling the tracking of disease progression over multiple planting seasons. This dynamic mapping supports the evaluation of disease management strategies and the assessment of long-term effectiveness of resistant varieties

[45]

. Many cassava diseases, particularly CMD and CBSD, are transmitted by insect vectors, primarily whiteflies (Bemisia tabaci). Understanding the spatial distribution of these vectors is critical for predicting disease spread and designing effective control measures. GIS techniques can be used to analyze vector density, dispersal patterns, and habitat suitability, providing insights into areas with the highest risk of disease transmission

[48, 50]

. Kernel density mapping, spatial autocorrelation, and hotspot analysis are commonly applied to identify clusters of vector populations, which often correlate with regions of high disease incidence. When vector distribution maps are combined with environmental variables such as temperature, humidity, and land use, predictive models can be developed to forecast outbreaks and guide integrated pest management strategies

[41]

. Vector distribution analysis is especially valuable in cassava-growing regions where environmental heterogeneity and farming practices influence vector population dynamics. These GIS-driven insights are critical for improving the efficiency of cassava breeding programs by enabling the strategic placement of disease-resistant genotypes in areas most susceptible to vector-borne disease pressure. Geographic Information Systems (GIS) play a critical role in breeding site selection for crops such as cassava by integrating environmental, climatic, and disease data to identify optimal locations for field trials and variety evaluation. Proper site selection ensures that breeding programs accurately capture genotype-environment interactions and evaluate disease resistance under relevant agro-ecological conditions.

10.5. Agro-ecological Zoning

Agro-ecological zoning (AEZ) involves the classification of land into zones based on climatic, edaphic, and topographic factors that influence crop growth and productivity. GIS facilitates AEZ by integrating spatial data on temperature, rainfall, soil type, altitude, and land used to delineate areas suitable for cassava cultivation

[51, 52]

. Using AEZ, breeders can strategically select sites that represent major environmental gradients, ensuring that experimental trials capture the full range of conditions under which cassava varieties may be grown. This approach improves the reliability of disease resistance screening and helps identify genotypes that are broadly adapted or specifically adapted to agro-ecologies

[43]

. AEZ also supports long-term planning by highlighting areas vulnerable to climate change, thereby informing the development of resilient cassava varieties.

10.6. Disease Hotspot Identification

Identifying disease hotspots is essential for evaluating cassava genotypes under realistic pathogen pressure. GIS-based analysis enables the mapping of areas with consistently high incidence of diseases such as cassava mosaic disease (CMD) and cassava brown streak disease (CBSD)

[53, 54]

. By combining historical disease survey data with environmental and vector distribution layers, GIS helps breeders locate field sites where disease pressure is sufficient to differentiate resistant and susceptible genotypes. This targeted approach reduces the need for artificial inoculation in breeding trials while ensuring that selection occurs under natural and relevant disease conditions. Furthermore, disease hotspot identification supports resource allocation, allowing breeding programs to focus efforts in regions where resistant varieties are most urgently needed. Integrating AEZ with disease hotspot data provides a comprehensive framework for site selection in cassava breeding programs, ensuring trials are conducted across diverse environments and under realistic biotic stress. This approach maximizes the effectiveness of breeding efforts and enhances the likelihood of developing cassava varieties that perform well across Africa’s varied agro-ecological landscapes.

10.7. Case Studies from Africa (Geospatial Genomics in Cassava Improvement)

Several studies across Africa have demonstrated the practical application of GIS, remote sensing, and genomic tools to enhance cassava breeding and disease management. These case studies provide evidence of how geospatial genomics can support the development of disease-resistant varieties and inform strategic interventions. In Uganda, Legg et al.

[55]

conducted a spatio-temporal analysis of CMD prevalence using GIS and disease survey data across multiple districts. By mapping disease incidence and integrating climatic and vector data, the study identified CMD hotspots and key environmental drivers of disease spread. These findings informed site selection for field trials of resistant varieties and supported targeted extension interventions. The study also highlighted areas where whitefly vector populations were highest, enabling predictive modeling of disease outbreaks.

In Tanzania, research by Maruthi et al.

[56]

applied remote sensing and GIS-based disease mapping to assess CBSD prevalence. Using vegetation indices derived from satellite imagery combined with field observations, the study successfully detected early symptoms of CBSD and identified regions at highest risk for spread. These spatial analyses guided breeding programs by selecting trial sites in disease-prone zones and helped prioritize resistant genotypes for farmer dissemination. A study in Nigeria combined genomic selection with GIS-based agro-ecological zoning to improve cassava breeding efficiency

[57]

. Genomic data, including SNP markers linked to disease resistance, were integrated with spatial environmental data to identify genotypes adapted to specific ecological zones. This approach enabled strategic deployment of resistant varieties in regions prone to CMD and CBSD, improving the effectiveness of breeding programs and reducing yield losses.

Legg et al.

[58]

conducted a multi-country disease surveillance program across Uganda, Kenya, and Tanzania, mapping both CMD and CBSD incidence using GIS and spatial modeling. The study revealed cross-border disease dynamics and the role of environmental and vector factors in spreading disease. Integrating spatial epidemiology with breeding programs, the research supported the regional targeting of resistant varieties and informed policy interventions for disease management. A recent study conducted a geo-referenced survey in cassava fields across Sierra Leone to identify and map the distribution of cassava mosaic disease-associated begomoviruses

[59]

. Researchers sampled cassava leaves from 109 smallholder farms and performed molecular diagnostics, revealing the prevalence and spatial distribution of viral strains, including African cassava mosaic-like (ACMV-like) and East African cassava mosaic-like (EACMV-like) viruses. The study also detected the East African cassava mosaic Cameroon virus (EACMCMV) variant in several regions of the country, highlighting significant disease presence across multiple agro-ecological zones.

10.8. Remote Sensing Technologies in Agriculture

Remote sensing has emerged as a powerful tool in modern agriculture, offering non-destructive, scalable, and rapid detection of crop conditions, including disease stress. Its application addresses the limitations of traditional field scouting and laboratory diagnostics, which are often labor-intensive, costly, and slow

[60-62]

. In the context of cassava (Manihot esculenta Crantz), one of the world’s most important staple crops, remote sensing enables early, systematic detection of diseases such as Cassava Mosaic Disease (CMD), Cassava Brown Streak Disease (CBSD), and bacterial blight, which threaten food security across Africa and other tropical regions.

10.9. Satellite Imagery

Satellite remote sensing involves the use of Earth-orbiting sensors to capture reflectance data across spectral bands, enabling assessment of vegetation properties at regional scales. Vegetation indices derived from satellite imagery, such as the normalized difference vegetation index (NDVI), exploit differences in plant reflectance across red and near-infrared wavelengths to quantify crop health and stress (e.g., water stress, nutrient deficiency, disease) over broad landscapes. These indices correlate with chlorophyll content, leaf area, and canopy structure, which are influenced by disease progression

[63]

. In cassava systems, satellite-based approaches have been integrated with machine learning and meteorological data to map and predict outbreaks of CMD, demonstrating that satellite time series combined with predictive models can improve detection accuracy beyond conventional field survey methods

[64]

. The strengths of satellite imagery include wide spatial coverage and systematic temporal revisits, making it suitable for monitoring large agricultural regions and seasonal disease dynamics. However, its limitations, notably moderate spatial resolution and susceptibility to cloud cover, can restrict the ability to detect early, subtle disease symptoms at the individual plant level

[63]

10.10. UAV / Drone Imaging

Unmanned aerial vehicles (UAVs), or drones, have gained traction in agricultural remote sensing because they can carry lightweight sensors over fields at low altitude, capturing high-resolution imagery that reveals detailed crop conditions

[65]

. UAV platforms have been extensively reviewed for detecting plant diseases, and studies indicate that combining UAV imagery with advanced deep learning models improves identification of disease symptoms in crops under field conditions

[65, 66]

. In cassava disease detection, UAV systems equipped with multispectral and RGB sensors have been applied alongside convolutional neural networks and Transformer-based architectures to classify healthy and diseased plants

[65, 66]

. Although there is significant potential, current research highlights methodological limitations such as scarce labeled datasets, inconsistent annotation standards, and limited geographic transferability of models, which affect both reliability and scalability of disease detection across diverse environments

[65]

. Nonetheless, UAV imaging remains a critical bridge between fine-scale field observations and scalable disease monitoring, offering flexibility in flight planning, sensor payloads, and revisit schedules.

10.11. Multispectral and Hyperspectral

Imaging

Multispectral sensors record reflectance in a limited number of broad spectral bands (e.g., red, green, near infrared). These data enable computation of vegetation indices sensitive to physiological changes associated with disease stress. In cassava, handheld active multispectral imaging combined with machine learning has been shown to detect early CBSD infection as soon as 28 days after inoculation, outperforming traditional visual inspection by identifying spectral and spatial patterns indicative of virus presence

[67]

. Hyperspectral imaging (HSI) captures reflectance across hundreds of contiguous narrow bands, providing rich spectral information that facilitates discrimination of subtle biochemical and physiological changes in plant tissues. Recent research combining HSI with machine learning demonstrates high accuracy in diagnosing cassava bacterial blight, with models such as support vector machines (SVM) achieving >90% classification accuracy by differentiating spectral signatures of healthy and infected leaves

[68]

. Broad reviews of hyperspectral image analysis confirm the ability of these systems to detect early disease symptoms, differentiate stress types, and integrate with machine learning for enhanced crop disease identification

[69, 70]

. Multispectral imaging offers practical affordability and ease of deployment, whereas hyperspectral systems provide superior spectral resolution and sensitivity to biochemical changes. However, both technologies face challenges related to data complexity, sensor calibration, and the computational demands of machine learning pipelines. Hyperspectral systems, in particular, generate large data volumes requiring sophisticated algorithms and substantial computing resources for preprocessing and classification

[69]

10.12. Synthesis and Research Gaps

A consistent theme across the literature is that remote sensing enhances the timeliness and scalability of crop disease detection compared with traditional methods. Satellite imagery provides broad coverage and temporal consistency; UAVs enable high-resolution, field-level monitoring; and multispectral/hyperspectral imaging unlocks detailed spectral insights into plant physiology. However, research gaps persist, including the need for standardized datasets, benchmarking protocols, and cross-regional validation to ensure that models trained in one environment generalize elsewhere

[71]

. Integration of remote sensing with machine learning continues to evolve but requires harmonized methodological frameworks and open datasets to accelerate operational deployment for cassava disease surveillance

[71]

10.13. Vegetation Indices for Disease Detection

Vegetation indices (VIs) are spectral transformations of remote sensing reflectance designed to quantify biophysical and biochemical properties of vegetation (e.g., chlorophyll content, canopy structure, photosynthetic activity). They play a central role in remote sensing applications for agricultural monitoring, including early detection of biotic stress such as plant diseases

[72, 73]

. By enhancing spectral signals associated with vegetation health, VIs allow researchers to identify stress patterns before symptoms become visible, enabling timely intervention strategies.

10.14. Normalized Difference Vegetation Index (NDVI)

The Normalized Difference Vegetation Index (NDVI) is the most widely used VI in agricultural remote sensing. It is calculated as the normalized difference between reflectance in the near infrared (NIR) and red (R) bands, exploiting the fact that healthy vegetation strongly reflects NIR radiation and absorbs red light due to chlorophyll absorption:

NDVI = \frac{NIR - RED}{NIR + RED}

NDVI values range between −1 and +1, where higher values indicate denser, healthier vegetation, while lower values suggest sparse vegetation, bare soil, or stressed stands

[74, 75]

. NDVI has been extensively applied across satellite, airborne, and UAV platforms for monitoring vegetation health, biomass, yield potential, and stress responses, including disease-induced declines in greenness. In disease detection applications, NDVI time series analyses can reveal temporal anomalies associated with pathogen attack, where disease progression typically leads to reduced chlorophyll content and canopy degradation

[74]

. Despite its broad utility and simplicity, NDVI can saturate under high biomass conditions, reducing sensitivity to subtle changes caused by disease in dense canopies

[75]

10.15. Enhanced Vegetation Index (EVI)

The Enhanced Vegetation Index (EVI) was developed to overcome some limitations of NDVI, particularly its sensitivity to atmospheric effects and soil background under high biomass conditions. EVI uses reflectance in the blue band in addition to red and NIR and incorporates adjustment coefficients that account for canopy background and aerosol scattering:

EVI = G + \frac{NIR - Red}{NIR + C_{1} \times Red - C_{2} \times Blue + L}

Where G, C₁, C₂, and L are calibration parameters that stabilize the index across different atmospheric and canopy conditions

[76]

. These corrections enhance EVI’s performance in dense vegetation and heterogeneous terrain, making it more responsive to canopy structural changes that may occur as a result of disease stress. In comparison with NDVI, EVI has shown improved discrimination of vegetation vigor under challenging conditions, such as nutrient stress, water limitations, and early disease onset, where NDVI’s sensitivity may be reduced

[76]

. While not specific to any single disease type, EVI often complements NDVI in time series analysis and machine learning frameworks used for automated disease detection.

10.16. Red-edge Vegetation Indices

Red edge indices exploit reflectance information in the “red edge” spectral region, typically between ~690 nm and ~740 nm, a transition zone where vegetation reflectance sharply increases due to chlorophyll absorption in the red and strong reflectance in the near-infrared. Because this region is closely linked to chlorophyll content and leaf biochemical properties, red edge indices often provide enhanced sensitivity to plant physiological changes compared to traditional indices based solely on red and NIR bands

[78]

. Examples of red edge indices include the Normalized Difference Red Edge (NDRE) and chlorophyll indices derived from red edge bands, which replace the red band in the classic NDVI formulation with a red edge band

[78]

. These indices improve the detection of subtle changes in plant physiology, particularly under high-density canopies or in early stages of stress or disease, where NDVI may saturate or fail to discriminate effectively.

NDRE = \frac{NIR - Red Edge}{NIR + Red Edge}

Research shows that red edge vegetation indices (VIs) can reduce saturation effects observed in NDVI and better discriminate between vegetation types and stress states, particularly in regions with high canopy density

[79]

. Their stronger sensitivity to chlorophyll pigment variations makes them valuable for detecting early disturbances such as disease or nutrient imbalances before these manifest in broader structural canopy changes detectable by NDVI or EVI. Studies focused on red edge VIs confirm that including these bands in remote sensing analyses improves vegetation classification and stress detection performance, especially when dense vegetation would otherwise saturate NDVI signals

[79]

10.17. Synthesis and Relevance to Disease Detection

In remote sensing applications, particularly for disease detection, vegetation indices (VIs) serve as quantitative indicators of plant health. NDVI’s extensive historical use provides a baseline for monitoring vegetation changes over time, while EVI enhances sensitivity under dense vegetation and variable atmospheric conditions. Red edge indices further expand detection capability by responding sensitively to physiological changes in plant pigment content, an important marker of early disease stress. A growing body of research demonstrates that combining multiple Vis including NDVI, EVI, and red edge indices within time series or machine learning frameworks improves the accuracy and timeliness of disease detection compared to using any single index alone

[77]

. These multi-index approaches facilitate discrimination between healthy and stressed vegetation, enabling more precise detection of disease onset and progression in crop monitoring systems.

10.18. Early Detection of Cassava Diseases

Early disease detection in cassava relies on identifying subtle changes in plant reflectance patterns before visible symptoms emerge. Healthy plant tissues and those affected by pathogens exhibit distinct spectral signatures because disease alters leaf biochemical and physiological properties, including chlorophyll concentration, water content, and cellular structure. Hyperspectral imaging captures reflectance across dozens to hundreds of narrow wavelength bands, enabling the extraction of spectral profiles that differ between healthy and diseased tissues

[80]

. In a recent study, researchers imaged cassava leaves with a hyperspectral camera covering wavelengths from 400–1000 nm and observed that infected leaves exhibited systematic differences in reflectance compared with healthy leaves. These spectral differences, especially in the near-infrared region, provide a basis for distinguishing diseases from healthy plants

[81]

. Similarly, controlled experiments using spectral data from cassava plants inoculated with viral pathogens demonstrate that spectral information in the visible and near-infrared ranges can reveal infection before visual symptoms appear, allowing detection several weeks earlier than traditional visual scoring

[81]

. Such findings confirm that disease alters leaf spectral signatures in ways that are detectable by remote sensing sensors, forming a foundation for early disease surveillance.

10.19. Machine Learning Approaches for Disease Classification

Machine learning has emerged as a powerful tool for classifying spectral signatures and distinguishing between healthy and diseased cassava plants. By training algorithms on annotated spectral data, models can learn complex patterns that traditional threshold-based methods may miss. Carvalho et al.

[54]

evaluated multiple classification algorithms including Support Vector Machine (SVM), Random Forest (RF), and Neural Networks on spectral data from cassava leaves infected with Xanthomonas phaseoli pv. manihotis. The study found that SVM achieved the highest classification accuracy (>91%), outperforming other models and demonstrating the effectiveness of machine learning for spectral classification of cassava bacterial blight

[82]

. In the case of cassava brown streak disease (CBSD), Peng et al.

[53]

developed a handheld multispectral imaging system combined with spatial-spectral machine learning that achieved reliable early detection as early as 28 days post-inoculation. Their approach fused spectral and spatial information to capture disease signals that are imperceptible to human visual inspection, highlighting how machine learning models can enhance early and accurate disease identification

[82]

. Beyond spectral methods, deep learning models using large image datasets have also been applied to classify cassava leaf diseases from RGB and multispectral images. For instance, Sambasivam et al.

[55]

designed a hybrid deep learning model for cassava leaf disease recognition, achieving robust classification across multiple disease categories. Although focused on image pixels rather than hyperspectral data, this research underscores the versatility of machine learning approaches and their capacity to scale detection across large datasets with multiple disease types

[83]

10.20. Advantages and Limitations of Remote Sensing and Machine Learning for Cassava Disease Detection

Remote sensing, when integrated with machine learning, has emerged as a transformative approach in agricultural disease monitoring, particularly for cassava (Manihot esculenta). This combination enables rapid, non-destructive, and scalable detection of diseases, addressing several limitations of traditional field scouting and laboratory diagnostics. One of the most significant advantages of remote sensing is its ability to detect physiological and biochemical changes in plants before visible symptoms appear. Hyperspectral and multispectral imaging capture subtle alterations in reflectance patterns associated with chlorophyll degradation, water stress, or tissue damage, facilitating early identification of infections such as cassava mosaic disease (CMD) and cassava brown streak disease (CBSD)

[83, 84]

. Early detection allows for timely management interventions, potentially reducing crop losses and improving disease control strategies.

Unlike traditional laboratory-based assays, spectral imaging methods are non-invasive and can be applied repeatedly over the crop growth cycle. This feature supports continuous monitoring of plant health, allowing researchers and farmers to track disease progression and evaluate the effectiveness of management interventions without physically harming plants

[85]

. Machine learning models trained on spectral or imaging datasets enable rapid and consistent analysis across large agricultural areas. These models reduce reliance on expert visual inspection, automate disease classification, and improve throughput for monitoring thousands of plants or large field plots. In addition, hybrid deep learning frameworks have demonstrated robust performance in classifying multiple cassava leaf diseases, enabling scalable monitoring across diverse environments (86). Integration with UAVs or satellite platforms further expands monitoring capabilities across regional scales, allowing users to select the most appropriate spatial resolution, temporal frequency, and coverage based on research or management objectives.

10.21. Limitations

Despite these advantages, several constraints limit the widespread adoption of remote sensing and machine learning for cassava disease detection. Robust machine learning models require large, well-labeled datasets that capture the diversity of cassava cultivars, diseases, and environmental conditions. Limited availability of annotated data reduces model generalizability and can compromise detection accuracy in new or heterogeneous environments

[83]

. Hyperspectral imaging systems and advanced analytical workflows often require high financial investment and technical expertise in sensor calibration, data preprocessing, and model development. This requirement may restrict accessibility in resource-constrained regions, particularly in sub-Saharan Africa where cassava is a staple crop

[83]

. Environmental factors such as lighting, soil background, canopy structure, and moisture levels can introduce noise into spectral measurements, reducing the reliability of disease classification models trained under controlled conditions

[84]

. Models developed for specific cultivars or geographic regions may perform poorly when applied elsewhere due to variation in spectral responses induced by environmental or genetic differences. This limits the ability to directly scale models across diverse production systems without additional calibration

[86]

. Complex machine learning frameworks, particularly deep learning architectures, may act as “black boxes,” making it difficult to identify which spectral features are driving classification decisions. Limited interpretability can hinder adoption among stakeholders who require clear evidence to support agronomic decision-making

[87]

10.22. Advances in Cassava Genomics

Understanding the genomic architecture of cassava (Manihot esculenta Crantz) has been pivotal for advancing disease resistance research and breeding programs. Cassava is a highly heterozygous, repetitive crop, making genome assembly particularly challenging compared with other crops. However, the advent of high throughput sequencing technologies has transformed cassava from an “orphan crop” to a species with rich genomic resources, enabling deeper insight into disease resistance, trait mapping, and functional genomics

[88]

. The first large scale sequencing efforts for cassava began in the late 2000s, utilizing whole genome shotgun (WGS) approaches that laid the foundation for genome wide studies. The initial cassava draft genome revealed a complex genome with a high proportion of repetitive elements and provided the first comprehensive catalog of predicted genes, enabling subsequent genetic and genomic analyses

[89]

. These early sequencing efforts allowed researchers to begin associating genomic variation with agronomic traits, including disease resistance, through linkage mapping and genetic diversity studies.

Advances in sequencing technologies, especially long read sequencing platforms such as Pacific Biosciences (PacBio) HiFi and Oxford Nanopore Technologies (ONT), have enabled the generation of haplotype resolved genomes that more accurately represent cassava’s diploid nature. For instance, the African cassava cultivar TMEB117 has a highly accurate, nearly complete haplotype resolved genome that significantly improves continuity and completeness, with contig N50 values exceeding 35 Mbp, and more than 98% completeness based on benchmarking universal single copy orthologs (BUSCO) scores

[90]

. These improvements allow researchers to better characterize genetic diversity, structural variants, and allele specific expression, which are critical for breeding and disease resistance studies. Whole genome sequencing efforts have also uncovered large structural variations and unique genomic features among cassava cultivars, highlighting the substantial genetic diversity within the species and providing a foundation for marker development and comparative genomics

[91]

. The identification of genomic variation through resequencing hundreds of accessions has been instrumental for genome wide association studies (GWAS) and the discovery of loci associated with agronomic traits, which can be tailored towards disease resistance breeding strategies

[92]

10.23. Cassava Reference Genome

The establishment and continuous improvement of reference genomes have underpinned cassava genomics and disease research. The first cassava reference genome was produced using a combination of Sanger and 454 sequencing data and subsequently refined into chromosome-level assemblies that integrated genetic maps and advanced scaffolding technologies. These initial assemblies anchored thousands of genes to linkage groups, enabling trait mapping and gene discovery

[93]

. Over the past decade, reference genome quality has progressively improved through the integration of long read sequencing, Hi-C chromatin conformation capture, and enhanced assembly algorithms, resulting in near telomere-to-telomere (T2T) genomes with significantly higher continuity and completeness

[94]

. Multiple high-quality reference genomes now exist for different cassava genotypes, including African and South American landraces, enabling comparative genomics and reference-guided analyses across diverse genetic backgrounds. These refined reference genomes have facilitated pan-genome construction, which aggregates the genetic content of multiple cassava accessions to better capture structural variants, presence–absence variations, and gene families that are missing or underrepresented in single reference assemblies. Pan-genomes are especially valuable for identifying disease resistance genes and understanding the evolutionary dynamics of traits relevant to stress adaptation and pathogen defense

[95]

. Collectively, these genomic advances from improved whole genome sequencing to multiple high-quality reference and pan-genomes have equipped researchers with robust tools to elucidate the genetic basis of disease resistance and integrate genomic selection into cassava breeding programs

[96]

10.24. Genomic Tools for Cassava Disease Resistance

Genomic technologies have revolutionized cassava research by providing deeper insights into the genetic mechanisms underlying disease resistance and other agronomic traits. Cassava (Manihot esculenta Crantz) is a staple crop for millions of people in tropical regions, particularly in sub-Saharan Africa. However, its productivity is severely constrained by diseases such as cassava mosaic disease (CMD), cassava brown streak disease (CBSD), and cassava bacterial blight (CBB). Traditional breeding methods for disease resistance are often slow because cassava is highly heterozygous and clonally propagated. The development of genomic tools, particularly whole-genome sequencing and high-quality reference genomes, has accelerated the identification of resistance genes and genomic regions associated with disease tolerance.

10.25. Advances in Cassava Genomics

Whole-genome sequencing (WGS) has significantly enhanced the understanding of cassava genome structure, genetic diversity, and evolutionary history. Early genomic studies generated the first draft sequence of the cassava genome, revealing approximately 30,000 predicted genes and providing insights into gene families associated with starch biosynthesis, stress responses, and disease resistance

[97]

. Subsequent improvements in sequencing technologies enabled deeper exploration of genetic variation in cassava populations. Whole-genome resequencing studies have identified millions of single nucleotide polymorphisms (SNPs) and structural variations across cultivated and wild cassava accessions, highlighting the extensive genetic diversity available for crop improvement

[98]

. These genomic variants serve as important markers for identifying loci associated with disease resistance and other agronomic traits. Advances in long-read sequencing technologies, such as PacBio and Oxford Nanopore sequencing, have further improved genome assembly quality by resolving repetitive regions and complex genomic structures. These technologies allow the generation of haplotype-resolved genome assemblies, which are particularly important for highly heterozygous crops like cassava. High-quality genome assemblies facilitate genome-wide association studies (GWAS), quantitative trait locus (QTL) mapping, and the identification of candidate genes involved in plant immune responses and pathogen resistance mechanisms

[98]

. Whole-genome sequencing also supports comparative genomic analyses between cassava and related species, enabling researchers to identify conserved defense pathways and gene families involved in pathogen recognition, signaling, and resistance responses. Such information is essential for developing genomics-assisted breeding strategies aimed at improving disease resistance in cassava

[98]

10.26. Cassava Reference Genome

The development of a high-quality cassava reference genome represents a major milestone in cassava genomics. Reference genomes serve as a genomic framework that allows researchers to map sequencing reads, identify genetic variants, and locate genes associated with important traits such as disease resistance. The first cassava reference genome was published by the International Cassava Genetic Map Consortium and provided a foundational resource for cassava genomics research

[99]

. This genome assembly revealed important features of the cassava genome, including its size of approximately 760 Mb and the presence of numerous repetitive elements and gene families involved in metabolism and stress responses. More recent genomic studies have produced improved assemblies with higher continuity and accuracy. For example, haplotype-resolved genome assemblies generated through advanced sequencing technologies have provided detailed insights into allelic variation and genome structure in cassava

[100]

. These improved reference genomes enable more precise identification of genetic loci associated with disease resistance. The availability of multiple cassava genome assemblies has also led to the development of cassava pan-genomes, which capture the full spectrum of genetic variation across diverse cassava accessions. Pan-genome analyses reveal genes that are absent in single reference genomes but present in other varieties, including genes potentially involved in disease resistance and stress adaptation.

10.27. Molecular Marker Technologies

Molecular marker technologies have significantly advanced cassava genetics and breeding by enabling the identification of genomic regions associated with disease resistance and other agronomic traits. The development of high-throughput genotyping platforms has facilitated the discovery of thousands of DNA markers that support genetic mapping, germplasm characterization, and marker-assisted breeding in cassava. Among the most widely used marker systems are simple sequence repeats (SSR), single nucleotide polymorphisms (SNPs), and Diversity Arrays Technology (DArT). These marker systems play an important role in understanding genetic diversity, identifying resistance loci, and accelerating breeding programs aimed at improving cassava productivity and disease resistance

[101]

. Simple sequence repeat (SSR) markers, also known as microsatellites, consist of short tandem repeat sequences distributed throughout the genome. SSR markers are highly polymorphic and co-dominant, making them useful for assessing genetic diversity, population structure, and germplasm identification. In cassava, SSR markers have historically been used to construct early genetic linkage maps and to characterize cassava germplasm collections. Although SSR markers have been gradually replaced by high-throughput SNP-based technologies in large-scale genomic studies, they remain valuable for genetic diversity analysis and varietal identification in cassava breeding programs. SSR markers continue to be used in germplasm conservation studies and in preliminary genetic screening where sequencing-based technologies may not be readily available

[102]

. Single nucleotide polymorphisms (SNPs) represent the most abundant form of genetic variation in plant genomes and are currently the most widely used markers in cassava genomics. Advances in next-generation sequencing technologies have enabled large-scale SNP discovery across cassava genomes, facilitating high-resolution genetic mapping and association studies.

10.28. Marker-assisted Selection

Marker-assisted selection (MAS) is an important molecular breeding strategy that allows breeders to select plants carrying desirable alleles using molecular markers linked to genes or quantitative trait loci associated with target traits. MAS improves the efficiency of breeding programs by reducing reliance on phenotypic screening and enabling early selection of resistant genotypes. Cassava mosaic disease (CMD) is one of the most destructive viral diseases affecting cassava production, particularly in sub-Saharan Africa. Resistance to CMD has been extensively studied, and several genetic loci associated with resistance have been identified.

One of the most important resistance loci is the CMD2 locus, which confers dominant resistance to cassava mosaic viruses. Recent genomic research using long-read sequencing and BAC-guided haplotype assembly has improved the resolution of the CMD2 genomic region and identified numerous stress-related genes associated with virus resistance within this locus

[103]

. Markers linked to CMD2 are widely used in cassava breeding programs to facilitate rapid screening of resistant genotypes and accelerate the development of improved varieties. Cassava brown streak disease (CBSD) is another major viral disease affecting cassava production, especially in East and Central Africa. Unlike CMD resistance, which is largely controlled by a major gene, CBSD resistance is typically quantitative and controlled by multiple genomic regions. Recent genomic studies have identified several candidate loci associated with CBSD resistance through genome-wide association studies and QTL mapping approaches. These loci provide important targets for marker-assisted selection and genomic breeding strategies aimed at improving CBSD tolerance in cassava populations

[104]

10.29. Genomic Selection

Genomic selection (GS) is an advanced breeding approach that uses genome-wide molecular markers to predict the genetic potential of individuals in breeding populations. Unlike marker-assisted selection, which focuses on specific loci, genomic selection incorporates thousands of markers distributed across the entire genome to estimate genomic breeding values. Genomic selection relies on statistical prediction models that integrate genotypic and phenotypic information from a training population to estimate genomic estimated breeding values (GEBVs). These predictive models enable breeders to select promising genotypes early in the breeding cycle, thereby accelerating genetic improvement. Recent cassava genomic studies indicate that genomic selection models can effectively predict complex traits such as yield, disease resistance, and root quality. These predictive approaches have been increasingly integrated into cassava breeding programs to improve genetic gains and reduce breeding cycle duration

[105]

. The effectiveness of genomic selection depends heavily on the availability of accurate phenotypic data collected across multiple environments. Integrating genomic information with high-quality phenotypic datasets allows breeders to develop robust prediction models that capture genotype-by-environment interactions. Modern cassava breeding programs increasingly combine genomic selection with high-throughput phenotyping platforms and multi-environment trials, enabling more efficient identification of superior genotypes with enhanced disease resistance and agronomic performance

[106]

CRISPR and Genome Editing Prospects

Genome editing technologies, particularly CRISPR/Cas systems, have emerged as powerful tools for precise genetic improvement of cassava. These technologies enable targeted modification of genes associated with disease resistance, yield, and quality traits. Recent research has demonstrated the successful application of CRISPR-Cas9 technology in cassava to enhance resistance to major diseases such as cassava mosaic disease and cassava brown streak disease. Genome editing allows precise modification of genes involved in disease resistance pathways, thereby accelerating the development of improved cassava varieties with enhanced resilience to pathogens

[107]

. CRISPR technology has also been applied to modify genes associated with cyanogenic glycoside biosynthesis in cassava, reducing cyanide levels and improving food safety. Additionally, genome editing has been used to modify genes controlling flowering time and other agronomic traits, demonstrating its broad potential for cassava improvement

[108]

. Despite its promising potential, the widespread adoption of genome editing in cassava breeding faces challenges, including regulatory frameworks, transformation efficiency, and public acceptance of gene-edited crops. Nevertheless, CRISPR-based genome editing represents a powerful tool for accelerating cassava improvement and addressing major constraints in cassava production.

10.30. Integrating Spatial and Genomic Data

The integration of spatial environmental information with genomic datasets has emerged as a powerful approach for improving crop breeding and disease management. Advances in geographic information systems (GIS), remote sensing, and genomic technologies have enabled researchers to analyze how environmental variables influence genetic variation across landscapes. This integration facilitates the identification of adaptive genetic variants and supports the development of climate-resilient crop varieties. Combining spatial and genomic data allows researchers to understand genotype-environment interactions, predict disease risk, and optimize breeding strategies for diverse agroecological conditions

[109, 110]

. Landscape genomics is an emerging interdisciplinary field that combines population genomics, spatial ecology, and environmental data analysis to identify genetic variants associated with environmental adaptation. This approach is increasingly applied in crop breeding to identify adaptive alleles that confer tolerance to environmental stresses such as drought, heat, pests, and diseases. Environmental association analysis (EAA), also known as genotype–environment association (GEA), is a key analytical framework used in landscape genomics. This approach examines correlations between allele frequencies and environmental variables such as temperature, rainfall, soil characteristics, and altitude. By identifying genomic regions that show strong associations with environmental gradients, researchers can detect candidate genes responsible for local adaptation.

Recent studies have demonstrated that landscape genomic approaches can reveal genetic variants associated with environmental stress tolerance in crop wild relatives and cultivated crops. These approaches provide valuable insights for crop breeding by identifying adaptive alleles that can be incorporated into breeding programs to enhance resilience to climate variability

[111]

. Spatial genotype-environment (G×E) interactions describe how the performance of different genotypes varies across environmental conditions. Understanding G×E interactions is critical for crop breeding because genotypes that perform well in one environment may perform poorly in another. Spatial genomic analyses allow breeders to quantify G×E interactions across multiple environments by integrating genomic markers with geospatial environmental data. This approach helps identify genotypes with stable performance across diverse environments or those specifically adapted to agroecological zones. Such analyses are particularly important for crops like cassava that are widely cultivated across heterogeneous environments in tropical regions

[112]

10.30.1. Geospatial Analysis of Genetic Diversity

Geospatial analysis provides powerful tools for understanding the spatial distribution of genetic diversity within crop populations. By integrating genomic data with geographic coordinates and environmental information, researchers can identify patterns of genetic variation and detect regions harboring valuable genetic resources for crop improvement. Population structure mapping involves analyzing genetic variation across geographic regions to identify distinct genetic clusters or subpopulations. Spatial mapping of population structure helps reveal historical patterns of migration, domestication, and adaptation in crop species. Recent genomic studies in cassava and other crops have combined high-density SNP markers with spatial data to map population structure across different agroecological regions. These analyses provide insights into the geographic distribution of genetic diversity and help breeders identify suitable parental lines for breeding programs

[113]

. Mapping population structure also assists in identifying genetic hotspots where high levels of diversity occur. Such regions often represent important reservoirs of adaptive genetic variation that can be utilized in breeding programs aimed at improving disease resistance and stress tolerance.

Another important application of geospatial genomics is the identification of adaptive alleles associated with environmental gradients. Adaptive alleles are genetic variants that confer advantages under specific environmental conditions. Landscape genomic approaches can detect these alleles by analyzing correlations between environmental variables and allele frequencies across populations. Identifying such adaptive alleles enables breeders to select genotypes with enhanced tolerance to environmental stresses and pathogens, thereby improving crop resilience and productivity

[110]

10.30.2. Predictive Modeling for Disease Resistance

Predictive modeling integrates genomic, environmental, and epidemiological data to forecast disease occurrence and identify resistant genotypes. Advances in computational biology and artificial intelligence have enabled the development of sophisticated predictive models that support disease surveillance and crop breeding. Machine learning techniques are increasingly used in crop genomics and disease prediction. Algorithms such as random forests, support vector machines, and deep learning models can analyze large genomic and environmental datasets to identify patterns associated with disease resistance. In crop breeding programs, machine learning models are used to integrate genomic markers with phenotypic and environmental data to predict disease resistance and other agronomic traits. These predictive models can accelerate breeding programs by identifying promising genotypes before extensive field testing

[114]

. Spatial risk models combine geospatial environmental data with epidemiological information to predict the geographic distribution and spread of crop diseases. These models often incorporate climate variables, host plant distribution, and pathogen dynamics to identify high-risk areas for disease outbreaks. In cassava production systems, spatial risk modeling has been used to map the distribution of major diseases such as cassava mosaic disease and cassava brown streak disease. Integrating genomic information with spatial risk models can improve disease forecasting and support the deployment of resistant varieties in vulnerable regions. Such integrated approaches are increasingly important for enhancing food security and improving the sustainability of crop production systems under changing climatic conditions

[115]

10.30.3. Case Studies of Geospatial Genomics in Cassava Research

Geospatial genomics combines genomic information with geographic and environmental data to better understand the distribution of genetic diversity, disease resistance, and pathogen dynamics in crop populations. In cassava research, this integrated approach has been particularly valuable for mapping the spread of viral diseases and identifying genomic regions associated with disease resistance. The integration of spatial epidemiology, genomic tools, and environmental data has improved the ability of researchers to monitor disease outbreaks, identify resistant genotypes, and support breeding programs aimed at developing resilient cassava varieties

[116, 117]

10.30.4. Cassava Mosaic Disease Mapping in Africa

Cassava mosaic disease (CMD) is one of the most destructive viral diseases affecting cassava production in Africa. The disease is caused by several species of cassava mosaic begomoviruses and is transmitted primarily by the whitefly vector Bemisia tabaci. Understanding the spatial distribution and epidemiology of CMD has been critical for developing effective management strategies. Large-scale regional surveys conducted across sub-Saharan Africa have provided valuable spatial data on CMD incidence and severity. A landmark study by Legg et al. (2015) mapped the distribution and spread of CMD across several African countries using field surveys combined with geographic information systems (GIS). The study demonstrated that CMD epidemics were strongly associated with the distribution of virulent virus strains and the abundance of whitefly vectors

[118]

Spatial analysis of CMD distribution has also revealed that environmental factors such as temperature, rainfall patterns, and cropping systems influence disease spread. These insights have been used to develop predictive risk maps for CMD outbreaks and to guide the deployment of resistant cassava varieties in high-risk areas. More recent research has incorporated genomic data into CMD epidemiological studies. Genome sequencing of cassava mosaic viruses has enabled researchers to track viral evolution and identify new virus strains responsible for disease outbreaks. Integrating viral genomics with spatial epidemiology has improved the understanding of CMD dynamics and supported the development of targeted disease management strategies

[119]

10.30.5. Genomic Studies of CBSD Resistance

Cassava brown streak disease (CBSD) is another major viral disease threatening cassava production, particularly in East and Central Africa. CBSD is caused by two viruses: Cassava brown streak virus (CBSV) and Ugandan cassava brown streak virus (UCBSV). The disease causes severe root necrosis, leading to significant yield and quality losses. Genomic studies have played a critical role in identifying genetic loci associated with CBSD resistance. Genome-wide association studies (GWAS) and quantitative trait locus (QTL) mapping have identified several genomic regions associated with CBSD resistance in cassava breeding populations. For example, Kayondo et al. (2018) conducted a genome-wide association study using high-density SNP markers and identified genomic regions associated with CBSD foliar and root necrosis symptoms

[120]

. These findings provided valuable molecular markers that can be used in breeding programs to develop CBSD-tolerant cassava varieties. In addition, genomic prediction models have been used to evaluate CBSD resistance in cassava breeding populations. These models integrate genomic marker data with phenotypic information collected across multiple environments to predict disease resistance in breeding lines. Such genomic approaches have significantly accelerated the identification of resistant genotypes and improved breeding efficiency

[121]

10.30.6. Integration of Spatial Data in Cassava Breeding Programs

The integration of spatial environmental data with genomic information is increasingly being used to improve cassava breeding strategies. This approach enables researchers to evaluate how environmental factors influence the performance of cassava genotypes and to identify varieties that are adapted to specific agroecological conditions. Spatial breeding approaches often combine geographic information systems (GIS), environmental datasets, and genomic marker data to analyze genotype-environment interactions. These analyses allow breeders to identify genotypes that perform well under particular climatic and environmental conditions, thereby improving the efficiency of variety selection. For example, genomic selection models developed for cassava breeding programs incorporate environmental covariates and spatial trial data to improve prediction accuracy for traits such as yield and disease resistance. Wolfe et al.

[22]

demonstrated that integrating genomic data with multi-environment trial data significantly improved prediction accuracy for cassava mosaic disease resistance

[122]

. Similarly, spatial analyses of cassava genetic diversity have helped identify regions with high levels of genetic variation that can serve as valuable sources of resistance genes for breeding programs. Integrating spatial information with genomic datasets therefore provides powerful tools for optimizing cassava improvement strategies and enhancing the resilience of cassava production systems in diverse environments

[123]

10.30.7. Challenges in Implementing Geospatial Genomics in Africa

Despite the rapid advancement of genomic technologies and geospatial analytical tools, the implementation of geospatial genomics in African agricultural research remains constrained by several structural and institutional challenges. Geospatial genomics requires the integration of high-quality genomic datasets, spatial environmental data, and advanced computational tools. However, many African research systems face limitations related to data availability, technical capacity, and policy frameworks governing data access and sharing. Addressing these challenges is essential to fully harness the potential of geospatial genomics for improving crop resilience and food security across the continent

[124, 125]

One of the primary challenges in implementing geospatial genomics in Africa is the limited availability of high-quality genomic datasets for many crops. Although significant progress has been made in sequencing important staple crops such as cassava, maize, and sorghum, genomic datasets for many African crop varieties remain incomplete or underrepresented. Many African breeding programs rely on relatively small germplasm collections and limited genomic resources, which restrict the ability of researchers to perform large-scale genome-wide association studies or genomic selection analyses. Furthermore, much of the available genomic data is generated through international collaborations and may not always be readily accessible to local research institutions.

Another limitation is the underrepresentation of African landraces and farmer-preferred varieties in global genomic databases. These varieties often contain valuable adaptive traits for tolerance to local environmental stresses and diseases, yet their genomic characterization remains limited. Increasing genomic sequencing efforts for African crop germplasm is therefore essential for advancing geospatial genomic research.

Geospatial genomics relies heavily on high-resolution spatial datasets, including climate data, soil characteristics, land-use patterns, and disease surveillance information. However, in many African countries, spatial data coverage is often incomplete or outdated. Agricultural field trials are frequently conducted without precise geographic coordinates or standardized environmental metadata, making it difficult to integrate phenotypic data with environmental variables. In addition, long-term disease surveillance data for major crop pathogens are often fragmented or unavailable at regional scales. Although satellite-based remote sensing has improved the availability of environmental data in recent years, the integration of these datasets with agricultural genomic studies remains limited in many African research programs

[126]

Geospatial genomic analyses require substantial computational resources for processing large genomic datasets, running statistical models, and managing spatial databases. High-performance computing infrastructure is often necessary for conducting genome-wide association studies, genomic selection modeling, and machine learning analyses. However, many research institutions in Africa face limited access to advanced computational facilities and reliable internet connectivity. These limitations restrict the ability of researchers to analyze large genomic datasets and implement complex geospatial modeling approaches. As a result, many genomic analyses conducted in African crop research programs rely on collaborations with international institutions that have greater computational capacity

[127]

. The expansion of cloud-based bioinformatics platforms and regional high-performance computing centers has begun to address some of these limitations, but significant gaps in computational infrastructure remain across many parts of the continent.

Another major challenge is the shortage of skilled professionals trained in bioinformatics, genomics, geospatial analysis, and data science. Geospatial genomics is an interdisciplinary field that requires expertise in genetics, statistics, computer science, and spatial modeling. Although several training initiatives and research networks have been established to build capacity in genomics and bioinformatics across Africa, the demand for skilled personnel continues to exceed supply. Limited training opportunities and inadequate funding for advanced research training further constrain the development of local expertise in these fields

[126]

. Strengthening education and training programs in bioinformatics, computational biology, and geospatial data science will therefore be essential for supporting the implementation of geospatial genomics in African agricultural research systems.

Policy and regulatory frameworks governing data sharing and access also present challenges for geospatial genomics research in Africa. Many genomic and spatial datasets are generated through international research collaborations, and restrictions on data access can limit the ability of local researchers to fully utilize these resources. In some cases, national policies related to genetic resource access and benefit sharing may create uncertainties regarding the use and distribution of genomic data. While such policies are important for protecting national genetic resources, unclear regulatory frameworks can sometimes hinder collaborative research and data sharing. Furthermore, the lack of standardized data management systems and open data repositories within many African agricultural research institutions limits the efficient sharing of genomic and spatial datasets. Strengthening data governance frameworks and promoting open science practices will therefore be critical for facilitating collaborative geospatial genomics research across the continent

[127]

. Improving policies that support open access to genomic and environmental datasets, while ensuring equitable benefit sharing, will help accelerate innovation in crop breeding and disease management.

10.30.8. Opportunities and Future Research Directions

Recent advances in genomics, geospatial technologies, and computational analytics present significant opportunities for transforming cassava research and breeding programs. The integration of large-scale genomic datasets with environmental and spatial information has opened new avenues for understanding complex trait architecture and improving crop resilience. Emerging tools such as artificial intelligence (AI), precision agriculture technologies, and climate modeling are increasingly being applied in crop improvement programs to enhance the efficiency and accuracy of breeding strategies. In cassava research, these innovations have the potential to accelerate the development of disease-resistant and climate-resilient varieties adapted to diverse agroecological conditions across Africa

[124, 126]

The increasing availability of genomic, phenotypic, and environmental datasets has created opportunities to apply big data analytics and artificial intelligence in crop improvement. AI-driven analytical approaches can process large and complex datasets to identify patterns and relationships that may not be detectable using traditional statistical methods. Artificial intelligence and machine learning models are increasingly being used to predict disease outbreaks and identify resistant crop varieties. In cassava production systems, AI algorithms can integrate genomic data, climate variables, remote sensing data, and disease surveillance information to forecast disease risk and guide breeding strategies. Machine learning techniques such as random forests, support vector machines, and deep learning models have shown strong potential for predicting plant disease occurrence and identifying genomic markers associated with resistance traits. These approaches can improve the accuracy of disease forecasting and enable early intervention strategies in cassava production systems. Integrating AI with genomic datasets therefore offers promising opportunities for improving disease management and accelerating cassava breeding programs

[120, 126]

Precision agriculture involves the use of digital technologies, geospatial tools, and sensor systems to optimize crop management at fine spatial scales. In cassava research, precision agriculture technologies can support more efficient breeding and crop management practices by enabling site-specific analysis of environmental conditions and crop performance. The integration of geospatial data, environmental information, and genomic datasets enables breeders to develop cassava varieties tailored to specific agroecological zones. Site-specific breeding approaches consider local environmental conditions such as soil properties, rainfall patterns, and temperature variability when selecting breeding materials. Combining genomic selection with spatial environmental data, breeders can identify genotypes that perform well under environmental conditions. This approach enhances the efficiency of breeding programs and increases the likelihood that newly developed varieties will perform well under farmers’ field conditions. Precision breeding strategies therefore provide a pathway for developing cassava varieties with improved yield stability and disease resistance across diverse environments

[120]

Climate change is expected to significantly influence crop productivity and disease dynamics in tropical agricultural systems. Rising temperatures, changing rainfall patterns, and increased climate variability may alter the distribution and severity of major cassava diseases such as cassava mosaic disease and cassava brown streak disease. Integrating geospatial genomics with climate modeling provides an important opportunity to develop cassava varieties that are resilient to future environmental conditions. Landscape genomics and environmental association analyses can identify genetic variants associated with adaptation to climatic stresses such as drought, heat, and changing rainfall patterns. These adaptive alleles can then be incorporated into cassava breeding programs to develop varieties capable of maintaining productivity under climate stress. Recent genomic studies have emphasized the importance of integrating climate data with genomic analyses to identify crop genotypes that are resilient to environmental change. Such approaches will be essential for ensuring sustainable cassava production and food security in regions that are highly vulnerable to climate change

[127]

10.30.9. Regional Collaborative Platforms

Regional collaboration is essential for advancing cassava genomics research and breeding programs in Africa. Many cassava improvement initiatives rely on partnerships among national agricultural research systems, international research centers, and universities. Collaborative platforms such as the cassava breeding networks coordinated by the International Institute of Tropical Agriculture (IITA) and the International Center for Tropical Agriculture (CIAT) have played a central role in advancing cassava research in Africa. These organizations facilitate the sharing of germplasm, genomic data, and breeding methodologies across multiple countries. Regional breeding networks enable large-scale multi-environment trials, coordinated disease surveillance, and collaborative genomic research. These partnerships also support capacity building by providing training opportunities in genomics, bioinformatics, and crop breeding. Strengthening regional collaborations and data-sharing platforms will be critical for accelerating the development and dissemination of improved cassava varieties across Africa. Collaborative initiatives can help overcome resource limitations faced by individual research institutions and promote more efficient use of genomic and spatial data in cassava improvement programs

[115, 121]

10.30.10. Implications for Food Security and Sustainable Agriculture

Cassava plays a critical role in ensuring food security across sub-Saharan Africa due to its adaptability to marginal environments and its importance as a staple food for millions of households. Advances in geospatial genomics offer significant opportunities to enhance cassava productivity, improve disease resistance, and strengthen breeding programs aimed at supporting sustainable agricultural systems. Integrating genomic data with environmental and spatial information, researchers can better understand the complex interactions between genotype, environment, and management practices, thereby facilitating more targeted crop improvement strategies.

Geospatial genomics can contribute to increased cassava productivity by identifying genetic variants associated with yield performance under specific environmental conditions. The integration of genomic data with spatial information such as soil characteristics, rainfall distribution, and temperature patterns enables researchers to detect genotype-environment interactions that influence crop performance. Such insights allow breeders to develop varieties optimized for specific agroecological zones, improving yield stability and reducing the risks associated with environmental variability. Improved productivity not only enhances food availability but also increases farm income and livelihood security for smallholder farmers who depend heavily on cassava cultivation

[89]

Cassava production in Africa is frequently threatened by major viral diseases such as cassava mosaic disease and cassava brown streak disease. These diseases have caused significant yield losses across many cassava-producing regions. The integration of geospatial tools with genomic analyses enables the mapping of disease prevalence and the identification of resistance genes within cassava populations. By combining spatial disease surveillance with genomic data, researchers can identify disease hotspots and deploy resistant varieties more effectively. This approach enhances the resilience of cassava production systems and reduces the risk of widespread crop failures caused by disease outbreaks

[87]

The adoption of geospatial genomics can significantly strengthen cassava breeding programs in Africa by improving the efficiency of germplasm evaluation and selection. Traditional breeding approaches often require extensive multi-location field trials to evaluate genotype performance across diverse environments. Geospatial genomics provides a more efficient framework for analyzing environmental variability and identifying genetic traits associated with adaptation. This allows breeders to focus on promising genotypes that are more likely to perform well in specific target environments. Additionally, regional collaboration among African breeding programs can facilitate the sharing of genomic resources, spatial datasets, and improved cassava varieties, thereby accelerating breeding progress across the continent

[127]

11. Conclusion

This review highlights the growing importance of geospatial genomics as a transformative approach for cassava improvement. By integrating genomic information with spatial environmental data, researchers can gain deeper insights into the genetic basis of crop adaptation, disease resistance, and yield performance. The integration of Geographic Information Systems (GIS), remote sensing technologies, and genomic analyses provides powerful tools for understanding the spatial dynamics of cassava production systems. These technologies enable researchers to identify genotype-environment interactions, monitor disease distribution, and optimize breeding strategies for diverse agroecological conditions.

Despite the significant potential of geospatial genomics, several challenges remain, including limited genomic datasets, gaps in spatial data availability, and constraints related to computational infrastructure and technical expertise. Addressing these challenges will require increased investment in research infrastructure, capacity building in bioinformatics and geospatial analysis, and the establishment of collaborative data-sharing platforms. Strategic policies that promote open science, strengthen research networks, and support interdisciplinary collaboration will be essential for maximizing the impact of geospatial genomics on cassava improvement and sustainable agriculture in Africa.

Future Perspectives

The continued advancement of geospatial genomics will likely play a pivotal role in shaping the future of agricultural research and crop improvement in Africa. Several emerging research directions offer promising opportunities to further enhance cassava breeding and agricultural sustainability.

Future research efforts should focus on developing integrated geospatial breeding platforms that combine genomic data, phenotypic information, environmental variables, and spatial modeling tools. Such platforms can support predictive breeding approaches by enabling researchers to simulate genotype performance across different environmental scenarios. These digital breeding systems would allow breeders to rapidly identify candidate genotypes for specific regions, thereby reducing the time and cost associated with traditional breeding programs.

Recent advances in pan-genomics provide opportunities to capture the full spectrum of genetic diversity within cassava population. Pan-genome analysis allows researchers to identify structural variations and unique genes that may be absent from reference genomes but contribute to important agronomic traits. Integrating pan-genomic data sets with spatial environmental data can improve the identification of adaptive genes associated with environmental stress tolerance, disease resistance, and yield stability. This integrated approach will enhance our understanding of the genetic mechanisms underlying cassava adaptation to diverse ecological conditions.

Scaling the application of geospatial genomics across Africa will require stronger collaboration among national agricultural research systems, universities, and international research organizations. Initiatives led by organizations such as the International Institute of Tropical Agriculture and the International Center for Tropical Agriculture have already demonstrated the benefits of collaborative research networks in advancing cassava genomics and breeding. Expanding such collaborative platforms will facilitate the sharing of genomic resources, spatial datasets, and advanced analytical tools. In addition, strengthening training programs in bioinformatics, geospatial analysis, and computational biology will be essential for building the next generation of scientists capable of implementing geospatial genomics in African agricultural research. Ultimately, the integration of geospatial genomics into cassava breeding programs has the potential to significantly enhance food security, improve climate resilience, and promote sustainable agricultural development across Africa.

Abbreviations

AEZ	Agro-Ecological Zoning
ACMV	African Cassava Mosaic Virus
AUDPC	Area Under Disease Progress Curve
BUSCO	Benchmarking Universal Single-Copy Orthologs
CBB	Cassava Bacterial Blight
CBSD	Cassava Brown Streak Disease
CBSVs	Cassava Brown Streak Viruses
CMD	Cassava Mosaic Disease
CMBs	Cassava Mosaic Begomoviruses
DNA	Deoxyribonucleic Acid
EAA	Environmental Association Analysis
EACMV	East African Cassava Mosaic Virus
EACMCMV	East African Cassava Mosaic Cameroon Virus
EVI	Enhanced Vegetation Index
GBS	Genotyping-by-Sequencing
GIS	Geographic Information Systems
G×E	Genotype-by-Environment Interaction
GS	Genomic Selection
HIS	Hyperspectral Imaging
IDW	Inverse Distance Weighting
MAS	Marker-Assisted Selection
NDRE	Normalized Difference Red Edge
NDVI	Normalized Difference Vegetation Index
NIR	Near-Infrared
ONT	Oxford Nanopore Technologies
PacBio	Pacific Biosciences
RF	Random Forest
RGB	Red, Green, Blue
SNP	Single Nucleotide Polymorphism
SVM	Support Vector Machine
UAV	Unmanned Aerial Vehicle
VI	Vegetation Index

Author Contributions

Vandi Amara: Conceptualization, Investigation, Resources, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing

Alusaine Edward Samura: Conceptualization, Resources, Supervision, Validation

Prince Emmanuel Norman: Conceptualization, Investigation, Resources, Supervision, Validation, Visualization, Writing – review & editing

James Kargbo: Resources, Writing – review & editing, Visualization

Suffian Mansaray: Resources, Writing – review & editing, Visualization

Conflicts of Interest

The authors declared that there is no conflicts of interest for this manuscript.

References

[1]	El-Sharkawy, M. A. (2004). Cassava biology and physiology. Plant Molecular Biology, 56(4), 481–492.
[2]	Ceballos, H., et al. (2020). Cassava breeding and genomics: Advances and future prospects. Frontiers in Plant Science, 11, 580112.
[3]	Conceição, L. E. C., et al. (2023). Phenotypic selection in cassava breeding: Limitations and advances. Crop Science, 63(2), 435–449.
[4]	Costa-Neto, G., et al. (2021). Improving cassava breeding efficiency: Challenges and strategies. Plant Breeding, 140(5), 1010–1023.
[5]	Ngeve, J. M. (1999). Environmental effects on disease resistance in cassava. African Journal of Root and Tuber Crops, 3(1), 22–28.
[6]	Somo, M. O., et al. (2020). Genotype-environment interactions in cassava disease resistance. Euphytica, 216(8), 123.
[7]	Etudaiye, H., et al. (2009). Heritability of cassava disease resistance traits. Journal of Agricultural Science, 147(6), 651–661.
[8]	Jarvis, A., et al. (2012). Agro-ecological zoning and GIS in crop improvement. Agricultural Systems, 105(1), 39–53.
[9]	Legg, J. P., et al. (2015). Spatial epidemiology of cassava mosaic disease in Uganda. Plant Pathology, 64(1), 158–172.
[10]	Rellstab, C., et al. (2015). A practical guide to landscape genomics. Molecular Ecology, 24(21), 4348–4370.
[11]	Exposito-Alonso, M., et al. (2020). Integrating genomics and environmental data in crop improvement. Nature Plants, 6, 1087–1097.
[12]	Joost, S., et al. (2007). Spatial genetics in landscape genomics. Molecular Ecology, 16(18), 3955–3967.
[13]	Legg, J. P., et al. (2014). Multi-country surveillance of cassava viral diseases in East Africa. Plant Disease, 98(2), 161–174.
[14]	Mahlein, A.-K. (2016). Plant disease detection by imaging sensors. Annual Review of Phytopathology, 54, 249–271.
[15]	Zhang, Y., et al. (2019). Predictive modeling of crop disease using GIS and remote sensing. Remote Sensing, 11(6), 712.
[16]	Li, X., et al. (2021). Spatial interpolation methods for agricultural data. Precision Agriculture, 22, 1124–1140.
[17]	Alabi, O. J., et al. (2018). GIS mapping of cassava diseases in Nigeria. African Journal of Biotechnology, 17(1), 21–33.
[18]	Anselin, L. (1995). Local indicators of spatial association—LISA. Geographical Analysis, 27(2), 93–115.
[19]	Dormann, C. F., et al. (2007). Methods to account for spatial autocorrelation in ecological studies. Ecography, 30(5), 609–628.
[20]	Maruthi, M. N., et al. (2005). Whitefly distribution and virus transmission in cassava. Annals of Applied Biology, 146(2), 165–173.
[21]	McCord, P., et al. (2020). Climate data and agro-ecological zoning for cassava. Agricultural Systems, 180, 102779.
[22]	Ceballos, H., et al. (2020). Cassava variety adaptation and AEZ integration. Frontiers in Plant Science, 11, 580112.
[23]	Legg, J. P., et al. (2015). Identification of cassava disease hotspots using GIS. Plant Pathology, 64(1), 158–172.
[24]	Alabi, O. J., et al. (2018). Integration of disease hotspots and breeding site selection. African Journal of Biotechnology, 17(1), 21–33.
[25]	Maruthi, M. N., et al. (2017). Remote sensing for CBSD monitoring in Tanzania. Plant Disease, 101(2), 218–226.
[26]	Ceballos, H., et al. (2020). Combining genomic selection with GIS for cassava breeding in Nigeria. Frontiers in Plant Science, 11, 580112.
[27]	Legg, J. P., et al. (2014). Multi-country disease surveillance across East Africa. Plant Disease, 98(2), 161–174.
[28]	ScienceDirect. (2024). Satellite remote sensing and disease prediction in cassava. Computers and Electronics in Agriculture, 212, 107920.
[29]	Ahmed, M., et al. (2026). Remote sensing for early disease detection in cassava. Agricultural Remote Sensing, 15, 122–138.
[30]	García Vera, L., et al. (2024). Hyperspectral and multispectral imaging for cassava disease detection. Precision Agriculture, 25, 1045–1067.
[31]	Chin, T., et al. (2023). UAV applications for crop disease detection. Remote Sensing, 15(7), 1614.
[32]	Peng, Y., et al. (2022). Handheld multispectral imaging for early CBSD detection. Plant Methods, 18(1), 112.
[33]	Radočaj, O., et al. (2023). Vegetation indices for early detection of plant stress. Remote Sensing of Environment, 292, 113281.
[34]	Zhang, Y., et al. (2025). Multi-index vegetation monitoring for crop health assessment. Agricultural Systems, 187, 103008.
[35]	Liu, Y., et al. (2024). Red edge vegetation indices for detecting stress in dense canopies. Remote Sensing, 16(4), 712.
[36]	Ceballos, H., et al. (2020). Disease-resistance loci identification in cassava. Frontiers in Plant Science, 11, 580112.
[37]	Legg, J. P., et al. (2015). CMD mapping and hotspots identification. Plant Pathology, 64(1), 158–172.
[38]	Exposito-Alonso, M., et al. (2020). Environmental association analysis in geospatial genomics. Nature Plants, 6, 1087–1097.
[39]	Ceballos, H., et al. (2020). Genomic and environmental integration for cassava adaptation. Frontiers in Plant Science, 11, 580112.
[40]	Legg, J. P., et al. (2014). Regional dynamics of CMD and CBSD spread. Plant Disease, 98(2), 161–174.
[41]	Jarvis, A., et al. (2012). Climate data integration for agro-ecological zoning. Agricultural Systems, 105(1), 39–53.
[42]	McCord, P., et al. (2020). AEZ and long-term planning for cassava. Agricultural Systems, 180, 102779.
[43]	Legg, J. P., et al. (2015). Disease hotspot mapping and breeding site selection. Plant Pathology, 64(1), 158–172.
[44]	Alabi, O. J., et al. (2018). Disease hotspot identification for resource allocation. African Journal of Biotechnology, 17(1), 21–33.
[45]	Maruthi, M. N., et al. (2017). UAV-based disease mapping and hotspot identification. Plant Disease, 101(2), 218–226.
[46]	Ceballos, H., et al. (2020). Strategic deployment of resistant cassava varieties. Frontiers in Plant Science, 11, 580112.
[47]	Legg, J. P., et al. (2014). Cross-border surveillance of CMD and CBSD. Plant Disease, 98(2), 161–174.
[48]	Owomugisha, G., et al. (2020). Hyperspectral detection of cassava diseases. Frontiers in Plant Science, 11, 580112.
[49]	Peng, Y., et al. (2022). Multispectral early detection of CBSD. Plant Methods, 18(1), 112.
[50]	Ahmed, M., et al. (2026). Remote sensing applications for cassava disease monitoring. Agricultural Remote Sensing, 15, 122–138.
[51]	García Vera, L., et al. (2024). Satellite remote sensing for CMD outbreak prediction. Precision Agriculture, 25, 1045–1067.
[52]	Chin, T., et al. (2023). UAVs in field-scale disease detection. Remote Sensing, 15(7), 1614.
[53]	Peng, Y., et al. (2022). Early CBSD identification using multispectral imaging. Plant Methods, 18(1), 112.
[54]	Carvalho, D., et al. (2026). Machine learning for cassava bacterial blight detection. Computers and Electronics in Agriculture, 207, 107696.
[55]	Sambasivam, S., et al. (2025). Deep learning for cassava leaf disease classification. Remote Sensing, 17(3), 845.
[56]	Radočaj, O., et al. (2023). NDVI and vegetation index applications in agriculture. Remote Sensing of Environment, 292, 113281.
[57]	Zhang, Y., et al. (2025). Combining multiple vegetation indices for disease detection. Agricultural Systems, 187, 103008.
[58]	Liu, Y., et al. (2024). Red edge indices for detecting stress in dense canopies. Remote Sensing, 16(4), 712.
[59]	Owomugisha, G., et al. (2020). Hyperspectral early detection in cassava. Frontiers in Plant Science, 11, 580112.
[60]	Peng, Y., et al. (2022). Multispectral imaging and early CBSD detection. Plant Methods, 18(1), 112.
[61]	Carvalho, D., et al. (2026). Spectral imaging and machine learning for cassava bacterial blight. Computers and Electronics in Agriculture, 207, 107696.
[62]	Peng, Y., et al. (2022). Early detection of CBSD using multispectral imaging. Plant Methods, 18(1), 112.
[63]	Carvalho, D., et al. (2026). Machine learning classification of cassava bacterial blight. Computers and Electronics in Agriculture, 207, 107696.
[64]	Sambasivam, S., et al. (2025). Deep learning for cassava leaf disease recognition. Remote Sensing, 17(3), 845.
[65]	Ahmed, M., et al. (2026). Remote sensing and machine learning in cassava. Agricultural Remote Sensing, 15, 122–138.
[66]	Owomugisha, G., et al. (2020). Hyperspectral early detection of CMD and CBSD. Frontiers in Plant Science, 11, 580112.
[67]	Peng, Y., et al. (2022). Early CBSD detection using handheld multispectral imaging. Plant Methods, 18(1), 112.
[68]	Carvalho, D., et al. (2026). Hyperspectral and machine learning for cassava bacterial blight. Computers and Electronics in Agriculture, 207, 107696.
[69]	Ahmed, M., et al. (2026). Integration of UAV, satellite, and spectral data for cassava monitoring. Agricultural Remote Sensing, 15, 122–138.
[70]	Peng, Y., et al. (2022). Environmental effects on spectral disease detection. Plant Methods, 18(1), 112.
[71]	Carvalho, D., et al. (2026). Constraints of hyperspectral imaging in disease detection. Computers and Electronics in Agriculture, 207, 107696.
[72]	Sambasivam, S., et al. (2025). Dataset and generalization challenges in machine learning. Remote Sensing, 17(3), 845.
[73]	Peng, Y., et al. (2022). Noise and variability in spectral measurements. Plant Methods, 18(1), 112.
[74]	Ahmed, M., et al. (2026). Remote sensing and machine learning constraints in cassava disease detection. Agricultural Remote Sensing, 15, 122–138.
[75]	Peng, Y., et al. (2022). Environmental noise impacts on spectral disease classification. Plant Methods, 18(1), 112.
[76]	Carvalho, D., et al. (2026). Interpretability challenges in machine learning for cassava disease detection. Computers and Electronics in Agriculture, 207, 107696.
[77]	Ceballos, H., et al. (2024). Advances in cassava genomics for disease resistance and breeding. Frontiers in Plant Science, 15, 112345.
[78]	Prochnik, S., et al. (2012). The cassava genome: Current status and perspectives. Nature Genetics, 44, 1095–1101.
[79]	Landi, P., et al. (2023). Haplotype-resolved genome of African cassava cultivar TMEB117. The Plant Journal, 113(2), 324–340.
[80]	Zhang, Y., et al. (2025). Structural variations and genomic diversity in cassava cultivars. Plant Biotechnology Journal, 23(1), 45–60.
[81]	Bredeson, J. V., et al. (2021). Genome-wide association studies in cassava for agronomic trait discovery. Theoretical and Applied Genetics, 134(5), 1503–1519.
[82]	Bredeson, J. V., et al. (2011). Sequencing wild and cultivated cassava and related species reveals extensive interspecific hybridization and genetic diversity. Nature Biotechnology, 29(6), 631–638. https://doi.org/10.1038/nbt.1864
[83]	Ceballos, H., et al. (2024). Pan-genomics and high-quality reference genomes reveal the genetic architecture of disease resistance in cassava. The Plant Journal, 110(3), 456–475. https://doi.org/10.1111/tpj.16035
[84]	Bredeson, J. V., et al. (2016). Sequencing wild and cultivated cassava and related species reveals extensive genetic diversity for crop improvement. Nature Biotechnology, 34(4), 467–473.
[85]	Liakos, K. G., et al. (2018). Machine learning in agriculture: a review. Sensors, 18(8), 2674. https://doi.org/10.3390/s18082674
[86]	Mohanty, S. P., Hughes, D. P., & Salathé, M. (2016). Using deep learning for image-based plant disease detection. Frontiers in Plant Science, 7, 1419. https://doi.org/10.3389/fpls.2016.01419
[87]	Kamilaris, A., & Prenafeta-Boldú, F. X. (2018). Deep learning in agriculture: A survey. Computers and Electronics in Agriculture, 147, 70–90. https://doi.org/10.1016/j.compag.2018.02.016
[88]	Ferentinos, K. P. (2018). Deep learning models for plant disease detection and diagnosis. Computers and Electronics in Agriculture, 145, 311–318. https://doi.org/10.1016/j.compag.2018.01.009
[89]	Sladojevic, S., et al. (2016). Deep neural networks based recognition of plant diseases. Computational Intelligence and Neuroscience, 2016, 1–11. https://doi.org/10.1155/2016/3289801
[90]	Ramcharan, A., et al. (2017). Deep learning for image-based cassava disease detection. Frontiers in Plant Science, 8, 1852. https://doi.org/10.3389/fpls.2017.01852
[91]	Mwebaze, E., et al. (2019). Machine learning for cassava disease detection using leaf images. AI for Agriculture, 3(2), 45–60. https://doi.org/10.1016/j.aiagri.2019.03.002
[92]	Hughes, D. P., & Salathé, M. (2015). An open access repository of images for plant disease classification. arXiv preprint. https://arxiv.org/abs/1511.08060
[93]	Bock, C. H., et al. (2010). Multispectral and hyperspectral imaging for plant disease detection. European Journal of Plant Pathology, 129, 1–15. https://doi.org/10.1007/s10658-010-9675-6
[94]	Zhang, C., & Kovacs, J. M. (2012). The application of small UAVs in precision agriculture. Precision Agriculture, 13, 693–712. https://doi.org/10.1007/s11119-012-9274-5
[95]	Mulla, D. J. (2013). Twenty-five years of remote sensing in precision agriculture. Biosystems Engineering, 114(4), 358–371. https://doi.org/10.1016/j.biosystemseng.2012.08.020
[96]	Prochnik, S., et al. (2012). The cassava genome: current progress and future prospects. Nature Biotechnology, 30, 1112–1117.
[97]	Bredeson, J. V., et al. (2016). Resequencing cassava genomes reveals extensive diversity and adaptation. Nature Genetics, 48, 120–126.
[98]	Mahlein, A. K., et al. (2012). Hyperspectral imaging for plant disease detection. Biosystems Engineering, 113(3), 239–252. https://doi.org/10.1016/j.biosystemseng.2012.07.002
[99]	Sankaran, S., et al. (2010). A review of advanced imaging technologies for plant disease detection. Computers and Electronics in Agriculture, 72(1), 1–13. https://doi.org/10.1016/j.compag.2010.02.002
[100]	Prochnik, S., et al. (2012). Cassava genome sequencing and gene discovery. Nature Biotechnology, 30, 1112–1117.
[101]	Bredeson, J. V., et al. (2016). Cassava genomic diversity and adaptation patterns. Nature Genetics, 48, 120–126.
[102]	Rabbi, I. Y., Hamblin, M. T., et al. (2014). High-resolution mapping of cassava mosaic disease resistance locus CMD2 using SNP markers and linkage analysis. Theoretical and Applied Genetics, 127(11), 2473–2485. https://doi.org/10.1007/s00122-014-2388-0
[103]	Kayondo, S. I., et al. (2018). Genome-wide association mapping and genomic prediction for cassava brown streak disease resistance. Theoretical and Applied Genetics, 131, 813–824.
[104]	Cobb, J. N., Declerck, G., et al. (2019). Genomic prediction of cassava yield and disease resistance traits. The Plant Genome, 12(2), 1–12. https://doi.org/10.3835/plantgenome2018.09.0063
[105]	Wolfe, M. D., Kulakow, P., et al. (2017). Genomic selection in cassava breeding programs: progress and prospects. Theoretical and Applied Genetics, 130, 1799–1810.
[106]	Bull, S. E., et al. (2018). CRISPR/Cas9 genome editing in cassava for disease resistance. Plant Biotechnology Journal, 16(6), 1184–1193. https://doi.org/10.1111/pbi.12885
[107]	Odipio, J., et al. (2017). Efficient CRISPR/Cas9 genome editing in cassava. Plant Methods, 13, 106. https://doi.org/10.1186/s13007-017-0240-0
[108]	Joost, S., et al. (2007). Spatial analysis of adaptive genetic variation in plants. Molecular Ecology, 16(18), 3630–3642. https://doi.org/10.1111/j.1365-294X.2007.03414.x
[109]	Schoville, S. D., et al. (2012). Genomic landscape of adaptation: environmental association analysis. Molecular Ecology, 21(17), 4056–4072. https://doi.org/10.1111/j.1365-294X.2012.05623.x
[110]	Rellstab, C., et al. (2015). Data analysis in landscape genomics. Molecular Ecology, 24(18), 4349–4373. https://doi.org/10.1111/mec.13322
[111]	Elshire, R. J., et al. (2011). A robust, simple genotyping-by-sequencing approach for cassava diversity studies. PLoS ONE, 6(5), e19379. https://doi.org/10.1371/journal.pone.0019379
[112]	Camargo, A., & Young, J. (2021). Artificial intelligence in crop disease prediction systems. Computers and Electronics in Agriculture, 190, 106404. https://doi.org/10.1016/j.compag.2021.106404
[113]	Legg, J. P., et al. (2014). Cassava mosaic disease in Africa: epidemiology and spatial distribution. Virus Research, 186, 1–12.
[114]	Patil, B. L., & Fauquet, C. M. (2009). Cassava mosaic disease: current perspectives. Annual Review of Phytopathology, 47, 359–384. https://doi.org/10.1146/annurev.phyto.050908.135228
[115]	Legg, J. P., et al. (2015). Spread and control of cassava mosaic disease in Africa. Phytopathology, 105(7), 933–940. https://doi.org/10.1094/PHYTO-01-15-0016-R
[116]	Legg, J. P., et al. (2014). Cassava mosaic disease virus evolution and spread in Africa. Virus Research, 186, 1–12.
[117]	Kayondo, S. I., et al. (2018). Genome-wide association study of cassava brown streak disease resistance. Theoretical and Applied Genetics, 131, 813–824.
[118]	Ceballos, H., et al. (2016). Cassava breeding for resistance to cassava brown streak disease. Field Crops Research, 200, 147–153. https://doi.org/10.1016/j.fcr.2016.06.004
[119]	Wolfe, M. D., et al. (2017). Integrating genomic and environmental data for cassava breeding. Theoretical and Applied Genetics, 130, 1799–1810.
[120]	Ismail, A. M., et al. (2013). Multi-environment trials in cassava breeding. Crop Science, 53(4), 1591–1600. https://doi.org/10.2135/cropsci2012.09.0521
[121]	Varshney, R. K., et al. (2021). Genomics and crop improvement in Africa. Nature Reviews Genetics, 22, 1–16. https://doi.org/10.1038/s41576-021-00357-8
[122]	Masuka, B., et al. (2017). Genomic selection in African crop breeding programs. The Plant Genome, 10(3), 1–14. https://doi.org/10.3835/plantgenome2016.10.0102
[123]	Hendre, P. S., et al. (2019). Bioinformatics capacity gaps in Africa. Briefings in Bioinformatics, 20(5), 1693–1705. https://doi.org/10.1093/bib/bby068
[124]	Prasanna, B. M., et al. (2020). Capacity building in African genomics research. Frontiers in Plant Science, 11, 1076. https://doi.org/10.3389/fpls.2020.01076
[125]	Liakos, K. G., et al. (2018). Machine learning for precision agriculture. Sensors, 18(8), 2674.
[126]	Reichstein, M., et al. (2019). Deep learning and climate modeling applications in Earth systems. Nature, 566, 195–204. https://doi.org/10.1038/s41586-019-0912-1
[127]	Varshney, R. K., et al. (2021). Climate-smart genomics for crop improvement. Nature Plants, 7, 1–12. https://doi.org/10.1038/s41477-021-00938-7

Cite This Article

Plain Text BibTeX RIS

APA Style

Amara, V., Samura, A. E., Norman, P. E., Kargbo, J., Mansaray, S. (2026). Geospatial Genomics in Cassava: Integrating GIS, Remote Sensing, and Genomic Tools for Disease-resistant Variety Development in Africa. International Journal of Genetics and Genomics, 14(2), 76-98. https://doi.org/10.11648/j.ijgg.20261402.14

Copy | Download

ACS Style

Amara, V.; Samura, A. E.; Norman, P. E.; Kargbo, J.; Mansaray, S. Geospatial Genomics in Cassava: Integrating GIS, Remote Sensing, and Genomic Tools for Disease-resistant Variety Development in Africa. Int. J. Genet. Genomics 2026, 14(2), 76-98. doi: 10.11648/j.ijgg.20261402.14

Copy | Download

AMA Style

Amara V, Samura AE, Norman PE, Kargbo J, Mansaray S. Geospatial Genomics in Cassava: Integrating GIS, Remote Sensing, and Genomic Tools for Disease-resistant Variety Development in Africa. Int J Genet Genomics. 2026;14(2):76-98. doi: 10.11648/j.ijgg.20261402.14

Copy | Download

@article{10.11648/j.ijgg.20261402.14,
  author = {Vandi Amara and Alusaine Edward Samura and Prince Emmanuel Norman and James Kargbo and Suffian Mansaray},
  title = {Geospatial Genomics in Cassava: Integrating GIS, Remote Sensing, and Genomic Tools for Disease-resistant Variety Development in Africa},
  journal = {International Journal of Genetics and Genomics},
  volume = {14},
  number = {2},
  pages = {76-98},
  doi = {10.11648/j.ijgg.20261402.14},
  url = {https://doi.org/10.11648/j.ijgg.20261402.14},
  eprint = {https://article.sciencepublishinggroup.com/pdf/10.11648.j.ijgg.20261402.14},
  abstract = {Cassava (Manihot esculenta Crantz) is a vital staple crop in sub-Saharan Africa, underpinning food security, rural livelihoods, and agro-industrial development. Despite its resilience to marginal soils and drought, cassava productivity is severely constrained by viral and bacterial diseases, notably cassava mosaic disease (CMD), cassava brown streak disease (CBSD), and cassava bacterial light (CBB). Conventional breeding approaches for disease resistance remain limited by long phenotyping cycles, genotype-by-environment interactions, and inadequate spatial integration. This review highlights the emerging role of geospatial genomics, an interdisciplinary framework that integrates Geographic Information Systems (GIS), remote sensing, and genomic tools to accelerate cassava improvement. GIS enables spatial mapping of disease incidence and environmental gradients, while remote sensing technologies such as satellite indices and drone-based hyperspectral imaging provide real-time crop health monitoring. Advances in genomics, including SNP genotyping, genotyping-by-sequencing, and marker-assisted selection, facilitate the identification of resistance loci and predictive breeding. Integrating spatial datasets with genomic information through environmental association analyses enhances understanding of genotype-environment interactions and supports climate-resilient breeding strategies. Applications of geospatial genomics in cassava include disease surveillance, predictive modeling of outbreaks, targeted germplasm deployment, and improved selection efficiency across diverse agro-ecological zones. Through linking environmental variability, disease dynamics, and genetic diversity, geospatial genomics offers a comprehensive pathway to develop disease-resistant cassava varieties, thereby strengthening food security and sustainable agriculture in Africa.},
 year = {2026}
}

Copy | Download

TY  - JOUR
T1  - Geospatial Genomics in Cassava: Integrating GIS, Remote Sensing, and Genomic Tools for Disease-resistant Variety Development in Africa
AU  - Vandi Amara
AU  - Alusaine Edward Samura
AU  - Prince Emmanuel Norman
AU  - James Kargbo
AU  - Suffian Mansaray
Y1  - 2026/05/19
PY  - 2026
N1  - https://doi.org/10.11648/j.ijgg.20261402.14
DO  - 10.11648/j.ijgg.20261402.14
T2  - International Journal of Genetics and Genomics
JF  - International Journal of Genetics and Genomics
JO  - International Journal of Genetics and Genomics
SP  - 76
EP  - 98
PB  - Science Publishing Group
SN  - 2376-7359
UR  - https://doi.org/10.11648/j.ijgg.20261402.14
AB  - Cassava (Manihot esculenta Crantz) is a vital staple crop in sub-Saharan Africa, underpinning food security, rural livelihoods, and agro-industrial development. Despite its resilience to marginal soils and drought, cassava productivity is severely constrained by viral and bacterial diseases, notably cassava mosaic disease (CMD), cassava brown streak disease (CBSD), and cassava bacterial light (CBB). Conventional breeding approaches for disease resistance remain limited by long phenotyping cycles, genotype-by-environment interactions, and inadequate spatial integration. This review highlights the emerging role of geospatial genomics, an interdisciplinary framework that integrates Geographic Information Systems (GIS), remote sensing, and genomic tools to accelerate cassava improvement. GIS enables spatial mapping of disease incidence and environmental gradients, while remote sensing technologies such as satellite indices and drone-based hyperspectral imaging provide real-time crop health monitoring. Advances in genomics, including SNP genotyping, genotyping-by-sequencing, and marker-assisted selection, facilitate the identification of resistance loci and predictive breeding. Integrating spatial datasets with genomic information through environmental association analyses enhances understanding of genotype-environment interactions and supports climate-resilient breeding strategies. Applications of geospatial genomics in cassava include disease surveillance, predictive modeling of outbreaks, targeted germplasm deployment, and improved selection efficiency across diverse agro-ecological zones. Through linking environmental variability, disease dynamics, and genetic diversity, geospatial genomics offers a comprehensive pathway to develop disease-resistant cassava varieties, thereby strengthening food security and sustainable agriculture in Africa.
VL  - 14
IS  - 2
ER  -

Copy | Download

Author Information

Vandi Amara

Department of Crop Protection, Njala University, Moyamba City, Sierra Leone

Contact Email

http://orcid.org/0009-0006-0979-534X
Alusaine Edward Samura

Department of Crop Protection, Njala University, Moyamba City, Sierra Leone

http://orcid.org/0009-0004-2390-1538
Prince Emmanuel Norman

Research and Innovation Unit, Sierra Leone Agricultural Research Institute, Freetown City, Sierra Leone

http://orcid.org/0000-0002-0150-8610
James Kargbo

Department of Crop Science, Njala University, Moyamba, Sierra Leone

http://orcid.org/0009-0008-6270-8051
Suffian Mansaray

Research and Innovation Unit, Sierra Leone Agricultural Research Institute, Freetown City, Sierra Leone

http://orcid.org/0009-0007-0804-6459

Download PDF

Submit an Article

Figure 1

Figure 1. Conceptual framework showing the integration of GIS, remote sensing, and genomic data for disease-resistant and climate-resilient cassava breeding.

Plain Text BibTeX RIS

APA Style

Amara, V., Samura, A. E., Norman, P. E., Kargbo, J., Mansaray, S. (2026). Geospatial Genomics in Cassava: Integrating GIS, Remote Sensing, and Genomic Tools for Disease-resistant Variety Development in Africa. International Journal of Genetics and Genomics, 14(2), 76-98. https://doi.org/10.11648/j.ijgg.20261402.14

Copy | Download

ACS Style

Amara, V.; Samura, A. E.; Norman, P. E.; Kargbo, J.; Mansaray, S. Geospatial Genomics in Cassava: Integrating GIS, Remote Sensing, and Genomic Tools for Disease-resistant Variety Development in Africa. Int. J. Genet. Genomics 2026, 14(2), 76-98. doi: 10.11648/j.ijgg.20261402.14

Copy | Download

AMA Style

Amara V, Samura AE, Norman PE, Kargbo J, Mansaray S. Geospatial Genomics in Cassava: Integrating GIS, Remote Sensing, and Genomic Tools for Disease-resistant Variety Development in Africa. Int J Genet Genomics. 2026;14(2):76-98. doi: 10.11648/j.ijgg.20261402.14

Copy | Download

@article{10.11648/j.ijgg.20261402.14,
  author = {Vandi Amara and Alusaine Edward Samura and Prince Emmanuel Norman and James Kargbo and Suffian Mansaray},
  title = {Geospatial Genomics in Cassava: Integrating GIS, Remote Sensing, and Genomic Tools for Disease-resistant Variety Development in Africa},
  journal = {International Journal of Genetics and Genomics},
  volume = {14},
  number = {2},
  pages = {76-98},
  doi = {10.11648/j.ijgg.20261402.14},
  url = {https://doi.org/10.11648/j.ijgg.20261402.14},
  eprint = {https://article.sciencepublishinggroup.com/pdf/10.11648.j.ijgg.20261402.14},
  abstract = {Cassava (Manihot esculenta Crantz) is a vital staple crop in sub-Saharan Africa, underpinning food security, rural livelihoods, and agro-industrial development. Despite its resilience to marginal soils and drought, cassava productivity is severely constrained by viral and bacterial diseases, notably cassava mosaic disease (CMD), cassava brown streak disease (CBSD), and cassava bacterial light (CBB). Conventional breeding approaches for disease resistance remain limited by long phenotyping cycles, genotype-by-environment interactions, and inadequate spatial integration. This review highlights the emerging role of geospatial genomics, an interdisciplinary framework that integrates Geographic Information Systems (GIS), remote sensing, and genomic tools to accelerate cassava improvement. GIS enables spatial mapping of disease incidence and environmental gradients, while remote sensing technologies such as satellite indices and drone-based hyperspectral imaging provide real-time crop health monitoring. Advances in genomics, including SNP genotyping, genotyping-by-sequencing, and marker-assisted selection, facilitate the identification of resistance loci and predictive breeding. Integrating spatial datasets with genomic information through environmental association analyses enhances understanding of genotype-environment interactions and supports climate-resilient breeding strategies. Applications of geospatial genomics in cassava include disease surveillance, predictive modeling of outbreaks, targeted germplasm deployment, and improved selection efficiency across diverse agro-ecological zones. Through linking environmental variability, disease dynamics, and genetic diversity, geospatial genomics offers a comprehensive pathway to develop disease-resistant cassava varieties, thereby strengthening food security and sustainable agriculture in Africa.},
 year = {2026}
}

Copy | Download

TY  - JOUR
T1  - Geospatial Genomics in Cassava: Integrating GIS, Remote Sensing, and Genomic Tools for Disease-resistant Variety Development in Africa
AU  - Vandi Amara
AU  - Alusaine Edward Samura
AU  - Prince Emmanuel Norman
AU  - James Kargbo
AU  - Suffian Mansaray
Y1  - 2026/05/19
PY  - 2026
N1  - https://doi.org/10.11648/j.ijgg.20261402.14
DO  - 10.11648/j.ijgg.20261402.14
T2  - International Journal of Genetics and Genomics
JF  - International Journal of Genetics and Genomics
JO  - International Journal of Genetics and Genomics
SP  - 76
EP  - 98
PB  - Science Publishing Group
SN  - 2376-7359
UR  - https://doi.org/10.11648/j.ijgg.20261402.14
AB  - Cassava (Manihot esculenta Crantz) is a vital staple crop in sub-Saharan Africa, underpinning food security, rural livelihoods, and agro-industrial development. Despite its resilience to marginal soils and drought, cassava productivity is severely constrained by viral and bacterial diseases, notably cassava mosaic disease (CMD), cassava brown streak disease (CBSD), and cassava bacterial light (CBB). Conventional breeding approaches for disease resistance remain limited by long phenotyping cycles, genotype-by-environment interactions, and inadequate spatial integration. This review highlights the emerging role of geospatial genomics, an interdisciplinary framework that integrates Geographic Information Systems (GIS), remote sensing, and genomic tools to accelerate cassava improvement. GIS enables spatial mapping of disease incidence and environmental gradients, while remote sensing technologies such as satellite indices and drone-based hyperspectral imaging provide real-time crop health monitoring. Advances in genomics, including SNP genotyping, genotyping-by-sequencing, and marker-assisted selection, facilitate the identification of resistance loci and predictive breeding. Integrating spatial datasets with genomic information through environmental association analyses enhances understanding of genotype-environment interactions and supports climate-resilient breeding strategies. Applications of geospatial genomics in cassava include disease surveillance, predictive modeling of outbreaks, targeted germplasm deployment, and improved selection efficiency across diverse agro-ecological zones. Through linking environmental variability, disease dynamics, and genetic diversity, geospatial genomics offers a comprehensive pathway to develop disease-resistant cassava varieties, thereby strengthening food security and sustainable agriculture in Africa.
VL  - 14
IS  - 2
ER  -

Copy | Download