Repository logo
Log In(current)
  • Inicio
  • Personal de Investigación
  • Unidad Académica
  • Publicaciones
  • Colecciones
    Datos de Investigacion Divulgacion cientifica Personal de Investigacion Protecciones Proyectos Externos Proyectos Internos Publicaciones Tesis
  1. Home
  2. Universidad de Santiago de Chile
  3. Publicaciones
  4. Forecasting Crop Yields in Rainfed India: A Comparative Assessment of Machine Learning Baselines and Implications for Precision Agribusiness
Details

Forecasting Crop Yields in Rainfed India: A Comparative Assessment of Machine Learning Baselines and Implications for Precision Agribusiness

Journal
Agriculture (Switzerland)
ISSN
2077-0472
Date Issued
2025
Author(s)
Derpich-Contreras, I  
DOI
https://doi.org/10.3390/agriculture16010065
Abstract
Machine learning (ML) has emerged as a practical approach to forecasting crop yields in climate-vulnerable, rainfed agricultural systems where production uncertainty is strongly influenced by monsoon variability. In India s semi-arid and sub-humid regions, reliable yield forecasts are critical for agribusiness planning and managing climate risks. This study presents a standardized evaluation of three widely used ML forecasting models-Linear Regression (LR), Random Forest (RF), and Support Vector Regression (SVR)-for rainfed cereal yields in eight Indian administrative divisions from 2000 to 2025. The study applied a unified methodological framework that included data cleaning, z-score normalization, domain-informed feature selection, strict chronological train-test splitting, and five-fold cross-validation. The dataset integrates agroclimatic and soil variables, including temperature, precipitation, relative humidity, wind speed, and soil pH, comprising approximately 1250 division-year observations. Model performance was assessed on an independent, temporally held-out test set using root mean square error (RMSE), mean absolute error (MAE), and R2. The results show that RF provides the most robust predictive performance under realistic forecasting conditions. It achieved the lowest RMSE (0.268 t/ha) and the highest R2 (0.271), outperforming LR and SVR. Although the explained variance is modest, it reflects strict temporal validation and the inherent uncertainty of rainfed systems. Feature importance analysis highlights temperature and precipitation as dominant yield drivers. Overall, this study establishes a conservative and reproducible baseline for operational machine learning (ML)-based yield forecasting in precision agribusiness.
Get Involved!
  • Source Code
  • Documentation
  • Slack Channel
Make it your own

DSpace-CRIS can be extensively configured to meet your needs. Decide which information need to be collected and available with fine-grained security. Start updating the theme to match your Institution's web identity.

Need professional help?

The original creators of DSpace-CRIS at 4Science can take your project to the next level, get in touch!

Logo USACH

Universidad de Santiago de Chile
Avenida Libertador Bernardo O'Higgins nº 3363. Estación Central. Santiago Chile.
ciencia.abierta@usach.cl © 2023
The DSpace CRIS Project - Modificado por VRIIC USACH.

  • Accessibility settings
  • Privacy policy
  • End User Agreement
  • Send Feedback
Logo DSpace-CRIS
Repository logo COAR Notify