Repositorio Institucional del CAB-IB

Depositar documentos | Cómo depositar | Registrarse

Estudio de técnicas de machine learning aplicadas a la clasificación de cultivo en imágenes satelitales. / Study of machine learning techniques applied to crop classification on satelite imagery.

Aguirre, Pablo A. (2019) Estudio de técnicas de machine learning aplicadas a la clasificación de cultivo en imágenes satelitales. / Study of machine learning techniques applied to crop classification on satelite imagery. Proyecto Integrador Ingeniería en Telecomunicaciones, Universidad Nacional de Cuyo, Instituto Balseiro.

Vista previa

PDF (Tesis)
Disponible bajo licencia Creative Commons: Reconocimiento - No comercial - Compartir igual.
Español
2214Kb

Resumen en español

El monitoreo de cultivo cumple un rol importante en la agricultura. Durante los últimos años se exploró el uso de algoritmos de aprendizaje supervisado (machine learning) en conjunto con imágenes satelitales de los cultivos a monitorear como una herramienta importante en la automatización de este proceso, pudiendo monitorear grandes extensiones de cultivo de manera rápida y eficiente. En este trabajo se estudiaron los principios físicos del sensado remoto (con un enfoque en sensado satelital), los tipos de imágenes que se pueden recolectar de un terreno y las características especificas de cada uno. Se investigo como acceder a dichas imágenes, que son de dominio público. También se estudiaron distintos algoritmos de machine learning y sus bases matemáticas, junto con posibles métricas de rendimiento y sus características. Luego se aplicaron estos algoritmos a la clasificación de cultivos utilizando imágenes ópticas y de radar. Para esto se analizo cual sería la mejor manera de implementar estos algoritmos, utilizando finalmente el lenguaje Python con las implementaciones de los algoritmos encontradas en la librería Scikit-learn. El objetivo del trabajo fue realizar una comparación del rendimiento de los algoritmos mas utilizados en la clasificación de cultivo bajo distintas condiciones. Los métodos de clasificación principalmente estudiados fueron las técnicas de redes neuronales, maquinas de soporte vectorial, arboles de decisión y bosques aleatorios. Se trabajó buscando la mejor combinación de parámetros de cada clasificador mediante barridos aplicando cross-validation de 5 pliegues. Entre las condiciones estudiadas las más importantes fueron la clasificación considerando y no considerando al tiempo como una variable mas, todo esto utilizando datos de distinta naturaleza (ópticos, radar o ambos). En la clasificación con datos en los que no se tiene en cuenta la evolución temporal del cultivo el mejor clasificador obtenido utilizando datos ópticos fue una maquina de soporte vectorial con un accuracy del 91% y un kappa de 0.87. El mejor clasificador entrenado con datos de radar también fue una máquina de soporte vectorial con accuracy de 73% y kappa de 0.60. Respecto a la clasificación con datos en los que se tiene en cuenta la evolución temporal, el mejor clasificador entrenado con imágenes ópticas fue una maquina de soporte vectorial con accuracy de 93% y kappa de 0.91. El mejor clasificador obtenido utilizando datos de radar fue un bosque aleatorio con kappa 0.86 y un accuracy de 95 %. Se realizó un análisis del efecto que tiene el tipo de datos utilizado en la clasificación multitemporal de cultivos con periodos de siembra y cosecha similares, llegando a la conclusión de que los datos de radar son más efectivos que los ópticos en estos casos. Sin embargo utilizar la combinación de datos de radar y ópticos fue lo que mejor rendimiento obtuvo. Por otro lado, se estudió el efecto de entrenar los algoritmos con una base de datos que posee un numero de muestras distinto para cada clase. Esto llevó a la conclusión de que los algoritmos priorizan la correcta clasificación de las clases que presentan mas muestras de entrenamiento. Este estudio indica que es muy importante que los datos de entrenamiento sean una muestra representativa del contexto en el que se van a utilizar los algoritmos.

Resumen en inglés

Crop monitoring has an important role in agriculture. During the last years the use of machine learning algorithms with satellite imagery altogether was explored as an important tool in automation of this process because of its ability to monitor big extensions of eld in a fast and efficient way. In this work the physical principles of remote sensing were studied (with focus on satellite sensing) along with the possible types of imagery you can gather from agricultural terrain and specic characteristics of every one of this types. How to acquire this public domain images was investigated. Different machine learning algorithms were also studied along with their mathematical basis and possible performance metrics. Then this algorithms were applied to crop classication using optical and radar imagery. For this purpose the best way to implement said algorithms was analyzed. Finally coding language Python with the algorithms implementations found in Scikit-learn library were used. The objective of this work was to make a comparation of the most used algorithms performances on crop classication under different conditions. The mainly studied methods were support vector machines, neural networks, decision trees and random forests. Research was made looking for the best combination of parameters for each classier using 5-fold cross-validation. The most important conditions studied were multitemporal and single date classication, using data of different nature (optical, radar or both). On single-date classication, the best classier obtained using optical data was a support vector machine with accuracy of 91% and a kappa coefficient of 0.87. The best classier trained with radar data was also a support vector machine with obtained accuracy of 73% and a 0.60 kappa coefficient. Regarding multitemporal classication, the best classier trained with optical data was a support vector machine with obtained accuracy of 93% and a 0.91 kappa coefficient. The best classier obtained using radar data was a random forest with obtained kappa of 0.86 and accuracy of 95%. An analysis of the effect of the type of data used on multitemporal classication of crops with similar planting and harvest periods was made. This study led to the conclusion that radar data is more effective than optical in this cases. However, using a combination of both radar and optical data gave the best results. On other side, the effect of trainning algorithms with a database that has a different number of samples for every class was studied. This led to the conclusion that the algorithms priorize the correct classication of the classes that have the most trainning samples. This study indicate that it is very important that the trainning data consist in a representative sample of the context the algorithms are going to be utilized.

Tipo de objeto:	Tesis (Proyecto Integrador Ingeniería en Telecomunicaciones)
Palabras Clave:	[Machine learning; Crop classification; Clasificación de cultivo; Multitemporal classification; Clasificación multitemporal; Single-date classification; ; Clasificación de una sola fecha; Satellite sensing; Sensado satelital; Sattelite imagery; Imágenes satelitales]
Referencias:	[1] Pacala, S., Socolow, R. Stabilization wedges: Solving the climate problem for the next 50 years with current technologies. Science, 2004. 1 [2] Peña-Barragan, J. M., López-Granados, F., García-Torres, L., Jurado-Exposito, M., Sanchez de la Orden, M., García-Ferrer, A. Discriminating cropping systems and agro-enviromental measures by remote sensing. Agronomy for Sustainable Development, 2008. 1 [3] Peña, J. M., Ngugi, M., Plant, R. Object-based crop identication using multiple vegetation indices, textural features and crop phenology. Remote Sens. Environ, 2011. 1 [4] De Wit, A., Clevers, J. Efficiency and accuracy of per-eld classication for operational crop mapping. Int. J. Remote Sens, 2004. 1 [5] Peña, J. M. e. a. Object-based image classication of summer crops with machine learning methods. Remote Sensing, 2014. 1, 15, 27, 46, 53, 54 [6] Blaschke, T. Object based image analysis for remote sensing. ISPRS Journal of Photogrammetry and Remote Sensing, 2009. 2, 27 [7] Orynbaikyzy, A., Gessner, U., Conrad, C. Crop type classication using a combination of optical and radar remote sensing data: a review. International Journal of Remote Sensing, 2019. 2, 15 [8] Chuvieco, E. Fundamentals of Satellite Remote Sensing: An Enviromental Approach. 2a editon. CRC Press, 2016. 2, 5 [9] ESA. Sentinel-1, instrument payload, 2019. URL https://sentinel.esa.int/web/sentinel/missions/sentinel-1/instrument-payload, [Online; accedido 17-09-2019]. 7 [10] ESA. Sentinel-2, 2019. URL https://sentinel.esa.int/web/sentinel/missions/sentinel-2, [Online; accedido 17-09-2019]. 7 [11] Canty, M. J. Image Analysis, Classication and Change Detection in Remote Sensing. CRC Press, 2014. 9, 13, 14 [12] Carletta, J. Assessing agreement on classication tasks: The kappa statistic. Computational linguistics, 1996. 11 [13] Arthur, D., Vassilvitskii, S. k-means++:the advantages of careful seeding. Stanford Semantic School, 2007. 13 [14] Chan, J., et al. An evaluation of ensemble classiers for mapping natura 2000 heathland in belgium using spaceborne angular hyperspectral ( chris/proba ) imagery. Int. J. Appl. Earth Obs. Geoinf., 2012. 15, 19 [15] Saini, R., Ghosh, S. Crop classication on single date sentinel-2 imagery using random forest and support vector machine. ISPRS Journal of Photogrammetry and Remote Sensing, 2018. 19, 38, 53 [16] McNairn, H., Champagne, C., Shang, J., Holmstrom, D., Reichert, G. Integration of optical and synthetic aperture radar (sar) imagery for delivering operational annual crop inventories. ISPRS Journal of Photogrammetry and Remote Sensing, 2008. 38, 46, 52, 53, 54 [17] Akhter, S., et al. Machine learning approaches on crop pattern recognition, a comparative analysis. IEEE, 2018. 46, 53 [18] Song, Q., et al. Object-based feature selection for crop classication using multitemporal high-resolution imagery. International Journal of Remote Sensing, 2018. 46, 53 [19] Zhong, L., Hu, L., Zhou, H. Deep learning based multi-temporal crop classication. Remote Sensing of Environment, 2019. 46, 53 [20] Del Frate, F., et al. Crop classication using multiconguration c-band sar data. IEEE Transactions on Geoscience and remote sensing, 2003. 52, 53, 54 [21] Xu, L., et al. Crop classication based on temporal information using sentinel-1 sar time-series data. Remote Sensing, 2018. 15, 52, 54 [22] Bishop, C. M. Pattern Recognition and Machine Learning. Springer, 2006. 15 [23] Peixeiro, M. How to improve a neural network with regularization, 2019. URL https://towardsdatascience.com/how-to-improve-a-neural-network-with-regularization-8a18ecda9fe3, [Online; accedido 04-11-2019]. 18 [24] Scikit-learn. Random Forests, 2019. URL https://scikit-learn.org/stable/modules/ensemble.html#forest, [Online; accedido 04-11-2019]. 19 [25] Sonobe, R., et al. Assessing the suitability of data from sentinel-1a and 2a for crop classication. GIScience remote Sens., 2017. 19 [26] USGS. Landsat missions, Landsat 8, 2019. URL https://www.usgs.gov/land-resources/nli/landsat/landsat-8?qt-science_support_page_related_con=0#qt-science_support_page_related_con, [Online; accedido10-10-2019]. 21 [27] del sur, S. Agricultura argentina, 2019. URL https://surdelsur.com/es/agricultura-argentina/, [Online; accedido 09-10-2019]. 38, 39
Materias:	Ingeniería en telecomunicaciones > Aprendizaje automático
Divisiones:	INVAP
Código ID:	872
Depositado Por:	Tamara Cárcamo
Depositado En:	12 Mar 2021 08:18
Última Modificación:	12 Mar 2021 08:18

Personal del repositorio solamente: página de control del documento

Repositorio Institucional del Centro Atómico Bariloche y el Instituto Balseiro. 2010-2022
Biblioteca Leo Falicov - Centro Atómico Bariloche - Av. Bustillo 9500 (R8402AGP) - S. C. de Bariloche, R.N. | Argentina
https://ricabib.cab.cnea.gov.ar - ri.admin@cab.cnea.gov.ar