January 21, 2026 14:00 CEST
Live on Microsoft Teams.
On 21 January at 14:00 CEST, the ESA Φ-Lab Collaborative Innovation Network will host a new Φ-talk. Details are below.
Meet the speakers
- Nicolas Longépé, Earth Observation Data Scientist at ESA
- Yassine El Ouahidi, AI Applied Scientist at Mistral
- Àlex Ramírez Atrio, Senior Deep Learning Scientist at Pi School
Talk abstract
Recent advances in Large Language Models (LLMs) have created opportunities to support reasoning, discovery, and synthesis in Earth Observation (EO) and Earth Sciences, provided domain specificity and reliability can be ensured. In this work, we introduce Earth Virtual Expert (EVE), a comprehensive open-source initiative to develop, evaluate, and deploy a domain-specialized LLM for EO. EVE serves as a testbed for studying domain-adaptive training, grounded generation, and evaluation strategies tailored to scientific use, rather than general-purpose conversational performance. As part of this initiative, we present EVE-instruct, a text-only, instruction-tuned and aligned LLM specialized for EO. Built on Mistral Small 3.2 (24B parameters) with a 128k context window, it focuses on domain-specific reasoning, question answering, and retrieval– and hallucination-aware generation, without significant tradeoff of general capabilities. We release all data used to train and evaluate EVE-instruct: a large-scale curated EO corpus of 3B tokens, synthetically generated fine-tuning datasets derived from this corpus (4B tokens), and manually-created EO-specific evaluation test sets comprising 7500 samples across multiple-choice and open-ended question answering, and factuality test sets. To support trustworthy usage and deployment, we further develop a Retrieval-Augmented Generation (RAG) database from the curated corpus and a hallucination-detection module focused on factual consistency and scientific grounding. These components are integrated with EVE-instruct and deployed with a graphical user interface and accessible via API, currently supporting more than 300 users from the EO research and industry field.
All models, datasets, and code are publicly released on HuggingFace and Github.
Agenda
- EVE general overview - Nicolas Longépé, Earth Observation Data Scientist @ ESA (10 minutes)
- Recent advances in LLMs - Yassine El Ouahidi, AI Applied Scientist @ Mistral (15 minutes)
- EVE technical overview - Àlex Ramírez Atrio, AI Scientist @ Pi School (15 minutes)
- EVE demonstration - Nicolas Longépé, Earth Observation Data Scientist @ ESA (10 minutes)
- Q&A
Register here!