Published: August 2, 2025
43
337
2.7k

Google DeepMind just released one of the most important tools in geospatial data science. It’s called AlphaEarth Foundations. I want to break it down for you in intuitive terms:

Image in tweet by Yohan

We have petabytes of satellite images. But it’s still hard to answer questions like: • What’s in this image? • How has it changed? • What kind of crop or forest is this? AlphaEarth helps answer these questions, even in places with limited data.

AlphaEarth is a foundation model for Earth Observation. It turns raw satellite data into compact numerical representations, called embeddings.

Image in tweet by Yohan

𝗦𝗼, 𝘄𝗵𝗮𝘁 𝗮𝗿𝗲 𝗲𝗺𝗯𝗲𝗱𝗱𝗶𝗻𝗴𝘀? To understand embeddings, it probably helps to work through an example. Imagine we have the following graph: You can see that there are just two dimensions to this graph. A ‘greenness’ dimension and a ‘treeness’ dimension.

Image in tweet by Yohan

Now imagine if we take a satellite image and break it down into small grids, like the following. Let’s focus on one specific grid - i.e. the one I’ve selected in orange below:

Image in tweet by Yohan

If we took this orange grid, and situated it on our graph (or ‘embedding space’), it would probably sit somewhere here (since it’s both pretty green, and pretty tree-like):

Image in tweet by Yohan

Congratulations, if you’ve followed this basic example, you’ve got the idea of what an embedding is.

Now, imagine if instead of us just having two dimensions, we have dozens. E.g. it may cover anything from ‘blueness’ to ‘urbanness’ to ‘smokeyness’, etc. And then, instead of us just having one grid from one satellite image, we situate every grid from billions of satellite

You can think of embeddings as a way to do two things: 𝟭. 𝗖𝗼𝗺𝗽𝗿𝗲𝘀𝘀 a large amount of information from images into a small set of numbers/coordinates, and 𝟮. 𝗣𝗼𝘀𝗶𝘁𝗶𝗼𝗻 similar parts of an image close together in a multi-dimensional space, so that the model can

Here’s a visual example that helps make it more concrete. It shows how about 1,000 words are represented using embeddings across 200 dimensions. Here the word “Albania” ends up near other related words like “Albanian”. The model has learned that they often appear in similar

Image in tweet by Yohan

Once you’ve got good embeddings, you can: • Detect change over time • Find places with similar landscapes • Classify land cover using less training data • Fill gaps where no data exists This saves time, resources, and improves accuracy.

What makes AlphaEarth stand out? • It uses 𝗱𝗶𝘃𝗲𝗿𝘀𝗲 𝗱𝗮𝘁𝗮- satellites, field measurements, and climate records • It works 𝗮𝗰𝗿𝗼𝘀𝘀 𝘁𝗶𝗺𝗲: not just static images • It’s 𝗵𝗶𝗴𝗵-𝗿𝗲𝘀𝗼𝗹𝘂𝘁𝗶𝗼𝗻: 10m² • It 𝗴𝗲𝗻𝗲𝗿𝗮𝗹𝗶𝘀𝗲𝘀 well, even with limited

There are four key innovations: 𝟭. 𝗧𝗶𝗺𝗲-𝗮𝘄𝗮𝗿𝗲 𝗺𝗼𝗱𝗲𝗹𝗹𝗶𝗻𝗴: learns from sparse or irregular data 𝟮. 𝗦𝗽𝗮𝗰𝗲-𝗧𝗶𝗺𝗲 𝗣𝗿𝗲𝗰𝗶𝘀𝗶𝗼𝗻 𝗲𝗻𝗰𝗼𝗱𝗶𝗻𝗴: uses attention layers and convolutions 𝟯. 𝗧𝗲𝘅𝘁 𝗮𝗹𝗶𝗴𝗻𝗺𝗲𝗻𝘁: links image data with

Image in tweet by Yohan

𝗛𝗼𝘄 𝘄𝗲𝗹𝗹 𝗱𝗼𝗲𝘀 𝗶𝘁 𝗽𝗲𝗿𝗳𝗼𝗿𝗺? Across 15 tasks, AlphaEarth outperformed other models. It did best in: • Crop type mapping • Tree species classification • Evapotranspiration estimation • Land use change detection

Image in tweet by Yohan

Interestingly, AEF seems to be able to estimate biophysical variables that are continuous rather than categorical. AEF is the only model that meaningfully predicted evapotranspiration, a key variable for farming, water planning, and climate work.

Image in tweet by Yohan

You can access AlphaEarth embeddings for free in Google Earth Engine. No need to host the model or download anything. Just plug it into your existing GEE workflows.

Image in tweet by Yohan

Here’s what you can do with it: 1. Similarity search Pick any point on Earth, and find all locations with similar environmental conditions. 2. Change detection Compare embeddings over time to track changes like wildfires or urban growth.

Image in tweet by Yohan

3. Clustering Group areas with similar features, with no labels required. This is great for identifying forest types, soil regions, or urban patterns. 4. Low-shot classification Train accurate maps with far fewer labelled points.

Image in tweet by Yohan

Google is also offering grants (up to $5,000) to test new use-cases for these embeddings. You can apply here: https://docs.google.com/forms/...

The takeaway: AlphaEarth Foundations is a big drop in the geospatial space. It’s fast, accurate, easy to use, and available now. Expect even more models like this in future.

If you liked this, you might enjoy this post on foundation models: https://x.com/yohaniddawela/st... And give us a follow @yohaniddawela for more breakdowns on geospatial topics.

Interested in getting a short overview of the latest geospatial papers and datasets each week? Subscribe to the Spatial Edge newsletter: http://yohan.so

Image in tweet by Yohan

@yohaniddawela Thanks for sharing. I’ll check it out.

@archi_reum Awesome

@yohaniddawela Overlooked skills gaps slow growth more than tech debt...esp. in fast-moving markets. Ignoring talent bottlenecks creates bigger problems later.

@codewithimanshu random AI reply... Need to update your script bro

@yohaniddawela There’s bound to be some alpha in using it to e.g. predict global crop yields etc.

@janekm Yeah I’m sure that will be a use case. Lots of people trying to use earth observation for yield estimation.

@yohaniddawela Good breakdown 👌

@_AI_Frontiers_ Thanks - hope it helps

@yohaniddawela Does it include SAR data?

@WitCoHE_Bak Yes it does

Image in tweet by Yohan

@yohaniddawela Nice, I did this back in 2004!

@yohaniddawela Is it a thingy that let's you turn satellite imaging data into 3D topography models with significantly high levels of detail never before seen?

@ProfChocCaXe No but something like this could be along the lines of what you're itnerested in: https://x.com/yohaniddawela/st...

Share this thread

Read on Twitter

View original thread

Navigate thread

1/36