Can Earth’s Magnetic Field Help Predict Cold Waves Weeks in Advance? A New Approach to Long-Range Weather Forecasting

Long-range weather prediction is one of the great challenges of modern science.

We can forecast the next 3 to 5 days with remarkable accuracy – but beyond 10 days, the atmosphere becomes chaotic, and forecasting extreme cold becomes much harder.

Yet a new idea is emerging from the intersection of space physics, atmospheric science, and data analytics:

Earth’s magnetic field, measured from space, might provide early clues about upcoming cold waves – not as a cause, but as an indicator.

This article explains that idea in a simple and accessible way.

Why predicting cold outbreaks is so difficult

Cold outbreaks – those sudden plunges of Arctic air that hit Europe or North America – usually begin far above our heads, in the stratosphere. This is where the polar vortex lives: a giant spinning structure of cold air that can stretch, weaken, or even split apart.

When the polar vortex becomes unstable, it can set off a chain reaction:

  • The jet stream becomes wavier.
  • High-altitude air patterns shift.
  • Cold Arctic air spills southward 2–3 weeks later.

Meteorologists track these signals, but early detection remains difficult. Most traditional data sources only see the atmosphere after the shift has begun.

What if we had a way to sense these changes earlier?

Why look at Earth’s magnetic field?

Earth is surrounded by a magnetic bubble called the magnetosphere, and just below it lies the ionosphere, a layer filled with charged particles.

These upper layers respond sensitively to:

  • changes in atmospheric circulation,
  • waves rising from the lower atmosphere,
  • disturbances in the polar regions, and
  • interactions between solar activity and Earth’s environment.

When the atmosphere changes dramatically – especially over the poles – the magnetic environment often reacts.

This is where ESA’s SWARM satellites come in.

What is SWARM?

SWARM is a constellation of three satellites launched by the European Space Agency.

Their mission? To measure Earth’s magnetic field with exceptional precision.

Every day, SWARM records millions of data points describing:

  • the strength of the magnetic field,
  • the electrical currents flowing in the ionosphere,
  • the level of “agitation” in the polar regions, and
  • how these conditions change over time.

Although SWARM was not designed for weather forecasting, its data provides a unique view of the upper atmosphere, where the early symptoms of cold outbreaks often originate.

An important clarification: this is not about causality

We are not saying that magnetic changes cause cold waves.

The atmosphere does not listen to the magnetic field.

Instead, the magnetic field acts as a mirror or indicator of large-scale dynamical changes happening above us.

Think of it like a thermometer:

A thermometer does not cause a fever. But it can tell you something important is happening.

Magnetic field variations work the same way.

How magnetic signals could warn us 2–3 weeks ahead

Scientists have identified several magnetic signatures that often appear before the atmosphere shifts:

1. Polar magnetic “agitation”

When polar regions become disturbed, the magnetic field fluctuates more strongly.

This can be measured through a simple index: the daily variability of the magnetic field at high latitudes.

2. North–South magnetic asymmetry

If one hemisphere becomes much more “active” than the other, it can reflect imbalances in the polar vortex and jet stream.

3. Slow magnetic trends

Certain long-lasting magnetic patterns may be linked to energy waves traveling upward from the lower atmosphere.

These signals are not perfect predictors, but they carry information that traditional meteorological models may not see.
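To make this concrete, the three signatures above can be turned into simple daily indices. The sketch below uses purely synthetic data, and the index definitions (daily standard deviation, a variability ratio, a 7-day rolling mean) are illustrative assumptions rather than the formulas used by any operational system:

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic 1-minute magnetic field strength (nT) at high northern and
# southern latitudes over 30 days (purely illustrative numbers).
minutes_per_day = 24 * 60
days = 30
north = 50_000 + rng.normal(0, 5, size=(days, minutes_per_day))
south = 50_000 + rng.normal(0, 4, size=(days, minutes_per_day))

# 1. Polar "agitation": daily variability of the field at high latitudes.
agitation_north = north.std(axis=1)          # one value per day

# 2. North-South asymmetry: ratio of daily variability between hemispheres.
asymmetry = north.std(axis=1) / south.std(axis=1)

# 3. Slow trend: 7-day rolling mean of the agitation index.
kernel = np.ones(7) / 7
slow_trend = np.convolve(agitation_north, kernel, mode="valid")

print(agitation_north.shape, asymmetry.shape, slow_trend.shape)
```

With real Swarm measurements, the same three arrays would come from the satellite time series instead of a random generator; the point is only that each signature reduces to one number per day that can be tracked over time.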

Testing the idea: does it actually work?

To explore this concept, researchers create statistical models that compare:

  • magnetic variations from SWARM, and
  • real cold outbreaks recorded in weather data.

In simple backtests:

  • Strong magnetic disturbances often appear 10 to 20 days before major cold events.
  • When magnetic activity in the polar regions is in the top 10% of values, the probability of a cold outbreak in the following three weeks can increase significantly.

It’s not a magic crystal ball, but it’s a useful leading indicator, especially when combined with traditional forecasting tools like the NAO or AO index.
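A backtest of this kind fits in a few lines. Everything below is synthetic and illustrative (the index, the cold-outbreak record, the 14-day lag, and the 21-day horizon are all assumptions), but it shows the shape of the test: compare the outbreak frequency after top-decile index days against the overall frequency:

```python
import numpy as np

rng = np.random.default_rng(1)

n_days, lag, horizon = 3000, 14, 21
# Hypothetical daily polar magnetic-activity index (arbitrary units).
index = rng.gamma(2.0, 1.0, n_days)
hot = index > np.quantile(index, 0.9)   # top 10% of values

# Synthetic cold-outbreak record, deliberately built so that cold days
# are more likely `lag` days after a high-index day.
p_cold = np.full(n_days, 0.01)
p_cold[lag:] += 0.30 * hot[:-lag]
cold = rng.random(n_days) < p_cold

def outbreak_after(day: int) -> bool:
    """Did any cold outbreak occur within `horizon` days after `day`?"""
    return bool(cold[day + 1 : day + 1 + horizon].any())

valid = range(n_days - horizon - 1)
p_all = np.mean([outbreak_after(d) for d in valid])
p_top = np.mean([outbreak_after(d) for d in valid if hot[d]])
print(f"P(outbreak within {horizon}d): overall {p_all:.2f}, "
      f"after top-decile days {p_top:.2f}")
```

Because the synthetic data was built with a real lagged link, the conditional probability comes out higher than the baseline; on real data, that gap (or its absence) is exactly what a backtest would measure.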

Why this matters

If confirmed by real-world testing, this method could help:

  • power grid operators prepare for surges in heating demand,
  • farmers anticipate frost risk,
  • governments plan emergency responses, and
  • meteorologists refine their long-range outlooks.

Every extra day of warning can save money, protect infrastructure, and reduce risks.

The path forward

This approach is still in its early stages, but the potential is exciting.

Future steps include:

  • Large-scale analysis of SWARM data from 2014 to today
  • Integration with long-range weather models
  • Machine learning models trained to detect subtle magnetic precursors
  • Seasonal dashboards that estimate cold-outbreak probabilities

We are only beginning to discover how the upper atmosphere and magnetic environment reflect deep dynamical processes on Earth.

In summary

Earth’s magnetic field does not control the weather. But it is sensitive to the same forces that trigger cold outbreaks. Thanks to ESA’s SWARM satellites, we now have a way to observe these signals globally and continuously. Early tests suggest that magnetic indicators may offer a 10–30 day early-warning signal for extreme cold.

This new approach is not meant to replace traditional weather forecasting — it is meant to enhance it, giving us a new window into the hidden processes that shape our climate.

Stop Rereading Your PDFs: a plain-English guide to Token-Direct Visual RAG

TL;DR: Instead of converting your whole document library to text and searching that text, we search each page’s visual tokens (smart “patches” of the image). We find the right pages fast, then decode those exact tokens directly with DeepSeek-OCR to get the text and answer the question. No training needed. No full-document OCR passes. Just search → decode tokens → answer.


Why “text-first” RAG keeps letting you down

Classic RAG does this:

  1. OCR every page to text
  2. Split that text into chunks
  3. Embed & search those chunks
  4. Ask an LLM to answer

It’s okay for clean docs, but it breaks on:

  • multi-column layouts, tables, stamps, math, receipts
  • big OCR bills up front (or repeatedly)
  • brittle retrieval (if OCR misses a word, you never find it)

The flip: search the page itself, then decode

Our idea is simple:

  1. Turn every page image into compact visual tokens once.
  2. Turn your question into a tiny image (plus 2–5 short variants) and make tokens for that too.
  3. Use ColBERT-style matching to find the pages whose tokens best match your question tokens.
  4. Directly decode those winning page tokens with DeepSeek-OCR to get faithful text.
  5. Let a lightweight LLM read the snippets and reply with citations.

Key point: we don’t run OCR across the corpus. We decode directly from the tokens we just retrieved. Nothing else.
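Step 3, the ColBERT-style matching, can be sketched with plain NumPy. The page and query tokens below are random stand-ins for real visual-token embeddings (a real system would get them from the page encoder), but the scoring rule is the actual late-interaction idea: every query token keeps only its best-matching page token, and those best scores are summed:

```python
import numpy as np

rng = np.random.default_rng(42)

def normalize(x: np.ndarray) -> np.ndarray:
    """L2-normalize embeddings along the last axis."""
    return x / np.linalg.norm(x, axis=-1, keepdims=True)

def maxsim(query_tokens: np.ndarray, page_tokens: np.ndarray) -> float:
    """ColBERT-style late interaction: for every query token, take the
    cosine similarity of its best-matching page token, then sum."""
    sims = normalize(query_tokens) @ normalize(page_tokens).T
    return float(sims.max(axis=1).sum())

dim = 128
# Stand-ins for the cached visual tokens of three pages (256 per page).
pages = [rng.normal(size=(256, dim)) for _ in range(3)]
# Stand-in for the query image's tokens; we plant near-copies of a few
# tokens from page 1 so retrieval has a known right answer.
query = pages[1][:8] + rng.normal(scale=0.01, size=(8, dim))

scores = [maxsim(query, page) for page in pages]
best_page = int(np.argmax(scores))
print(best_page)  # page 1 wins by construction
```

In the full pipeline, the tokens of `pages[best_page]` are exactly what gets handed to DeepSeek-OCR for decoding; the ranking and the decoding operate on the same cached representation.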


Quick analogy

Each page is a mosaic of little magnetic tiles (visual tokens).
Your question becomes a mini mosaic too.
We bring them together; the tiles that “snap” hardest reveal the right pages.
Then we read those snapped tiles—not the whole wall.


Where ColBERT and DeepSeek-OCR fit (no jargon)

  • ColBERT: a retrieval trick that compares your question in small pieces to a page in small pieces, then adds up the best matches. It’s precise and great for spotting details.
  • DeepSeek-OCR: a modern OCR that can take those visual tokens directly and output text. No re-encoding pixels. No full-page OCR needed at question time.

Together: ColBERT finds the right tokens; DeepSeek-OCR reads those tokens.


How it works (for non-devs)

  1. Index once — We convert each page into visual tokens and store them.
  2. Ask anything — Your question becomes a tiny text image (plus a few synonyms), then we make tokens for it.
  3. Match by parts — We compare little pieces of your question to little pieces of every page and rank the best pages.
  4. Decode tokens — We hand the winning page tokens straight to DeepSeek-OCR and get back the exact text.
  5. Answer + cite — A small LLM assembles the final answer and cites the pages it used.

Why this is different from text-based RAG

| Topic                | Text-first RAG                    | Token-Direct Visual RAG                  |
| -------------------- | --------------------------------- | ---------------------------------------- |
| Where search happens | Over OCR’d text chunks            | Over visual tokens of each page          |
| OCR at query time    | Often heavy or repeated           | Direct token decoding (no full-doc OCR)  |
| Layout fidelity      | Tables/columns can get mangled    | Preserved until decoding                 |
| Compute              | OCR + chunking + embeddings first | Search first, then decode matched tokens |
| Traceability         | “Which chunk produced this?”      | The same tokens that matched are decoded |

What you get in practice

  • Speed & lower cost: We don’t re-OCR or re-embed everything each time.
  • Faithful answers: We decode precisely the tokens that matched the query.
  • Great on messy layouts: Invoices, forms, multi-column reports, tables, stamps.
  • Zero training: Works out-of-the-box with standard ColBERT-style matching and DeepSeek-OCR.

Example: “What’s the total due on the March invoice?”

Old way: OCR the whole invoice, hope the table survived, hope the right chunk exists, then search the chunks.
Our way: Match your query-image (“total due March invoice”) against page tokens, jump straight to the bottom-right box that matched, decode those tokens directly, and answer—with a link to that page.


FAQ

Do we still “do OCR”?
We decode tokens directly with DeepSeek-OCR. That’s different from running OCR over every page. We decode only the tokens we retrieved, not entire documents.

Is there any training?
No. This is a zero-train pipeline. You can ship it as is.

What if I want summaries instead of verbatim text?
Today, we decode the matched tokens verbatim (fast and faithful). Later, we can drop in a specialized decoder (a small model head) that directly outputs the summary or a structured table—still from tokens—so you get exactly the format you want.

How do you handle synonyms or phrasing differences?
The query step creates a few short variants (synonyms/aliases) and turns them into images. That makes matching robust, even without training.
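A minimal sketch of that expansion step, assuming a hand-written synonym table (a real deployment might instead ask a small LLM for variants); each returned string would then be rendered to a small image and tokenized like any page:

```python
# Hypothetical synonym table; illustrative only.
SYNONYMS = {
    "total due": ["amount due", "balance due"],
    "invoice": ["bill"],
}

def expand_query(query: str, limit: int = 5) -> list[str]:
    """Return the query plus a few short variants with synonyms swapped.

    Each variant is later rendered to a tiny text image and turned into
    visual tokens, so matching survives phrasing differences.
    """
    variants = [query]
    for term, alternatives in SYNONYMS.items():
        if term in query:
            variants += [query.replace(term, alt) for alt in alternatives]
    return variants[:limit]

print(expand_query("total due on the March invoice"))
```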


Roadmap (non-dev)

  • Now: Search by visual tokens → decode matched tokens → answer.
  • Soon:
    • Two-stage search for big libraries (quick coarse pass, then exact pass).
    • Token masks so we decode an even smaller set of tokens when pages are huge.
  • Later:
    • Task-specific decoders (e.g., “decode to summary”, “decode tables to CSV”, “decode only figures & captions”).
    • Drop-in, no changes to the search stage.

Why this matters

Documents are visual. Forcing them into plain text first is fragile and expensive. Token-Direct Visual RAG respects the page as a page: we find the answer visually, then read exactly what we found. That’s why it’s faster, cheaper, and more trustworthy—especially on the messy docs that break ordinary RAG.

Why this will feel different in production

  • Search happens before any heavy decoding: late-interaction over cached visual tokens is precise on small page regions (tables, stamps, math).
  • Decoding is targeted: you decode only the tokens that won retrieval, not whole pages. With DeepSeek’s compression, that slashes compute while keeping fidelity high.
  • Option to go “blazing”: If/when scale grows, drop in PLAID/FastPLAID (no training) for big retrieval-latency cuts, then rerank on full tokens.

https://github.com/bacoco/DeepSynth