🔧🚧

Sto smanettando!

Hey! I'm currently working on the site — updates, fixes, and various improvements. Things might not work perfectly (or at all 😅).

As long as you see this, I am working Once gone, the website is ready!

⚡ dev in progress...
live — loading languages...

⚙ Reading Settings

Choose a color scheme that works best for you. Changes apply instantly to your session.

Custom colors

pipeline: idle

What's missing?

A quantitative index of knowledge gaps between Wikipedias. Ranked by cultural relevance, not just global popularity.

Score = 0.4N + 0.3L + 0.2V + 0.1Q
Total Items
Max Sitelinks
Avg Sitelinks
Languages
1. Pick a source language (articles exist here) and a target language (articles are missing here).
2. Set the minimum languages filter — this shows only articles that already exist in at least that many Wikipedias. Higher = more notable gaps.
3. Browse the results. Click any title to open the source Wikipedia article directly. The Wikidata link below shows all language versions.
4. Be Patient, if results are not immediately available, they will be loaded in a few seconds.
5. The articles only toggle filters out non-article pages (categories, templates, portals, user pages, etc.) and shows only real encyclopedic content.
N = neighbor coverage (do nearby languages have it?) · L = global sitelinks · V = pageviews from target country · Q = source article quality
Top ranked gaps — EN → IT
loading...
# Article Score Langs
Loading...

FAQ

How did GapMap start? +
My name is Mike. One day I was reading an article on English Wikipedia and wanted to share it with a friend who doesn't read English. I figured I'd just translate it myself. Then I thought...how many times does this happen? How many important articles exist in one language but not another, and nobody even knows they're missing? That question became GapMap: a tool that systematically finds and ranks these knowledge gaps across every Wikipedia.
How does it work? +

Simple version

Every few days we download the full list of every Wikipedia article in every language from Wikidata. For each article, we count how many languages have it. If an article exists in 50 languages but not in yours, it's probably important and missing.

Technical version

The system ingests Wikidata's wb_items_per_site.sql.gz dump, which maps each item to its sitelinks across all wikis. Updated every ~3 days.

For each item i, we define its coverage set: C(i) = { w ∈ W : sitelink(i, w) ≠ ∅ } where W = the monitored Wikipedias. |C(i)| is our proxy for universal notability.

Given source wiki s and target wiki t, the gap set is: G(s,t) = { i : s ∈ C(i) ∧ t ∉ C(i) } Ranking is by |C(i)| descending with configurable threshold.

What does the score mean? +
The score ranks how "important" a missing article is. It combines neighbor coverage (do culturally nearby languages have it?), total sitelinks, pageviews from the target country, and source article quality. A score of 9+ means it's a critical gap.
Where does the data come from? +
All data comes from Wikidata, the structured database behind Wikipedia. We process the official dumps to build a complete index of which articles exist in which languages. Refreshed every 3 days. No AI, no guessing — just cross-referencing public databases.
Can I contribute? +
Yes! Find an article that's missing in your language, click the link to read it in the source language, then create it on your Wikipedia. You can also help improve GapMap itself — the project is open source.
Why do some results look wrong? +
Sometimes an article exists in the target Wikipedia under a different name but isn't linked on Wikidata. Other times, a topic is covered within a broader article. We're continuously improving filters to reduce false positives.
Is this affiliated with Wikipedia? +
No. GapMap is an independent project. It uses publicly available data from Wikidata under their open licenses, but it's not endorsed by or affiliated with the Wikimedia Foundation.