AIb2.io - AI Research Decoded

Your Morning Ran on Invisible Tech Rankings

At 7:12 a.m., your phone guessed your face, your maps app guessed traffic, your bank guessed whether that coffee purchase was fraud, and somewhere a warehouse robot guessed which shelf to raid next. Most of modern life now runs on little piles of prediction, which makes this new Nature story extra fun: researchers built an AI system on top of Wikipedia to figure out which technologies are picking up the most momentum before the rest of us start yelling about them on LinkedIn.

One quick clarification, because science publishing loves a plot twist. The PubMed entry here points to a Nature news article by James Mitchell Crow, not the underlying research paper itself. That article reports on Cosmos 1.0, a 2025 Scientific Data paper by Xian Gong, Paul McCarthy, Colin Griffith, Claire McFarland, and Marian-Andrei Rizoiu that maps 23,544 technology-adjacent entities from Wikipedia and then manually verifies an ET100 list of emerging technologies (Crow, 2026; Gong et al., 2025).

Your Morning Ran on Invisible Tech Rankings

Wikipedia, But Wearing a Lab Coat

So here is the thing: Wikipedia is chaotic, argumentative, occasionally petty, and weirdly useful. Which means it is also a pretty good place to watch technology evolve in public.

The Cosmos 1.0 team used Wikipedia2Vec, a model that turns Wikipedia articles into numerical vectors, basically giving each concept a position in semantic space. If two technologies tend to live in similar textual neighborhoods, their vectors sit closer together. That lets researchers cluster technologies, track how broadly they show up, estimate how "deep tech" they are, and compare public attention with signals from places like Google Scholar, OpenAlex, Google Books, and Crunchbase (Wikipedia2Vec docs; Gong et al., 2025).

If that sounds like asking Wikipedia to become a crystal ball, yes, a little. But not in the "mystical orb on a velvet pillow" sense. More in the "messy public knowledge graph with enough scale to reveal patterns humans miss" sense.

Crow reports that the resulting 2026 Momentum 100 list puts machine learning, blockchain databases, and 3D printing near the top, with soft robotics, augmented reality, and omics also ranking highly (Crow, 2026). That lineup feels believable, which is both reassuring and mildly annoying. Even the robots know blockchain refuses to leave the group chat.

Why This Is Clever Instead of Just Nerdy

Let me unpack that. A lot of technology forecasting still relies on expert panels, patent counts, publication trends, or some heroic consultant staring at dashboards until a buzzword appears. Those approaches are useful, but they can be slow, top-down, and biased toward fields that already have strong institutional visibility.

This paper tries a more bottom-up move. Instead of starting with a boardroom list of "important technologies," it starts with how technologies are described, linked, and discussed across Wikipedia. That matters because tech progress rarely arrives as a neat parade of isolated inventions. It shows up as convergence: AI mixing with biology, robotics mixing with materials science, sensing mixing with energy systems. The paper is basically saying, "Maybe we should stop sorting the future into separate filing cabinets."

That idea lines up with recent work on technology convergence and forecasting. Reviews in Technological Forecasting and Social Change argue that convergence is now a core driver of innovation strategy, not a side quest (Sick & Bröring, 2022). Newer studies use graph embeddings, link prediction, semantic networks, and patent analysis to forecast where technologies may head next (Wang & Li, 2024; Choi et al., 2023; Kim et al., 2022; Jang et al., 2025).

This is where it gets interesting. The Wikipedia-based method is not just asking, "What is hot?" It is asking, "What is connected, spreading, and showing signs of becoming broadly useful?" That is a much better question if you care about where money, talent, and policy attention should go.

The Useful Part, and the Slightly Chaotic Part

In the real world, this kind of map could help governments prioritize R&D, help investors separate signal from vibes, and help companies avoid spending three years discovering that the market moved on while they were still polishing their innovation deck. If you wanted to visualize those relationships for an internal strategy session, this is exactly the kind of problem where a tool like mapb2.io makes sense: people understand tech ecosystems faster when they can actually see the clusters instead of reading another bullet list with "synergy" in it.

But there are limits, and they matter. Cosmos 1.0 relies on English Wikipedia embeddings from 2018, which means very new concepts can be underrepresented. The authors say that explicitly, and it is not a tiny footnote. In AI years, 2018 is roughly 400 dog years. Also, Wikipedia reflects public knowledge and editorial culture, not reality itself. Some technologies get loads of attention before they become useful; others quietly do the actual work while flashier neighbors hog the spotlight.

There is also a nice irony here. Wikimedia said in April 2025 that Wikipedia sits "at the core of every AI training model," while public concern has grown that AI systems may siphon value from the volunteer-built knowledge base they depend on (Wikimedia Foundation, 2025). So yes, we are now using AI to mine Wikipedia to predict which technologies will matter most, while AI itself keeps eating Wikipedia for breakfast. Very normal ecosystem behavior. No notes.

And outside the lab, adoption is already real enough to justify the forecasting obsession. A Federal Reserve note published on April 3, 2026 estimated that about 18% of U.S. firms had adopted AI by the end of 2025, with much higher exposure when weighted by employment (Allen, 2026). In other words, the future is not standing politely in the hallway. It is already in the building, messing with the thermostat.

References

Crow, J. M. (2026). Wikipedia-based AI model reveals the 100 technologies to watch. Nature. https://doi.org/10.1038/d41586-026-00980-4

Gong, X., McCarthy, P. X., Griffith, C., McFarland, C., & Rizoiu, M.-A. (2025). Cosmos 1.0: a multidimensional map of the emerging technology frontier. Scientific Data, 12, 1837. https://www.nature.com/articles/s41597-025-06125-y

Sick, N., & Bröring, S. (2022). Exploring the research landscape of convergence from a TIM perspective: A review and research agenda. Technological Forecasting and Social Change, 175, 121321. https://doi.org/10.1016/j.techfore.2021.121321

Wang, L., & Li, M. (2024). An exploration method for technology forecasting that combines link prediction with graph embedding: A case study on blockchain. Technological Forecasting and Social Change, 208, 123736. https://doi.org/10.1016/j.techfore.2024.123736

Choi, S., Kim, J., & Yoon, B. (2023). Technology opportunity analysis using hierarchical semantic networks and dual link prediction. Technovation, 128, 102872. https://doi.org/10.1016/j.technovation.2023.102872

Kim, S., Zhang, H., & Thomson, R. (2022). Detecting emerging technologies and their evolution using deep learning and weak signal analysis. Journal of Informetrics, 16(3), 101344. https://doi.org/10.1016/j.joi.2022.101344

Jang, H., Kim, J., & Yoon, B. (2025). Developing Tech2Vec: A new embedding approach of technology information using a triple layer. Computers & Industrial Engineering, 205, 111163. https://doi.org/10.1016/j.cie.2025.111163

Allen, J. S. (2026). Monitoring AI Adoption in the U.S. Economy. FEDS Notes. https://doi.org/10.17016/2380-7172.4032

Wikipedia2Vec documentation. https://wikipedia2vec.github.io/wikipedia2vec/

Wikimedia Foundation. (2025, April 30). Our new AI strategy puts Wikipedia’s humans first. https://wikimediafoundation.org/news/2025/04/30/our-new-ai-strategy-puts-wikipedias-humans-first/

Disclaimer: This blog post is a simplified summary of published research for educational purposes. The accompanying illustration is artistic and does not depict actual model architectures, data, or experimental results. Always refer to the original paper for technical details.