this post was submitted on 19 May 2025
103 points (99.0% liked)

Europe

5923 readers
1930 users here now

News and information from Europe 🇪🇺

(Current banner: La Mancha, Spain. Feel free to post submissions for banner images.)

Rules (2024-08-30)

  1. This is an English-language community. Comments should be in English. Posts can link to non-English news sources when providing a full-text translation in the post description. Automated translations are fine, as long as they don't overly distort the content.
  2. No links to misinformation or commercial advertising. When you post outdated/historic articles, add the year of publication to the post title. Infographics must include a source and a year of creation; if possible, also provide a link to the source.
  3. Be kind to each other, and argue in good faith. Don't post direct insults nor disrespectful and condescending comments. Don't troll nor incite hatred. Don't look for novel argumentation strategies at Wikipedia's List of fallacies.
  4. No bigotry, sexism, racism, antisemitism, islamophobia, dehumanization of minorities, or glorification of National Socialism. We follow German law; don't question the statehood of Israel.
  5. Be the signal, not the noise: Strive to post insightful comments. Add "/s" when you're being sarcastic (and don't use it to break rule no. 3).
  6. If you link to paywalled information, please provide also a link to a freely available archived version. Alternatively, try to find a different source.
  7. Light-hearted content, memes, and posts about your European everyday belong in [email protected]. (They're cool, you should subscribe there too!)
  8. Don't evade bans. If we notice ban evasion, that will result in a permanent ban for all the accounts we can associate with you.
  9. No posts linking to speculative reporting about ongoing events with unclear backgrounds. Please wait at least 12 hours. (E.g., do not post breathless reporting on an ongoing terror attack.)
  10. Always provide context with posts: Don't post uncontextualized images or videos, and don't start discussions without giving some context first.

(This list may get expanded as necessary.)

Posts that link to the following sources will be removed

Unless they're the only sources, please also avoid The Sun, Daily Mail, any "thinktank" type organization, and non-Lemmy social media. Don't link to Twitter directly, instead use xcancel.com. For Reddit, use old:reddit:com

(Lists may get expanded as necessary.)

Ban lengths, etc.

We will use some leeway to decide whether to remove a comment.

If need be, there are also bans: 3 days for lighter offenses, 7 or 14 days for bigger offenses, and permanent bans for people who don't show any willingness to participate productively. If we think the ban reason is obvious, we may not specifically write to you.

If you want to protest a removal or ban, feel free to write privately to the primary mod account @[email protected]

founded 10 months ago
MODERATORS
 

After 2,5 years of intensive research and programming efforts, the entire Openwebsearch.eu project team is excited to grant access to its pilot of the first-ever federated pan-European Open Web Index (OWI).

From June onward, commercial and scientific development teams of any size as well as interested individuals are welcome to access and make use of almost a petabyte (and growing) of open web data under a general research license or – upon request – under a designated commercial license as well.

Given that the European Commission has launched the InvestAI initiative to mobilize €200 billion of investment in artificial intelligence, the Open Web Index comes with perfect timing.

The OpenWebSearch.eu consortium actively calls early adopters to pioneer innovative projects surrounding vertical web search, argumentative search, LLM applications including RAG and more.

“The OWI symbolizes a first step towards true European digital sovereignty and is a fundamental step in paving the way for a comprehensive open European AI landscape.“ says Community Manager Ursula Gmelch and further:

“Our goal behind this initial pilot phase is to onboard a range of projects from diverse domains to get early feedback in. We look forward to users confirming the quality and value in current functionalities and/or helping us pivot in such ways that real market demands can be met and further expanded upon.“

An official kick-off event will be hosted on 6 June from 10 am to 12 am CEST via Zoom.

Registration to the event is open under the following link:

https://cscfi.zoom.us/meeting/register/eATIpDQ5TZidh4Jzkim6FQ#/registration

[,,,]

you are viewing a single comment's thread
view the rest of the comments
[–] [email protected] 17 points 15 hours ago (1 children)

Is thus like google search but better and European or am i misreading?

[–] [email protected] 18 points 15 hours ago* (last edited 15 hours ago) (2 children)

In an nutshell, this is what I understand, too. It may take some time until it gets fully competitive but it could soon get a better alternative to the gatekeepers like Google imho.

Addition for a brief article I just found:

The EU’s Open Web Index Project: Another Step Toward Digital Independence

The Open Web Index (OWI) is an open-source initiative under the European Union’s Horizon Programme, aimed at democratizing web-search technologies and strengthening Europe’s digital sovereignty. The project will launch in June 2025, providing a common web index accessible to all and decoupling the indexing infrastructure from the search services that use it. In doing so, the OWI offers not only technical innovations but also a paradigm shift in the global search market—today, a single player (Google) holds over ninety percent of the market share and determines access to online information.

The project’s core idea is to make web crawling, metadata enrichment, and indexing a shared European resource. Development takes place in large data centres that process terabytes of raw data each day and publish the entire index as open data. All software components are open-source, and the CIFF format ensures that systems based on Lucene, Solr, or Terrier can connect to the OWI seamlessly. Thus, with minimal effort, researchers and developers can create vertical search engines that rank results according to specific criteria such as sustainability or privacy priorities [...]

[–] [email protected] 4 points 15 hours ago (1 children)

Hm sounds more like just a web scrape to me. I.e. someone else will need to build a search engine on top of this.

[–] tyler 11 points 14 hours ago

It’s an index, not a web scrape, though if you want to think about it that way then go for it. Indexes back all search engines.

[–] [email protected] 3 points 15 hours ago

Fingers crossed