Difference between revisions of "Semantic MediaWiki"
m (add special character) |
|||
Line 24: | Line 24: | ||
Yaron Koren gave a great presentation ([https://commons.wikimedia.org/wiki/File:Fixing_Wikidata_-_SMWCon_2023.pdf slides]) called '''[http://wikiworks.com/enhanced-wikibase.html Enhanced Wikibase]''' on how [[Wikibase]] (and therefore Wikidata) are missing features. He showed how he implemented these missing features in a series of developments. One is showcased at [https://wikidatawalkabout.org/ Wikidata Walkabout] - a drill-down and query interface to Wikibase sites; powered by [https://github.com/sahajsk21/Anvesha Anvesha] - a JavaScript library. [https://www.youtube.com/live/lYpi08dqBPs?si=ci0swXD2e-e7qCwy&t=19983 Video presentation] | Yaron Koren gave a great presentation ([https://commons.wikimedia.org/wiki/File:Fixing_Wikidata_-_SMWCon_2023.pdf slides]) called '''[http://wikiworks.com/enhanced-wikibase.html Enhanced Wikibase]''' on how [[Wikibase]] (and therefore Wikidata) are missing features. He showed how he implemented these missing features in a series of developments. One is showcased at [https://wikidatawalkabout.org/ Wikidata Walkabout] - a drill-down and query interface to Wikibase sites; powered by [https://github.com/sahajsk21/Anvesha Anvesha] - a JavaScript library. [https://www.youtube.com/live/lYpi08dqBPs?si=ci0swXD2e-e7qCwy&t=19983 Video presentation] | ||
− | ===Natural Language Queries to Wikidata: A | + | ===Natural Language Queries to Wikidata: A Naïve Prototype=== |
[[File:Architecture - Ask Wikidata SMWCon 2023.png|alt=Application architecture|thumb|architecture]] Robert Timms - Sr. Software Engineer Wikibase Suite, Wikimedia Deutschland gave [https://www.semantic-mediawiki.org/wiki/SMWCon_Fall_2023/Natural_Language_Queries_to_Wikidata:_A_Na%C3%AFve_Prototype a talk] ([https://github.com/rti/askwikidata code] [https://docs.google.com/presentation/d/1YgDmcvoXaqnYdRyX5RxewVkeioEJ92nb8Sfb_halBsM slides] [https://colab.research.google.com/drive/1yRZshpNj0kXwY0XuUYw5ziqjw_RffxH- try it]) about querying Wikibase with an LLM. Slides 9-22 go from the application architecture to the 'tada' moment. | [[File:Architecture - Ask Wikidata SMWCon 2023.png|alt=Application architecture|thumb|architecture]] Robert Timms - Sr. Software Engineer Wikibase Suite, Wikimedia Deutschland gave [https://www.semantic-mediawiki.org/wiki/SMWCon_Fall_2023/Natural_Language_Queries_to_Wikidata:_A_Na%C3%AFve_Prototype a talk] ([https://github.com/rti/askwikidata code] [https://docs.google.com/presentation/d/1YgDmcvoXaqnYdRyX5RxewVkeioEJ92nb8Sfb_halBsM slides] [https://colab.research.google.com/drive/1yRZshpNj0kXwY0XuUYw5ziqjw_RffxH- try it]) about querying Wikibase with an LLM. Slides 9-22 go from the application architecture to the 'tada' moment. | ||
Line 38: | Line 38: | ||
{{Notice|The 'gpt' in ChatGPT stands for "Generative Pre-trained Transformer" - or a fancy way to say "guess". The '''artificial''' intelligence of large language model GPTs '''guess''' what you would say next based on the prompt given and the dataset they are trained on. In OpenAI's own words: "Generative AI models formulate responses by matching patterns or words, while RAG systems retrieve data based on similarity of meaning or semantic searches."}} | {{Notice|The 'gpt' in ChatGPT stands for "Generative Pre-trained Transformer" - or a fancy way to say "guess". The '''artificial''' intelligence of large language model GPTs '''guess''' what you would say next based on the prompt given and the dataset they are trained on. In OpenAI's own words: "Generative AI models formulate responses by matching patterns or words, while RAG systems retrieve data based on similarity of meaning or semantic searches."}} | ||
− | === Major changes on interfaces of MediaWiki RDBMS library === | + | ===Major changes on interfaces of MediaWiki RDBMS library=== |
https://www.mediawiki.org/wiki/Manual:Database_access | https://www.mediawiki.org/wiki/Manual:Database_access | ||
[[Category:MediaWiki]] | [[Category:MediaWiki]] | ||
[[Category:Enterprise MediaWiki]] | [[Category:Enterprise MediaWiki]] |
Revision as of 09:10, 21 December 2023
Semantic MediaWiki is one of the largest, and most complex extensions to MediaWiki - and also an indespensible one for enterprise use. The features it provides are partly described on the Metadata page.
This page exists to dive deeper into particulars.
Contents
SMWCon 2023[edit | edit source]
The 3-day program was fantastic!
One major advancement was the fact that Bernard Krabina opened ties with Open Collective so that individuals and organizations can donate money to the project.
Task tracking[edit | edit source]
HalloWelt! combines four extensions they created to make useful task tracking in (Semantic) MediaWiki
- Extension:SimpleTasks Tasks are checklist items that can be checked on or off to indicate if the task is open or completed.
- Extension:Checklists Make lists of checkboxes.
- Extension:DateTimeTools Provides date tools in VisualEditor.
- Extension:AtMentions Easily mention a user, with notification
Miriam Schlindwein presented how it's possible to create tasks, assign them to someone, add due dates and how they can be controlled
Realtime integrations with GitLab[edit | edit source]
Fixing Wikidata[edit | edit source]
Yaron Koren gave a great presentation (slides) called Enhanced Wikibase on how Wikibase (and therefore Wikidata) are missing features. He showed how he implemented these missing features in a series of developments. One is showcased at Wikidata Walkabout - a drill-down and query interface to Wikibase sites; powered by Anvesha - a JavaScript library. Video presentation
Natural Language Queries to Wikidata: A Naïve Prototype[edit | edit source]
Robert Timms - Sr. Software Engineer Wikibase Suite, Wikimedia Deutschland gave a talk (code slides try it) about querying Wikibase with an LLM. Slides 9-22 go from the application architecture to the 'tada' moment.
Not the goal of the talk, but he revealed some of the key drawbacks of using "AI" in the first place:
- Outdated information
- Prone to hallucinations
- No sources (AI doesn't tell you how or why it claims to be authoritative.)
This is supposed to be addressed in part by using the RAG technique.
The 'gpt' in ChatGPT stands for "Generative Pre-trained Transformer" - or a fancy way to say "guess". The artificial intelligence of large language model GPTs guess what you would say next based on the prompt given and the dataset they are trained on. In OpenAI's own words: "Generative AI models formulate responses by matching patterns or words, while RAG systems retrieve data based on similarity of meaning or semantic searches." |