Difference between revisions of "Semantic MediaWiki"
(add image) |
|||
Line 3: | Line 3: | ||
This page exists to dive deeper into particulars. | This page exists to dive deeper into particulars. | ||
− | == SMWCon 2023 == | + | ==SMWCon 2023== |
The [https://www.semantic-mediawiki.org/wiki/SMWCon_Fall_2023 3-day program] was fantastic! | The [https://www.semantic-mediawiki.org/wiki/SMWCon_Fall_2023 3-day program] was fantastic! | ||
One major advancement was the fact that Bernard Krabina opened ties with [https://docs.opencollective.com/help/about/introduction Open Collective] so that individuals and organizations can [https://www.semantic-mediawiki.org/wiki/Sponsorship#Donating_money donate money] to the project. | One major advancement was the fact that Bernard Krabina opened ties with [https://docs.opencollective.com/help/about/introduction Open Collective] so that individuals and organizations can [https://www.semantic-mediawiki.org/wiki/Sponsorship#Donating_money donate money] to the project. | ||
− | === Task tracking === | + | ===Task tracking=== |
HalloWelt! combines four extensions they created to make useful task tracking in (Semantic) MediaWiki | HalloWelt! combines four extensions they created to make useful task tracking in (Semantic) MediaWiki | ||
− | * [[mw:Extension:SimpleTasks|Extension:SimpleTasks]] Tasks are checklist items that can be checked on or off to indicate if the task is open or completed. | + | |
− | * [[mw:Extension:Checklists|Extension:Checklists]] Make lists of checkboxes. | + | *[[mw:Extension:SimpleTasks|Extension:SimpleTasks]] Tasks are checklist items that can be checked on or off to indicate if the task is open or completed. |
− | * [[mw:Extension:DateTimeTools|Extension:DateTimeTools]] Provides date tools in VisualEditor. | + | *[[mw:Extension:Checklists|Extension:Checklists]] Make lists of checkboxes. |
− | * [[mw:Extension:AtMentions|Extension:AtMentions]] Easily mention a user, with notification | + | *[[mw:Extension:DateTimeTools|Extension:DateTimeTools]] Provides date tools in VisualEditor. |
+ | *[[mw:Extension:AtMentions|Extension:AtMentions]] Easily mention a user, with notification | ||
Miriam Schlindwein presented how it's possible to create tasks, assign them to someone, add due dates and how they can be controlled {{#ev:youtube|lYpi08dqBPs|||||t=13336}} | Miriam Schlindwein presented how it's possible to create tasks, assign them to someone, add due dates and how they can be controlled {{#ev:youtube|lYpi08dqBPs|||||t=13336}} | ||
− | === Realtime integrations with GitLab === | + | ===Realtime integrations with GitLab=== |
See [[GitLab operations]] | See [[GitLab operations]] | ||
− | === Fixing Wikidata === | + | ===Fixing Wikidata=== |
Yaron Koren gave a great presentation ([https://commons.wikimedia.org/wiki/File:Fixing_Wikidata_-_SMWCon_2023.pdf slides]) called '''[http://wikiworks.com/enhanced-wikibase.html Enhanced Wikibase]''' on how [[Wikibase]] (and therefore Wikidata) are missing features. He showed how he implemented these missing features in a series of developments. One is showcased at [https://wikidatawalkabout.org/ Wikidata Walkabout] - a drill-down and query interface to Wikibase sites; powered by [https://github.com/sahajsk21/Anvesha Anvesha] - a JavaScript library. | Yaron Koren gave a great presentation ([https://commons.wikimedia.org/wiki/File:Fixing_Wikidata_-_SMWCon_2023.pdf slides]) called '''[http://wikiworks.com/enhanced-wikibase.html Enhanced Wikibase]''' on how [[Wikibase]] (and therefore Wikidata) are missing features. He showed how he implemented these missing features in a series of developments. One is showcased at [https://wikidatawalkabout.org/ Wikidata Walkabout] - a drill-down and query interface to Wikibase sites; powered by [https://github.com/sahajsk21/Anvesha Anvesha] - a JavaScript library. | ||
− | === Natural Language Queries to Wikidata: A Naive Prototype === | + | ===Natural Language Queries to Wikidata: A Naive Prototype=== |
Robert Timms - Sr. Software Engineer Wikibase Suite, Wikimedia Deutschland gave [https://www.semantic-mediawiki.org/wiki/SMWCon_Fall_2023/Natural_Language_Queries_to_Wikidata:_A_Na%C3%AFve_Prototype a talk] ([https://github.com/rti/askwikidata code] [https://docs.google.com/presentation/d/1YgDmcvoXaqnYdRyX5RxewVkeioEJ92nb8Sfb_halBsM slides] [https://colab.research.google.com/drive/1yRZshpNj0kXwY0XuUYw5ziqjw_RffxH- try it]) about querying Wikibase with an LLM. Not the goal of the talk, but he revealed some of the key drawbacks of using "AI" in the first place: | Robert Timms - Sr. Software Engineer Wikibase Suite, Wikimedia Deutschland gave [https://www.semantic-mediawiki.org/wiki/SMWCon_Fall_2023/Natural_Language_Queries_to_Wikidata:_A_Na%C3%AFve_Prototype a talk] ([https://github.com/rti/askwikidata code] [https://docs.google.com/presentation/d/1YgDmcvoXaqnYdRyX5RxewVkeioEJ92nb8Sfb_halBsM slides] [https://colab.research.google.com/drive/1yRZshpNj0kXwY0XuUYw5ziqjw_RffxH- try it]) about querying Wikibase with an LLM. Not the goal of the talk, but he revealed some of the key drawbacks of using "AI" in the first place: | ||
− | # Outdated information | + | |
− | # Prone to hallucinations | + | #[[File:Architecture - Ask Wikidata SMWCon 2023.png|alt=Application architecture|thumb|architecture]]Outdated information |
− | # No sources (AI doesn't tell you how or why it claims to be authoritative.) | + | #Prone to hallucinations |
+ | #No sources (AI doesn't tell you how or why it claims to be authoritative.) | ||
+ | |||
This is supposed to be addressed in part by using the <abbr title="Retrieval-Augmented Generation">RAG</abbr> technique. | This is supposed to be addressed in part by using the <abbr title="Retrieval-Augmented Generation">RAG</abbr> technique. | ||
Revision as of 21:39, 20 December 2023
Semantic MediaWiki is one of the largest, and most complex extensions to MediaWiki - and also an indespensible one for enterprise use. The features it provides are partly described on the Metadata page.
This page exists to dive deeper into particulars.
Contents
SMWCon 2023[edit | edit source]
The 3-day program was fantastic!
One major advancement was the fact that Bernard Krabina opened ties with Open Collective so that individuals and organizations can donate money to the project.
Task tracking[edit | edit source]
HalloWelt! combines four extensions they created to make useful task tracking in (Semantic) MediaWiki
- Extension:SimpleTasks Tasks are checklist items that can be checked on or off to indicate if the task is open or completed.
- Extension:Checklists Make lists of checkboxes.
- Extension:DateTimeTools Provides date tools in VisualEditor.
- Extension:AtMentions Easily mention a user, with notification
Miriam Schlindwein presented how it's possible to create tasks, assign them to someone, add due dates and how they can be controlled
Realtime integrations with GitLab[edit | edit source]
Fixing Wikidata[edit | edit source]
Yaron Koren gave a great presentation (slides) called Enhanced Wikibase on how Wikibase (and therefore Wikidata) are missing features. He showed how he implemented these missing features in a series of developments. One is showcased at Wikidata Walkabout - a drill-down and query interface to Wikibase sites; powered by Anvesha - a JavaScript library.
Natural Language Queries to Wikidata: A Naive Prototype[edit | edit source]
Robert Timms - Sr. Software Engineer Wikibase Suite, Wikimedia Deutschland gave a talk (code slides try it) about querying Wikibase with an LLM. Not the goal of the talk, but he revealed some of the key drawbacks of using "AI" in the first place:
- Outdated information
- Prone to hallucinations
- No sources (AI doesn't tell you how or why it claims to be authoritative.)
This is supposed to be addressed in part by using the RAG technique.
The 'gpt' in ChatGPT stands for "Generative Pre-trained Transformer" - or a fancy way to say "guess". The artificial intelligence of large language model GPTs guess what you would say next based on the prompt given and the dataset they are trained on. In OpenAI's own words: "Generative AI models formulate responses by matching patterns or words, while RAG systems retrieve data based on similarity of meaning or semantic searches." |