Legobot is currently working through a dump and removing all local langlinks from pages whose links match Wikidata. A brief explanation of how the bot works is below:

  • The bot uses a specialized generator to get up-to-date page content from a dump and the API.
  • The bot finds the Wikidata item associated with the article.
    • If it does not exist, the bot currently logs it as an error. Functionality to create the missing item will be added soon.
  • For each langlink stored locally, the bot checks whether that language is stored on Wikidata. If there is no link, the bot will add it. If the language exists but links to a different page, the bot will report it as a conflict.
  • If all the links are good, the bot will remove all langlinks and save the page. The regular expressions the bot uses are the same ones that interwiki.py has used for years, so there should be no problems with them.
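
The comparison and removal steps above can be sketched roughly as follows. This is a minimal illustration and not the bot's actual code: the function names `compare_langlinks` and `strip_langlinks` are hypothetical, both sides are assumed to be plain dicts mapping a language code to a page title, and the pattern is a simplified stand-in rather than the interwiki.py regular expression.

```python
import re


def compare_langlinks(local_links, wikidata_links):
    """Classify local langlinks against the links stored on Wikidata.

    Both arguments are assumed to map a language code to a page title.
    Returns (missing, conflicts).
    """
    missing = {}    # languages Wikidata lacks; the bot would add these
    conflicts = {}  # languages where the two sides point at different pages
    for lang, title in local_links.items():
        if lang not in wikidata_links:
            missing[lang] = title
        elif wikidata_links[lang] != title:
            # Recorded as (local title, Wikidata title).
            conflicts[lang] = (title, wikidata_links[lang])
    return missing, conflicts


def strip_langlinks(text, langs):
    """Remove [[lang:Title]] links for the given language codes.

    Simplified pattern for illustration only; not the interwiki.py regex.
    """
    if not langs:
        return text
    codes = "|".join(re.escape(lang) for lang in sorted(langs))
    pattern = re.compile(r"\[\[(?:%s):[^\[\]\n]*\]\]\n?" % codes)
    return pattern.sub("", text)


def process_page(text, local_links, wikidata_links):
    """Return (new_text, missing, conflicts) for one page.

    The text is only stripped when no conflicts were found.
    """
    missing, conflicts = compare_langlinks(local_links, wikidata_links)
    if conflicts:
        return text, missing, conflicts
    return strip_langlinks(text, set(local_links)), missing, conflicts
```

A real implementation would also need to normalize titles before comparing them (underscores versus spaces, first-letter capitalization), and would add the missing links to Wikidata before saving the stripped page.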

Logs: