Community Wishlist Survey 2021/Wikidata/Duplicates and merge candidates
Duplicates and merge candidates
- Problem: There is an increasing number of items that are empty or possible duplicates
- Who would benefit: Wikidata editors
- Proposed solution: Improve on prior art such as Projectmerge to detect duplicates not only by labels but also by comparing properties and links to other items; migrate the WD:DNM (do not merge) lists to something more usable (for example, as suggested on the discussion page, to P1889 statements).
- More comments:
- Phabricator tickets:
- Proposer: Sabas88 (talk) 12:38, 20 November 2020 (UTC)
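The proposed solution can be illustrated with a minimal sketch. It is not how Projectmerge or any existing tool works (no source code was located for them); it only assumes that each item can be reduced to a set of (property, value) claims and that pairs already recorded as distinct, for example through P1889 statements, are available as a set. The item IDs, claim values, function names and the 0.6 threshold are all hypothetical.

```python
from itertools import combinations

# Hypothetical in-memory items: item ID -> set of (property, value) claims.
# IDs and values are invented placeholders, not real Wikidata items.
ITEMS = {
    "Q_A": {("P31", "Q_village"), ("P17", "Q_country"), ("P131", "Q_region")},
    "Q_B": {("P31", "Q_village"), ("P17", "Q_country"), ("P131", "Q_region"),
            ("P1082", "350")},
    "Q_C": {("P31", "Q_human"), ("P27", "Q_country")},
}

# Pairs already recorded as distinct, e.g. migrated from a do-not-merge list
# into P1889 statements (assumed to be loaded elsewhere).
DIFFERENT_FROM = {frozenset({"Q_A", "Q_C"})}


def jaccard(a, b):
    """Share of claims two items have in common, from 0.0 to 1.0."""
    if not a and not b:
        return 0.0  # two empty items give no evidence either way
    return len(a & b) / len(a | b)


def merge_candidates(items, different_from, threshold=0.6):
    """Return (id1, id2, score) for pairs whose claim overlap meets the threshold.

    Pairs listed in different_from are skipped without being scored.
    """
    result = []
    for (id1, claims1), (id2, claims2) in combinations(sorted(items.items()), 2):
        if frozenset({id1, id2}) in different_from:
            continue
        score = jaccard(claims1, claims2)
        if score >= threshold:
            result.append((id1, id2, round(score, 2)))
    return result


if __name__ == "__main__":
    for pair in merge_candidates(ITEMS, DIFFERENT_FROM):
        print(pair)
```

Comparing every pair is quadratic in the number of items, so a real tool would first need to narrow the candidates (for instance by label or by type) before scoring them.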
Discussion
- Removed the Phabricator task as it's not relevant. --Matěj Suchánek (talk) 15:54, 20 November 2020 (UTC)
- @Sabas88: Thanks for your proposal. Is there code for the mentioned projects that we can take a look at? We'd like to have a better understanding of how the projects detect duplicates. Thanks again! Harumi Monroy 19:35, 23 November 2020 (UTC)
- Sorry I can't find it... Help:Merge has a list of tools but I didn't see a relevant git repository --Sabas88 (talk) 12:45, 25 November 2020 (UTC)
- A good idea. Improve on existing tools so that they can better predict whether two items are duplicates. Simplistic example: same name, different description, but both are populated places (or have a similar property: city, village) with very similar geographic locations (within 2 km of each other). --FocalPoint (talk) 05:58, 24 November 2020 (UTC)
- Or, if the names are not identical, perhaps by using a string metric and comparing properties. --Sabas88 (talk) 12:45, 25 November 2020 (UTC)
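The heuristic suggested in this discussion (similar name, same kind of place, locations within about 2 km) could look roughly like the sketch below. It is an illustration under stated assumptions, not an existing tool: the record layout (`label`, `kind`, `lat`, `lon`), the function names and the 0.85 name threshold are hypothetical, and `difflib.SequenceMatcher` from the Python standard library stands in for whichever string metric would actually be chosen.

```python
import math
from difflib import SequenceMatcher


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometres between two points given in degrees."""
    earth_radius_km = 6371.0
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    d_phi = phi2 - phi1
    d_lambda = math.radians(lon2 - lon1)
    a = (math.sin(d_phi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(d_lambda / 2) ** 2)
    return 2 * earth_radius_km * math.asin(math.sqrt(a))


def name_similarity(a, b):
    """Case-insensitive similarity ratio between two labels, from 0.0 to 1.0."""
    return SequenceMatcher(None, a.casefold(), b.casefold()).ratio()


def likely_duplicate(x, y, max_km=2.0, min_name=0.85):
    """True if two place records share a kind, lie within max_km, and have similar labels."""
    same_kind = x["kind"] == y["kind"]
    close = haversine_km(x["lat"], x["lon"], y["lat"], y["lon"]) <= max_km
    similar = name_similarity(x["label"], y["label"]) >= min_name
    return same_kind and close and similar


# Invented example records, not real Wikidata items.
a = {"label": "Sankt Anna", "kind": "village", "lat": 58.000, "lon": 16.000}
b = {"label": "Sankt-Anna", "kind": "village", "lat": 58.005, "lon": 16.010}
c = {"label": "Sankt Anna", "kind": "village", "lat": 59.000, "lon": 16.000}

if __name__ == "__main__":
    print(likely_duplicate(a, b))
    print(likely_duplicate(a, c))
```

A result of `True` here would only flag a pair for human review; places with the same name can legitimately exist close to one another, so the thresholds are a trade-off between missed duplicates and false candidates.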
Voting
- Support Movses (talk) 19:38, 8 December 2020 (UTC)
- Support マイキ (talk) 19:39, 8 December 2020 (UTC)
- Support Imz (talk) 20:13, 8 December 2020 (UTC)
- Support Ferdi2005[Mail] 20:44, 8 December 2020 (UTC)
- Support Mcampany (talk) 21:40, 8 December 2020 (UTC)
- Support YFdyh000 (talk) 22:28, 8 December 2020 (UTC)
- Support RXerself (talk) 23:53, 8 December 2020 (UTC)
- Support BALA. RTalk 01:44, 9 December 2020 (UTC)
- Support Tarnumg (talk) 02:12, 9 December 2020 (UTC)
- Support NMaia (talk) 03:09, 9 December 2020 (UTC)
- Support Chrisaliv (talk) 05:35, 9 December 2020 (UTC)
- Support. Example: many location-related duplicates from svwiki and cebwiki, with the actual source seemingly being Geonames. katpatuka (talk) 06:06, 9 December 2020 (UTC)
- Support Omda4wady (talk) 07:32, 9 December 2020 (UTC)
- Support Avron (talk) 07:36, 9 December 2020 (UTC)
- Support Kpjas (talk) 11:30, 9 December 2020 (UTC)
- Support Akela (talk) 12:57, 9 December 2020 (UTC)
- Support Delpha (talk) 13:07, 9 December 2020 (UTC)
- Support Bietels (talk) 14:13, 9 December 2020 (UTC)
- Support Nehaoua (talk) 16:45, 9 December 2020 (UTC)
- Support Петър Петров (talk) 17:31, 9 December 2020 (UTC)
- Support JAn Dudík (talk) 20:29, 9 December 2020 (UTC)
- Support - Darwin Ahoy! 02:02, 10 December 2020 (UTC)
- Support - yona B. (D) 08:23, 10 December 2020 (UTC)
- Support Susanna Ånäs (Susannaanas) (talk) 11:02, 10 December 2020 (UTC)
- Support Euro know (talk) 11:26, 10 December 2020 (UTC)
- Support Sasuke Sarutobi (talk) 23:34, 10 December 2020 (UTC)
- Support Higa4 (talk) 04:58, 11 December 2020 (UTC)
- Support Paucabot (talk) 12:12, 11 December 2020 (UTC)
- Support Watty62 (talk) 14:44, 11 December 2020 (UTC)
- Support Husky (talk) 16:13, 11 December 2020 (UTC)
- Support Bencemac (talk) 16:16, 11 December 2020 (UTC)
- Support Poslovitch (talk) 16:45, 11 December 2020 (UTC)
- Support Susanna Giaccai (talk) 16:50, 11 December 2020 (UTC)
- Support Theklan (talk) 18:20, 11 December 2020 (UTC)
- Support BoldLuis (talk) 18:27, 11 December 2020 (UTC)
- Support Francois-Pier (talk) 08:35, 12 December 2020 (UTC)
- Support Tom Ja (talk) 09:54, 12 December 2020 (UTC)
- Support Klaas `Z4␟` V: 15:00, 12 December 2020 (UTC)
- Neutral. I support the improvements to the merge tools, but for me the most needed change would be a merge tool that offers an "Unmerge" option, as I see a lot of bad merges made with easy-to-use merge tools by users with very little experience. This proposal does not mention any Unmerge options. --Jarekt (talk) 15:29, 12 December 2020 (UTC)
- Support. It could be useful, and could encourage the use of the "different from" property on similar items. Luca.favorido (talk) 19:42, 12 December 2020 (UTC)
- Support. Meiræ 22:00, 12 December 2020 (UTC)
- Support Gelli1742 (talk) 20:21, 13 December 2020 (UTC)
- Support C. crispus (talk) 07:46, 14 December 2020 (UTC)
- Support --Mosbatho (talk) 21:58, 14 December 2020 (UTC)
- Support Nurtenge (talk) 06:59, 15 December 2020 (UTC)
- Support — SMcCandlish ☺ ☏ ¢ >ʌⱷ҅ᴥⱷʌ< 08:42, 15 December 2020 (UTC)
- Support MTheiler (talk) 15:13, 15 December 2020 (UTC)
- Support Utopes (talk) 19:26, 15 December 2020 (UTC)
- Support TemboUngwe (talk) 15:28, 16 December 2020 (UTC)
- Support --Luan (discussão) 19:50, 16 December 2020 (UTC)
- Support F. Riedelio (talk) 10:25, 17 December 2020 (UTC)
- Support GiFontenelle (talk) 00:41, 18 December 2020 (UTC)
- Support Nashona (talk) 01:51, 19 December 2020 (UTC)
- Support Patsagorn Y. (Talk) 04:53, 19 December 2020 (UTC)
- Support Iva (talk) 16:03, 20 December 2020 (UTC)
- Support — Baidax 💬 17:15, 21 December 2020 (UTC)