Duplicate toolinfo records are being noticed more in Toolhub than they were in Hay's Directory. A few examples are:
- https://toolhub.wikimedia.org/tools/whois-gateway & https://toolhub.wikimedia.org/tools/toolforge-whois
- https://toolhub.wikimedia.org/tools/bullseye & https://toolhub.wikimedia.org/tools/toolforge-bullseye
- https://toolhub.wikimedia.org/tools/wpcleaner & https://toolhub.wikimedia.org/tools/toolforge-wpcleaner
Duplicates can appear for several reasons. Some arise when Toolforge publishes a toolinfo record for a tool that has also published its own toolinfo.json file (both records are crawler managed). Others pair a crawled toolinfo.json record (likely from Toolforge or from a tool that scrapes an on-wiki listing) with a toolinfo record created directly in Toolhub. It is also possible for both duplicates to have been created directly in Toolhub.
Having multiple records describing the same tool is not ideal. It is highly unlikely that the duplicates will have identical content, and users will find it difficult to determine which record is more correct.
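
As a rough illustration of the first kind of duplicate, the sketch below groups record names that collide once a `toolforge-` prefix is removed. The prefix convention is only an assumption inferred from the example URLs above, and `candidate_duplicates` is a hypothetical helper, not part of Toolhub. It works on a plain list of names rather than calling any Toolhub API.

```python
from collections import defaultdict

# Assumption: Toolforge-generated records use this name prefix,
# as the example URLs in this task suggest.
TOOLFORGE_PREFIX = "toolforge-"


def candidate_duplicates(names):
    """Group record names that are equal once the assumed prefix is stripped."""
    groups = defaultdict(list)
    for name in names:
        if name.startswith(TOOLFORGE_PREFIX):
            key = name[len(TOOLFORGE_PREFIX):]
        else:
            key = name
        groups[key].append(name)
    # Keep only keys shared by more than one record.
    return {key: sorted(group) for key, group in groups.items() if len(group) > 1}


# Record names taken from the example URLs in this task.
names = [
    "whois-gateway", "toolforge-whois",
    "bullseye", "toolforge-bullseye",
    "wpcleaner", "toolforge-wpcleaner",
]
print(candidate_duplicates(names))
```

With these names the heuristic pairs the `bullseye` and `wpcleaner` records but misses `whois-gateway` / `toolforge-whois`, so name matching alone would not be enough to find every duplicate.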