Gerwoman
Welcome to Wikidata, Gerwoman!
Wikidata is a free knowledge base that you can edit! It can be read and edited by humans and machines alike and you can go to any item page now and add to this ever-growing database!
Need some help getting started? Here are some pages you can familiarize yourself with:
- Introduction – An introduction to the project.
- Wikidata tours – Interactive tutorials to show you how Wikidata works.
- Community portal – The portal for community members.
- User options – including the 'Babel' extension, to set your language preferences.
- Contents – The main help page for editing and using the site.
- Project chat – Discussions about the project.
- Tools – A collection of user-developed tools to allow for easier completion of some tasks.
Please remember to sign your messages on talk pages by typing four tildes (~~~~); this will automatically insert your username and the date.
If you have any questions, don't hesitate to ask on Project chat. If you want to try out editing, you can use the sandbox to try. Once again, welcome, and I hope you quickly feel comfortable here, and become an active editor for Wikidata.
Best regards! Liuxinyu970226 (talk) 12:47, 19 March 2015 (UTC)
re.
editSobre los bots no sabría decirte, creo que no se pueden fusilar el VIAF de arriba a abajo por no ser una base de datos en dominio público, pero no me hagas mucho caso. A veces pasan y añaden algo pero no sabría decirte a cuenta de qué, probablemente como mínimo para añadirse algo vía bot desde viaf deba estar el Q de Wikidata validado allí como autoridad primero. No confiaría mucho en sus corridas, al menos a corto-medio plazo.
En cuanto a añadir códigos más rápido... sí, existe un script en Wikidata para acelerar el proceso (entiendo que aunque le chupes todo un "cluster" de identificadores a VIAF ...a efectos prácticos no se considera plagiar una base de datos si vas de persona en persona de forma semimanual, ¡"supongo", eh!, ¡lo mismo nos terminan encarcelando a todos! :-). Algunos días el chisme está sobrecargado y no funciona pero pelillos a la mar. Para activarlo, pues en tu common.js de aquí añades importScript( 'User:Magnus Manske/authority control.js' );
. Pan comío para ti. Si de primera no se carga reinicia el explorador, borra la caché,... esas historias, yatúsabes. Su funcionamiento es ...a prueba de tontos, sin ningún misterio. El elemento en cuestión debe tener etiqueta en tu idioma predeterminado para que funcione, eso sí. Aun así, si tienes dudas, encantado de ayudar. Prefiero que cosas de Wikidata me preguntes por estos lares, no te preocupes que me entero. Un saludo. Strakhov (talk) 15:58, 3 June 2016 (UTC)
- Intenta probar con otro navegador. En teoría se te abre una ventana flotante con numerosos numeritos al hacer clic allí (tienes que estar en la página de un Q). Mientras esté gris está trabajando y hay que esperar (por si logras arrancarlo). Si no pasa nada cliqueando... no sabría decirte el porqué. Puedes preguntar a Magnus. :) He mirado preferencias y hay una cosa de authority control. La tengo 'en check'. No sé si tendrá que ver. Strakhov (talk) 17:37, 3 June 2016 (UTC)
- Si es error o no... no lo sé. No soy monjólogo. :( Esto es un base de datos... y que Peiró y Pasamar la denominan monja... es un dato... Wikidata:Glossary#Claims_and_statements says that Deprecated rank is used for a statement that contains information that may not be considered reliable or that is known to include errors. Strakhov (talk) 17:35, 10 July 2016 (UTC) pd. el resumen de edición no tiene nada que ver contigo, ha saltado random
RANM
editLa Royal National Academy of Medicine (Q2134665) tiene identificadores numéricos interesantes para sus académicos. Funciona la url http://www.ranm.es/academicos/$1
quitándole la parte de académicos-de-numero y el-nombre-del-señor y quedándose uno solo con la primera cifra. Funciona con los cuatro tipos que hay (numerarios, de honor, correspondientes y numerarios anteriores, pero con honor y correspondientes se come el nombre del señor la visualización). Como te vi proponer una propiedad para unos flamencos raros y exóticos, quizás te interese pedir propiedad para unos héroes patrios, aunque no seas especialmente patrioter. Un saludo. Strakhov (talk) 20:19, 14 April 2017 (UTC)
Pings
editSi no firmas después creo que no suena. Strakhov (talk) 19:30, 19 October 2017 (UTC)
- Creo que hay un bot que elimina duplicados. Saludos. strakhov (talk) 18:34, 5 November 2017 (UTC)
PAS import
editHi, I tried to add more to your PAS catalog. However, it appears there is a problem with IDs/URLs? For example, the entry Nicola DALLAPORTA has the ID and URL for Giorgio Dal Piaz. --Magnus Manske (talk) 13:45, 18 December 2017 (UTC)
Dialnet
editHi, we now have three Mix'n'match catalogs with Dialnet author ID (P1607) entries... --Magnus Manske (talk) 08:53, 11 January 2018 (UTC)
Duplicate catalog
editHi, you created #925 for KVAB member ID (P3887), but that one already had #359. It's even linked form the property. Any particular reason? --Magnus Manske (talk) 09:07, 22 January 2018 (UTC)
P1047
editSo Catholic Hierarchy person ID (P1047) now has three Mix'n'match catalogs... Ideas? --Magnus Manske (talk) 11:04, 27 February 2018 (UTC)
P4931
editCould you add these IDs?
QID,P4931 Q909,842 Q93227,1974 Q169341,2723 Q172505,26 Q174210,767 Q185592,601 Q188798,1225 Q188848,1205 Q203413,1725 Q215869,214 Q229613,116 Q236026,161 Q266305,37 Q274585,156 Q276510,896 Q284017,1036 Q311389,1096 Q312610,2496 Q330258,14 Q333632,1785 Q345446,1942 Q360386,22 Q361762,136 Q366890,3441 Q369514,24 Q380673,28 Q381843,32 Q382570,1076 Q386391,12 Q399318,1053 Q425932,2210 Q441365,151 Q445934,1738 Q449638,2858 Q453460,1111 Q456827,1350 Q461770,1348 Q463977,2076 Q468806,759 Q469467,619 Q472866,311 Q508497,1084 Q512104,13 Q512952,1488 Q517143,19 Q526424,9 Q533331,928 Q542000,114 Q542573,2474 Q553782,2507 Q554470,707 Q558709,159 Q566338,3535 Q590492,1731 Q605250,255 Q611314,980 Q615130,3267 Q629788,645 Q656025,101 Q680400,1940 Q705639,4478 Q707999,2043 Q709413,474 Q727090,2621 Q743869,650 Q746951,1644 Q751501,113 Q765871,41 Q769719,3350 Q788307,1901 Q856896,3484 Q876748,350 Q912066,1633 Q912564,1505 Q919004,1565 Q919418,166 Q926422,62 Q929828,72 Q937502,56 Q938425,655 Q940617,48 Q954681,2588 Q956073,1726 Q958724,3351 Q961151,689 Q974754,104 Q997768,15 Q1042863,17 Q1127112,3439 Q1184502,1237 Q1209692,177 Q1261023,2433 Q1282487,390 Q1287147,300 Q1352007,2178 Q1356734,3440 Q1364604,20 Q1373194,1772 Q1393468,462 Q1394516,3877 Q1420762,944 Q1625070,602 Q1709118,663 Q1709289,958 Q1709455,803 Q1709632,265 Q1712411,174 Q1713156,27 Q1729145,1086 Q1749735,2002 Q1791454,2218 Q1819994,2550 Q1876350,2203 Q1876372,795 Q1876535,1193 Q1891660,558 Q1932618,8 Q1934739,163 Q2134419,777 Q2255746,1799 Q2327185,1954 Q2343131,35 Q2369852,1352 Q2429939,2606 Q2467902,18 Q2590062,73 Q2605806,2659 Q2622356,1208 Q2639364,155 Q2641297,2408 Q2646308,791 Q2821510,1301 Q2825073,720 Q2839072,256 Q2865964,453 Q2877981,613 Q2880299,1581 Q2887955,87 Q2893480,1510 Q2917655,157 Q3049517,169 Q3099006,222 Q3142015,1690 Q3173178,514 Q3268448,33 Q3270457,3762 Q3273108,4 Q3311629,47 Q3322193,153 Q3324926,2460 Q3384550,2286 Q3390376,43 Q3391192,11 Q3391862,38 Q3466333,2270 Q3469069,883 Q3622555,865 Q3719848,1008 Q3814067,498 Q4383464,107 Q4396048,745 Q4444192,2317 Q4694654,5 Q4712225,274 Q4722026,1458 Q4762125,847 Q5015998,662 Q5041668,824 Q5211758,3812 Q5292181,137 Q5368094,131 Q5379748,242 Q5379752,66 Q5379818,327 Q5403971,575 Q5405271,641 Q5461883,923 Q5484113,294 Q5557433,21 Q5611669,40 Q5657691,2343 Q5658195,2289 Q5659468,943 Q5662463,1927 Q5663077,2930 Q5663265,165 Q5663437,812 Q5663585,757 Q5664161,451 Q5675662,208 Q5696611,1 Q5698985,342 Q5701974,1085 Q5703706,1316 Q5707198,92 Q5715335,2733 Q5725121,1375 Q5725446,1662 Q5750346,1131 Q5750833,331 Q5752234,768 Q5752725,816 Q5760095,44 Q5765069,57 Q5772932,1754 Q5791804,1767 Q5796452,2323 Q5812772,112 Q5827342,2000 Q5830026,1575 Q5831002,666 Q5833353,2211 Q5865795,45 Q5867129,93 Q5878335,1153 Q5888322,54 Q5890340,897 Q5890355,42 Q5904569,1122 Q5906282,815 Q5906334,991 Q5906442,70 Q5935026,1289 Q5936235,781 Q5939141,125 Q5942542,3 Q5942795,39 Q5947576,2827 Q5947805,135 Q5951984,508 Q5973442,315 Q5974548,1163 Q5976569,341 Q5992263,152 Q5992540,322 Q6004867,1548 Q6039616,1031 Q6042411,252 Q6053736,1420 Q6053851,1595 Q6054092,23 Q6054311,1095 Q6054329,608 Q6054332,713 Q6097922,30 Q6105692,3310 Q6109748,187 Q6109778,106 Q6109988,172 Q6110588,436 Q6110617,3175 Q6112331,693 Q6116524,1387 Q6118161,501 Q6138666,301 Q6147534,4092 Q6155173,2988 Q6160191,418 Q6165676,3433 Q6173547,175 Q6204099,31 Q6291739,6 Q6299284,762 Q6299296,849 Q6299420,1025 Q6301602,1776 Q6553424,1686 Q6700442,419 Q6721539,279 Q6743080,848 Q6752465,7 Q6752853,330 Q7368989,840 Q7411091,10 Q7554439,148 Q7934392,1869 Q8193645,176 Q8204606,51 Q8210913,164 Q8345324,987 Q8776210,16 Q9006205,337 Q9010155,325 Q9013693,435 Q9015626,25 Q9020828,3571 Q9025786,2919 Q9066610,851 Q9070817,167 Q9076632,188 Q10268281,857 Q10299145,658 Q10308991,378 Q10945766,2313 Q10955965,295 Q10957611,688 Q11291292,674 Q11332299,297 Q11689767,127 Q11779307,2905 Q11906094,96 Q12737717,860 Q15062210,1654 Q15431277,76 Q15701884,1773 Q15919237,1929 Q15966206,150 Q16009530,422 Q16145077,158 Q16145521,129 Q16334672,154 Q16483046,871 Q16488276,2544 Q16488449,414 Q16488789,160 Q16490320,1381 Q16492760,705 Q16493082,1443 Q16494506,95 Q16532432,4312 Q16562292,521 Q16568311,173 Q16570266,1165 Q16582095,1534 Q16582321,1844 Q16583795,100 Q16584433,1281 Q16584447,289 Q16591740,1879 Q16601082,1045 Q16601231,122 Q16614481,1006 Q16629284,266 Q16647709,3122 Q16941993,231 Q16942149,566 Q16942241,190 Q17364440,1771 Q17462716,686 Q17489515,2296 Q17612335,358 Q17619728,667 Q17626769,953 Q17626896,684 Q18204569,248 Q18419948,382 Q18608504,1496 Q18608507,638 Q18608510,3741 Q18923999,138 Q19060971,245 Q19521967,97 Q19801588,606 Q19999892,596 Q20005334,308 Q20008364,780 Q20013112,1342 Q20013746,77 Q20014468,1858 Q20014664,438 Q20015231,83 Q20015514,36 Q20015551,1231 Q20016614,50 Q20016624,2826 Q20016629,52 Q20016645,111 Q20925405,4506 Q21483972,445 Q22986619,298 Q23023958,470 Q23023961,89 Q23498431,275 Q23682503,376 Q23705196,46 Q23705227,676 Q23705232,65 Q23705243,702 Q23705279,695 Q23705298,1119 Q23705314,911 Q23821144,2923 Q23901628,741 Q23936724,1695 Q23939808,1438 Q23939834,515 Q23942815,90 Q23942828,139 Q23942832,220 Q23942860,292 Q23942873,629 Q24287683,1542 Q24567753,29 Q24713672,115 Q24846624,709 Q25039609,1866 Q25407824,1285 Q25411028,132 Q25411035,49 Q26720177,272 Q26970942,80 Q26970945,141 Q26970970,375 Q26996139,1192 Q27118919,890 Q27119712,1018 Q27575270,59 Q27893855,850 Q28028519,320 Q28028521,179 Q28028573,1449 Q28028587,1295 Q28057709,1653 Q28121995,1276 Q28358617,383 Q28503864,1577 Q28663012,1158 Q28816027,1707 Q28933294,1770 Q29419517,891 Q29419524,178 Q29419547,753 Q29419602,752 Q29419603,1389 Q29647594,1582 Q29915781,546 Q30118651,639 Q30182977,1398 Q30299887,171 Q30464993,380 Q30916526,3618 Q32803380,1049 Q33027002,34 Q33120162,448 Q33120547,425 Q33728804,2418 Q34051966,726 Q34618587,703 Q34818466,1291 Q34821496,293 Q35507464,2891 Q35533344,303 Q35567409,1290 Q35592587,704 Q35606226,2173 Q35661871,722 Q35829275,532 Q35831527,232 Q36493832,60 Q38688932,370 Q39081793,2717 Q40877448,1347 Q40878117,310 Q40879176,792 Q40880704,1665 Q40881109,1887 Q40884842,1605 Q40886025,2197 Q40887489,1353 Q42382402,309 Q42382825,197 Q42383038,78 Q42383044,191 Q42383730,98 Q42384216,853 Q42395322,439 Q42404794,281 Q42727696,283 Q43853741,307 Q45356702,108 Q45901003,672 Q47012645,126 Q47015146,586 Q47015335,568 Q47051953,1050 Q48088624,1876 Q48340396,4347 Q48888366,121 Q50387429,55 Q50387795,184 77.180.88.13 00:05, 11 March 2018 (UTC)
PRS Legislative Research MP ID
editHi, Thanks for creating mixnmatch catalogue for this id, but although I am manually matching them, it is not being updated in Wikidata, see for example. Please have a look whats wrong. Thanks, -- Bodhisattwa (talk) 11:43, 22 March 2018 (UTC)
Rate Beer IDs
editHi, I wonder whether you could kindly apply your web-scraper magic to the sub pages of https://www.ratebeer.com/breweries/ for RateBeer brewery ID (P2905) and load them into Mix'n'Match? Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 11:35, 2 April 2018 (UTC)
- Thanks for setting up the Mix'n'Match for RateBeer brewery IDs! I've noticed a small issue though; It looks like the generated URL contains a trailing
/
which goes against the format as a regular expression (P1793) for this property:[a-z0-9-]+/\d+
.
- Thanks for setting up the Mix'n'Match for RateBeer brewery IDs! I've noticed a small issue though; It looks like the generated URL contains a trailing
- Would it be possible to change the output to remove the trailing slash so that, for example, Starobrno Brewery (Q12221): starobrno-brewery-heineken/962/ would become starobrno-brewery-heineken/962? I think that I could manage to automatically fix the constraint (format) violations if you don't already have an easy way of doing that as well. Cheers in advance! Aluxosm (talk) 23:17, 3 August 2020 (UTC)
Researchmap
editThanks for setting up the mixnmatch for Researchmap IDs. I only found yours after I had just tried to do the same thing here. I'm interested in a couple of things. Firstly, which page did you scrape? I couldn't find one, so made it iterate over all ids of the form "read0#######", which seemed inefficient, but I couldn't find any better list. Secondly, when I approve matches on your mixnmatch, they don't seem to automatically insert the identifier into the item I am matching. Am I doing something wrong, or are your identifiers not synched with the property itself? --99of9 (talk) 03:13, 12 April 2018 (UTC)
Wikidata:Requests for comment/Sort identifier statements on items that are instances of human
editYou seem to be involved with external identifiers and might be interested in Wikidata:Requests for comment/Sort identifier statements on items that are instances of human. 77.179.112.1 10:01, 3 May 2018 (UTC)
Mix'n'Match for Norwegian war sailor register ship-ID
editGood evening! Thank you very much for setting up the Mix'n'Match for Norwegian war sailor register ship-ID. Do you think it is possible to either add further possibilities for searching the name of a vessel, oralternativly an easier searc on wikidata for this property. This because unfortunately a vessels article name (item) will be spelled in different ways dependig on language wiki. as an example Bergensfjord will have the hit Bergensfjord (Q16296194) FIN as this is the only exactly match for M/S Bergensfjord. in Other languages DE: Bergensfjord (Schiff, 1913), SS BergensfjordEN, and not at least DS «Bergensfjord» (1913) NO. I have no knowlegde of how to set up Mix'n'Match so I do not know if my oberservation here is correct. Breg Pmt (talk) 20:41, 18 May 2018 (UTC)
I now see that I also can use "search no.wikipedia" Breg Pmt (talk) 22:14, 18 May 2018 (UTC)
Armenian ID properties
editHi,
Thanks for creating Mix'n'Match catalogues 1254 & 1255. However, I think we may get more automated matches if they are re-run, using Armenian names from the sources, and matching against Armenian Wikidata labels, which seem to be more complete. This may be more useful to our Armenian colleagues, for manual matching, too. Could you have a look at whether it's possible to do this - or use both - please? Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 10:30, 29 May 2018 (UTC)
- Hi Andy. Check this catalog. Unfortunately I cannot read Armenian. --Gerwoman (talk) 18:32, 29 May 2018 (UTC)
- Thank you. Me neither, but I have plenty of Armenian friends who can ;-) Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 18:46, 29 May 2018 (UTC)
Hi Gerwoman, thanks for creating Mix'n'Match catalogues for Armenian National Academy of Sciences ID (P5212) and Armenian Parliamentary ID (P5213). All of entries of the first are joined with WD items and all the entries of the second one have items in WD. Thus, I have nearly nothing to do in M'n'M. Can you create some more cataloges from Template:Armenia properties, please? National Gallery of Armenia work ID (P5210), Spyur ID (P5217) or Armenian Cinema ID (P5218) could be the best. - Kareyac (talk) 13:40, 19 July 2018 (UTC)
Swedish_Academy_member
editThanks for your work with mix-and-match (would be interesting to know what tools you use)
Question can you change in the set up on Mix-and-Match that the sv not se Wiki is used when searching on page list/1322/unmatched I havnt found were to change it- Salgo60 (talk) 07:14, 19 June 2018 (UTC)
- Thank you Salgo60. I use the tool to import catalogs, or the tool to create a scraper. Both of them were developed by Magnus Manske. Perhaps he can change the language in that catalog. --Gerwoman (talk) 18:16, 19 June 2018 (UTC)
- Can I see somewhere how you was setting it up?!?! I tried once with no success.... All members are now part of WD so we dont need to disturb WD Guru Manske with this trivial thing - Thanks once again from another Magnus in Stockholm Sweden - Salgo60 (talk) 18:47, 19 June 2018 (UTC)
- Once created the scraper/catalog I cannot change the language, Salgo60. Sorry. I don't have superpowers. --Gerwoman (talk) 18:30, 21 June 2018 (UTC)
- it was more how you did set up the scraper so it worked finding pages...not the language code..... I feel you have skills that my slow brain can't figure out ;-( - Salgo60 (talk) 18:34, 21 June 2018 (UTC)
- Once created the scraper/catalog I cannot change the language, Salgo60. Sorry. I don't have superpowers. --Gerwoman (talk) 18:30, 21 June 2018 (UTC)
- Can I see somewhere how you was setting it up?!?! I tried once with no success.... All members are now part of WD so we dont need to disturb WD Guru Manske with this trivial thing - Thanks once again from another Magnus in Stockholm Sweden - Salgo60 (talk) 18:47, 19 June 2018 (UTC)
DBE RAH
editHi, you did create catalog 651 back in the day, and now 1340 on the same source, except 1340 has a lot less entries than 651, but I suspect contains new entries. This seems a bit messy. What do you suggest? --Magnus Manske (talk) 08:04, 2 July 2018 (UTC)
Mix'n'match catalog editor
editI have made a new Mix'n'match page to edit catalog information, such as name, description, and associated property. The page will load for everyone, but only users on my whitelist can save changes. As of now, you and I are the first ones on that list. example. If you can see the "Save" button, it works for you. You can edit all catalogs this way. Changes you save will go live immediately. Note that you can also deactivate catalogs (but not re-activate, for technical reasons). --Magnus Manske (talk) 15:23, 17 July 2018 (UTC)
Mix'n'match
editI have deactivated catalog 1156, as it is a duplicate of 807. --Magnus Manske (talk) 10:20, 20 July 2018 (UTC)
Duda
editHola, Gerwoman. Feliz agosto. Con base a tu experiencia como 'web scraper / creador de catálogos'... ¿sabes si es posible crear un catálogo de poblaciones a partir del Población del Padrón Continuo por Unidad Poblacional del INE? Vamos, una lista con nombres de localidades y numeritos como el que sale en la barra de autoridades de este artículo. Un saludo. strakhov (talk) 11:51, 5 August 2018 (UTC)
- Muchas gracias. Le echaré un ojo a las herramientas. Es complicada la jerarquía "municipio (en principio están todos puestos, los que acaban en seis ceros, con la forma recortada), localidad, concejo, parroquia, lugar,..." pero algo se puede hacer con ello. A los de las provincias que empiezan con 0 (Álava, Almería, Barcelona,... las nueve primeras por orden alfabético) creo que les falta el 0 inicial. Un cordial saludo. strakhov (talk) 12:20, 22 August 2018 (UTC)
- Me parece bien añadir el cero, valiendo tan poco él y saliendo tan barato. Me parece bien quitar los de 6 ceros al final. Los que tienen los dos últimos números finales distintos de 00... como prefieras (podemos probar a quitarlo y ver si el match con los restantes se hace más fluido; no me hago una idea de cuántos items ¿gallegos sobre todo? se quedarían fuera con la criba), si bien por el momento es menos importante y la selección de la "entidad singular" sobre la "localidad" no deja de ser ... una preferencia editorial con el consenso de 2 personas. Me parece bien darte las gracias por tu gentil ofrecimiento. ¡Gracias! strakhov (talk) 17:39, 24 August 2018 (UTC)
- Como la seda. :) Gracias. strakhov (talk) 17:35, 25 August 2018 (UTC)
HU Press catalog
editLovely work! I'm getting access to the MIT Press catalog soon; any advice for uploading + matching? Sj (talk) 17:33, 21 August 2018 (UTC)
- <3, checking in :) Sj (talk) 21:54, 13 May 2019 (UTC)
BVPB
editHola! No sé si el catálogo lo creaste tú, pero BVPB authors no coincide con la propiedad BVPB authority ID (P4802). No tienen la misma formatter url ni coinciden los números. Los de la propiedad son un identificador estable que responde a esta forma http://bvpb.mcu.es/es/consulta_aut/registro.cmd?id=$1
: mientras que los del catálogo son algo que la página llama idValor, que responde a este otro churro (http://bvpb.mcu.es/es/consulta/busqueda_referencia.cmd?campo=idautor&idValor=$1
).
Por ejemplo, para Gervasio de Artiñano y Galdácano (Q11924149)
- El valor de "idValor" es 485274
- El valor del código de autoridad tal como lo define la propiedad es 74486.
Independientemente de que haría falta una nueva propiedad para este código alternativo si se quisiera almacenar ¿cómo se corregiría esto en el catálogo? ¿Habría que desactivarlo antes de que se sigan añadiendo códigos erróneos y se haga más grande la bola de nieve? (porque visualmente y/o vía regex no es fácil determinar cuáles están bien y cuáles mal). Un saludo. strakhov (talk) 19:20, 14 September 2018 (UTC)
- Nop. ¡Tú eres el experto extrayendo cosas de las güébs! Podríamos preguntar a Discasto al menos si lo ve 'factible', pero no quisiera cargarle con más labores, que está muy apretado últimamente con unas cosas valencianas. Si hubiera un método rápido e indoloro para descargarse estos códigos estaría muy bien, pues BVPB authority ID (P4802) es idéntica a BDCYL authority ID (P3964) y Digital Valencian Library author ID (P3932) por ejemplo (además de varias otras bibliotecas digitales aún sin identificador). Con Galiciana authority ID (P3307) (también de la misma ralea) hubo un problema relacionado con esto (intento de repurpose del identificador especificado inicialmente por el otro; a mitad de partido, sucio y poco ortodoxo) y al final no sé en qué quedó el tema de la formatter url a usar. strakhov (talk) 10:05, 15 September 2018 (UTC)
- @Gerwoman, Strakhov: Si me dais más datos, puedo ver cómo de fácil/difícil sería. Pero sí, hasta que no termine lo de la Comunidad Valenciana, complicado. Un saludo --Discasto (talk) 16:49, 15 September 2018 (UTC)
- @Discasto: La idea sería sacar todos los valores de BVPB authority ID (P4802). No me parece que sirva un simple scraping de la web. Quizá habría que consultar el Repositorio OAI-PMH, pero eso me supera de momento. Gracias por el interés. --Gerwoman (talk) 10:23, 16 September 2018 (UTC)
Help understanding scraper tool?
editHi Gerwoman, I see that you're quite proficient at using the scraper tool for Mix'n'match. I'd like to learn how to use it--could you teach me? Rachel Helps (BYU) (talk) 17:36, 17 September 2018 (UTC)
- I got a co-worker to help me understand it a little better, but I still don't understand what a "follow level" is. Rachel Helps (BYU) (talk) 20:23, 17 September 2018 (UTC)
- @Rachel Helps (BYU) The "follow level" hasn't been never very clear for me. I hope that Magnus Manske can give us some light with an example. --Gerwoman (talk) 15:18, 19 September 2018 (UTC)
Two catalogs of yours
editIt appears that mix'n'match catalogs 1154 (todostuslibros books with ISBN beginning 978) and 1186 (the RoMEO journal database) never finished (or never started?) their imports. Could you check those out? Mahir256 (talk) 03:09, 19 September 2018 (UTC)
- Thanks Mahir256. I've restarted 1186. The 1154 seems to be too big for mix'n'match according to Magnus Manske. Let's see. --Gerwoman (talk) 15:16, 19 September 2018 (UTC)
- @Mahir256: The scraping of RoMEO is finished. Do you want to help with the matching? --Gerwoman (talk) 16:30, 27 September 2018 (UTC)
About MNAV artist ID
editHi!, and thanks for uploading the catalog to mixnmatch. I just wanted to tell you that since i proposed the creation of the property i've been working in the spreadsheet in openrefine to mass upload the ID. I think i can have it uploaded next week. Regards and thanks again.--Zeroth (talk) 22:46, 24 November 2018 (UTC)
Austrian Biographical Encylopedia and scraper help
editHey, thanks for adding the Austrian Biographical Encylopedia! Do you think it would be possible to fix the order of the names? Because now Franz Abel is Abel, Franz and the automatic matching doesn't work this way I guess.
Also, I would like to ask for some help with adding new scrapers as I would really like to ,earn how to do that but I don't know much about Regex and stuff. If you have time, could you show me for example how to make scrapers from http://www.bmlo.lmu.de/ and http://hbl.lzmk.hr/abecedarij.aspx ? Thank you! --Adam Harangozó (talk) 16:15, 28 November 2018 (UTC)
Wikidata:Property proposal/IEC database ID commemorative monument of Catalonia
editHe propuesto la propiedad Wikidata:Property proposal/IEC database ID commemorative monument of Catalonia al estilo de COAM structure ID (P2917). Cualquier ayuda es bien recibida. No sé si puedo ir enlazando algunos ítems. AlvarezGomez (talk) 16:23, 6 December 2018 (UTC)
Austria
editHi, FYI, I have matched and created some items for your OEML catalog, and started a new Vienna biographies catalog. --Magnus Manske (talk) 14:16, 10 December 2018 (UTC)
Deprecated Mix'n'match catalog?
editHello, Gerwoman.
It seems that this catalog is based upon a deprecated version of the AIBL's website, isn't it?
Regards,
Executed Today
editCould you try making a scraper for the site for ExecutedToday ID (P4361), please? Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 19:41, 27 December 2018 (UTC)
- Done. Happy Holidays! --Gerwoman (talk) 11:11, 28 December 2018 (UTC)
- Thank you. And to you! The dataset has few automatic matches, because the names are "polluted" with years and descriptions, for example;
1958: Istvan Angyal, Hungarian revolutionary
. Would it be possible to strip the leading digits and colon, and everything after (and including) the first comma? Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 11:59, 28 December 2018 (UTC)- Yes, I created a new catalog (2129) although several entries don't follow this pattern. --Gerwoman (talk) 12:56, 28 December 2018 (UTC)
- Thank you. And to you! The dataset has few automatic matches, because the names are "polluted" with years and descriptions, for example;
m'n'm question
editIt seems my ping didn't work. Please see Topic:Uquq1pu31lagfcuh. --- Jura 17:33, 1 January 2019 (UTC)
Question about the the Mix n' Match catalogue for Worldwide Database of University Museums and Collections
editHey
I've been looking at if it was possible to get data from the Worldwide Database of University Museums and Collections and it looks like you've already scraped it and started matching :) Can I ask you how you scraped it and when to know when we need to update Wikidata? Can I ask you to fill in a new import on Wikidata:Dataset Imports (just click the blue button), don't worry about it being perfect. I'll do my best to find some people to help with the matching.
Best
--John Cummings (talk) 14:08, 21 January 2019 (UTC)
- Hi John Cummings. I created the UMAC ID catalog last April, when the proposal was approved. I don't remember well but probably I used the search page. Perhaps Spinster, who proposed the property, is more interested in requesting the import. Best --Gerwoman (talk) 17:24, 8 February 2019 (UTC)
CBS municipality code (P382) Mix'n'Match
editI noticed on Amsterdam code (P6434) that you added Mix'n'Match based on Gemeentegeschiedenis.nl. Can you do the same for CBS code in Mix'n'Match? Thank you, Multichill (talk) 22:24, 3 February 2019 (UTC)
- Multichill, is this what you need? 2198 --Gerwoman (talk) 19:58, 12 February 2019 (UTC)
- Yes, excellent, thank you very much! Multichill (talk) 20:47, 12 February 2019 (UTC)
Bidicam
editHola! [1]. Gracias por crear el catálogo. Solo que me temo que ocurre lo que ¿otra vez? Que los identificadores del mix-n-match no son los mismos que los propuestos como propiedad. La culpa creo que es mía por la url que puse como fuente en la propuesta, que no era tanto para 'scrapearla' sino como muestra de la lista de autores que hay (mas el identificador sigue siendo ese identificador estable que se encuentra después de hacer clic etc etc y que responde a la formatter url con el formato estándar consulta_aut/registro.cmd?id=$1
). strakhov (talk) 19:40, 8 February 2019 (UTC)
Two new data sets
editHi, please could you work your magic, and scrape https://www.nzonscreen.com/people for NZ On Screen person ID (P6548); and the various categories at https://www.nzonscreen.com/explore - chiefly as https://www.nzonscreen.com/explore/category/film and https://www.nzonscreen.com/explore/documentary- for NZ On Screen work ID (P6549)? Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 10:10, 1 March 2019 (UTC)
- @Pigsonthewing: Done. --Gerwoman (talk) 16:42, 1 March 2019 (UTC)
- Those are great. Thank you. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 00:00, 2 March 2019 (UTC)
Stalled scraping of property:P5011
editHello Gerwoman, I often use Mix'n'Match to scrape for items with Prazdne Domy building ID (P5011). The current database now includes IDs over 4000 but the scraping is stalled at about 2900. You seem to be the only person able to tweak the scraping, do you think you would be so kind to look at it and try to find out why it's not scraped any more? Thank you, --Vojtěch Dostál (talk) 17:49, 3 March 2019 (UTC)
- Hi @Vojtěch Dostál:. It seems that they are adding each day new objects to the database. I've created a second catalog for the new ones. Ping me when you have finished with these two. But there are more people using the m'n'm tool. ;) --Gerwoman (talk) 19:54, 3 March 2019 (UTC)
- Oh, I thought that Mix'n'Match is able to add new entries automatically to the existing datasets! Anyway, thanks a lot :) --Vojtěch Dostál (talk) 20:15, 3 March 2019 (UTC)
- Hello, when I try to create new entries in the new catalogue, it says "The supplied language code was not recognized." Do you know what it can mean? --Vojtěch Dostál (talk) 11:20, 4 March 2019 (UTC)
Plant Illustrations artists
editGreat minds think alike! https://tools.wmflabs.org/mix-n-match/#/catalog/2272
Do you have any preference as to which we keep? Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 20:31, 11 March 2019 (UTC)
- It's your proposal @Pigsonthewing:. I've deactivated mine. --Gerwoman (talk) 19:34, 12 March 2019 (UTC)
- Thank you, but it turns out there was a bug in mine, so I've disabled that. Please re-enable yours. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 21:28, 12 March 2019 (UTC)
- May I nudge you? Plant Illustrations artist ID (P6605) has been created. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 21:38, 21 March 2019 (UTC)
- Thank you, but it turns out there was a bug in mine, so I've disabled that. Please re-enable yours. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 21:28, 12 March 2019 (UTC)
Whitney museum artists
editHi Gerwoman, could you maybe setup a catalog for https://whitney.org/artists in Mix'n'Match? Multichill (talk) 12:39, 14 April 2019 (UTC)
- This was a challenge but finally here it is: https://tools.wmflabs.org/mix-n-match/#/catalog/2381 --Gerwoman (talk) 10:48, 19 April 2019 (UTC)
- Thanks for doing this! I tried a couple, but it's really tedious with only the description "artist". Any possibility to use the date of birth/death, like for example "1889–1957" at https://whitney.org/artists/52, as part of the description? That would make everything a lot easier. Multichill (talk) 17:11, 30 April 2019 (UTC)
- Now scraping Multichill. But this will take a while. --Gerwoman (talk) 17:53, 7 May 2019 (UTC)
- Thanks! I don't see a difference yet. How long do you think a while will be? Multichill (talk) 17:38, 9 May 2019 (UTC)
- Sorry, Multichill, I created a new catalog 2407. Best regards. --Gerwoman (talk) 18:01, 9 May 2019 (UTC)
- Ah! Great. That's much easier. Did you ping Magnus about the duplication so he can do a bit of clean up? Multichill (talk) 18:08, 9 May 2019 (UTC)
- Take into account that the last one has 200 less items than the first. The matched items in the first are already matched in the last. When you finish matching in this, I will sincronize the other one, I'll try to match the 200 and I will deactivate it. Do you think is correct Magnus, Multichill? --Gerwoman (talk) 18:26, 9 May 2019 (UTC)
- Ah! Great. That's much easier. Did you ping Magnus about the duplication so he can do a bit of clean up? Multichill (talk) 18:08, 9 May 2019 (UTC)
- Sorry, Multichill, I created a new catalog 2407. Best regards. --Gerwoman (talk) 18:01, 9 May 2019 (UTC)
- Thanks! I don't see a difference yet. How long do you think a while will be? Multichill (talk) 17:38, 9 May 2019 (UTC)
- Now scraping Multichill. But this will take a while. --Gerwoman (talk) 17:53, 7 May 2019 (UTC)
- Thanks for doing this! I tried a couple, but it's really tedious with only the description "artist". Any possibility to use the date of birth/death, like for example "1889–1957" at https://whitney.org/artists/52, as part of the description? That would make everything a lot easier. Multichill (talk) 17:11, 30 April 2019 (UTC)
Add ID to catalog
editHi! Could you add Dimore Storiche Italiane ID (P6727) to its Mix'n'Match catalog? Thank you! --Epìdosis 21:32, 4 May 2019 (UTC)
- Done. --Gerwoman (talk) 17:53, 7 May 2019 (UTC)
mix 'n match big thanks
editthank you so much for working on the fellows of the AAAS mix 'n match. you're the best :) Sbbarker19 (talk) 19:49, 9 May 2019 (UTC)
BlackPast.org
editHi, can you connect mix'n'match catalog 2385 with BlackPast.org ID (P6723)? Thanks! Trivialist (talk) 22:23, 9 May 2019 (UTC)
- Thanks for the above. Could you also link Official Charts artist ID (P6559) and 2232? I just discovered that they hadn't been connected. Thanks again. Trivialist (talk) 15:40, 11 May 2019 (UTC)
- done. --Gerwoman (talk) 15:48, 11 May 2019 (UTC)
Whitney
editHi, I closed the smaller one of the two Whitney artist catalogs you created. 2381 remains. --Magnus Manske (talk) 07:46, 10 May 2019 (UTC)
- I just saw the discussion above. I am porting the descriptions over to the old catalog. --Magnus Manske (talk) 08:48, 10 May 2019 (UTC)
- Also added an auto-scraper. --Magnus Manske (talk) 08:48, 10 May 2019 (UTC)
Mix'n'Match 2348
editGood afternoon! I wonder if you are able to help me with the Mix'n'Match catalouge https://tools.wmflabs.org/mix-n-match/#/list/2348/unmatched. As I have discussed this case with @Anne-Sophie Ofrim: I am not able to "automatically" match into an item. I.e for https://www.wikidata.org/w/index.php?button=&title=Special%3ASearch&search=Lorentz%20Diderich%20Kl%C3%BCwer%20haswbstatement%3AP31%3DQ5&ns0=1&ns120=1 I was thinking that it should be possible to search wikidata, find the item and then have Arkivportalen agent ID (P5887) for Lorentz Diderich Klüwer (Q6680503) automatically created. Do you have any comment, if I have explained the problem in an understandable way. Breg Pmt (talk) 18:41, 12 May 2019 (UTC)
- Yes Pmt. I've linked the property to the catalog. Now you can use, for example, this visual tool to match the entries. --Gerwoman (talk) 18:02, 13 May 2019 (UTC)
catalog link request
editCan you link Know Your Meme ID (P6760) and 2417? Thanks. Trivialist (talk) 23:11, 15 May 2019 (UTC)
- Done. --Gerwoman (talk) 18:10, 16 May 2019 (UTC)
Thank you
editWith reference to Your edit and help on Mix'n'Match 2348 as asked fronm me 12. may 2019 I would like to thank. Your help made it possible for me to do quite an important work on Norwegian Archives. Again thank you. Best regards Pmt (talk) 08:11, 18 May 2019 (UTC)
Find New Zealand Artists
editHi there; I'm a beginner with Mix'n'Match, so forgive me if I've got it wrong. I saw you just uploaded this dataset. I've been working with the Christchurch Art Gallery Te Puna o Waiwhetū, one of the maintainers of this database, and we have property proposal for Find NZ Artists ID currently under discussion. Is there a way we can make sure that this property, when approved, can be included in the Wikidata items via Mix'n'Match? Do we need to reset the import and have another go? And perhaps it would make more sense if the Gallery uploaded a subset of the artist names that included birth and/or death dates, to help with matching. Thoughts? —Giantflightlessbirds (talk) 09:05, 20 May 2019 (UTC)
- Yes Giantflightlessbirds. We can link the property to the mix&match catalog, synchronize with WD the already matched items, and from then on the property will be added to the new matched items. Of course the list you propose would help. Or you can create it yourself with the download action of mix'n'match. --Gerwoman (talk) 18:21, 23 May 2019 (UTC)
- OK, I'll let you know as soon as the property is approved, and if you can help link/synchronize the already-matched items in the catalog that would be great. I'll talk to the Gallery about generating a list from Find NZ Artists with Entry IDs, names, and higher-quality descriptions (if possible). Should I upload that as a new catalog, or is there a way of replacing the current descriptions with improved ones? Advice welcome. —Giantflightlessbirds (talk) 02:29, 24 May 2019 (UTC)
- No problem. You can create a new catalog. --Gerwoman (talk) 16:09, 24 May 2019 (UTC)
- Property is ready to go. Magnus is synching the first catalogue now. --99of9 (talk) 12:37, 29 May 2019 (UTC)
- Right, I've uploaded another cleaned-up and slightly better-formatted catalogue as Find NZ Artists 2.0; restricted to people where birth/death dates and or birth/death place known, which should make things easier to match. We found quite a few mistakes with the original metadata and made some corrections for this upload. I notice the automatic matching wasn't able to detect pretty simple matches like William Mein Smith (Q8015537), although the name, birth, and death dates were all available. Any way of mapping over the matches already confirmed in the original Find NZ Artists upload? —Giantflightlessbirds (talk) 02:08, 4 June 2019 (UTC)
- To map any IDs that are already matched on Wikidata, you can do "Action"->"Manually Sync Catalogue". I'm just doing that now for your new catalogue. --99of9 (talk) 05:48, 4 June 2019 (UTC)
- Right, I've uploaded another cleaned-up and slightly better-formatted catalogue as Find NZ Artists 2.0; restricted to people where birth/death dates and or birth/death place known, which should make things easier to match. We found quite a few mistakes with the original metadata and made some corrections for this upload. I notice the automatic matching wasn't able to detect pretty simple matches like William Mein Smith (Q8015537), although the name, birth, and death dates were all available. Any way of mapping over the matches already confirmed in the original Find NZ Artists upload? —Giantflightlessbirds (talk) 02:08, 4 June 2019 (UTC)
- Property is ready to go. Magnus is synching the first catalogue now. --99of9 (talk) 12:37, 29 May 2019 (UTC)
- No problem. You can create a new catalog. --Gerwoman (talk) 16:09, 24 May 2019 (UTC)
- OK, I'll let you know as soon as the property is approved, and if you can help link/synchronize the already-matched items in the catalog that would be great. I'll talk to the Gallery about generating a list from Find NZ Artists with Entry IDs, names, and higher-quality descriptions (if possible). Should I upload that as a new catalog, or is there a way of replacing the current descriptions with improved ones? Advice welcome. —Giantflightlessbirds (talk) 02:29, 24 May 2019 (UTC)
This has no Wikidata property
edit|Do you know how to fix this? Kind regards Trade (talk) 06:40, 27 May 2019 (UTC)
New Mix'n'match catalogs
editHi! Could you create catalogs for Radio Radicale organizer ID (P4339), Radio Radicale person ID (P4521) and Treccani's Dizionario di Storia ID (P6404)? Thank you! --Epìdosis 11:08, 3 June 2019 (UTC)
- BTW, thank you very much for PHI Latin Texts and Pinakes! --Epìdosis 15:24, 3 June 2019 (UTC)
- Thank you for the catalog of Treccani's Dizionario di Storia ID (P6404).
- Could you also remove ".asp" in the formatter URL of catalog 1097? At the moment the URLs are broken. Thank you, --Epìdosis 20:35, 3 June 2019 (UTC)
- I cannot change the URL. Let's ask Magnus Manske. --Gerwoman (talk) 18:18, 4 June 2019 (UTC)
Could you add Pinakes author ID (P6831) to catalog 2501? Thank you! --Epìdosis 07:08, 10 June 2019 (UTC)
- Could you create a catalog for digilibLT author ID (P6862)? Thank you very much! --Epìdosis 15:24, 18 June 2019 (UTC)
- Sorry, I don't know how to scrape that site. --Gerwoman (talk) 17:46, 18 June 2019 (UTC)
How about Radio Radicale organizer ID (P4339) and Radio Radicale person ID (P4521)? Thank you again, --Epìdosis 17:23, 22 August 2019 (UTC)
- Epì, I don't how to obtain these lists. --Gerwoman (talk) 14:52, 26 August 2019 (UTC)
MnM catalogue 1112 again...
editHello, are you able to extend the MnM catalogue 1112 so that it scrapes latest identifiers (recently added to the database?) It currently ends at 4500 but I'd like it to be able to go on indefinitely as new entries are added to the database. Thank you! :) --Vojtěch Dostál (talk) 13:05, 17 June 2019 (UTC)
- Thank you for reaching out to Magnus! It works very well now. --Vojtěch Dostál (talk) 11:29, 19 June 2019 (UTC)
Mix'n'match - Hoopla publisher
editHi, Gerwoman, could you link Hoopla publisher ID (P6869) and catalog 2442? Thanks! Trivialist (talk) 23:38, 19 June 2019 (UTC)
- Done. --Gerwoman (talk) 18:32, 20 June 2019 (UTC)
Learn to create an specific catalog in Mix'n'match
editHi Gerwoman,
I have seen you have created a lot of catalogs in Mix'n'match and you always try to help to the ones who ask you for help in Mix'n'match. I have read the instructions to create catalogs and I understand them but I have doubts with the website I have to configure and scrape. The website is the catalog of the BIC (cultural heritage) of the Canary Islands. Each item has its own page with its identifier, e.g. 1027 for Barranco de Guayadeque (Q781932).
I have though in two ways to work. The first one is with this url http://www.gobiernodecanarias.org/cultura/patrimoniocultural/bics/index.html?bic=true&cod=$1
with a range level. However, the number of the code isn't correlative so it is very complicated to configure a range level. The second way I though is scraping the pages in which the items are listed (e.g., www.gobiernodecanarias.org/cultura/patrimoniocultural/bics/index.html?inicio=12
). This one is possible to scrape because ?inicio=$1
is from 0 to 408 with steps of 12 (0, 12, 24, 36, 48 and correlative). But then the HTML tag to scrape is very dirty, check this example:
<a style="background:rgba(235,122,26,0.7);" target="_self" href="index.html?bic=true&cod=130" title="Zona Arqueológica">Barranco de Silva</a>
I tried to make a regular expression to catch it but I couldn't. Could you help me with this catalog? Thanks in advance!
Regards, Ivanhercaz (Talk) 20:42, 9 July 2019 (UTC)
- I forgot to share with you the property proposal in which I am working. Regards, Ivanhercaz (Talk) 20:58, 9 July 2019 (UTC)
- Hi Ivanhercaz, for the second way you can try this regex:
<div class="cartel" > <a style="background:rgba.+?" target="_self" href="index.html\?bic=true&cod=(\d+)" title=".+?">(.+?)</a> </div>
. With a little longer regex you can include Isla and Municipio. --Gerwoman (talk) 15:11, 10 July 2019 (UTC)- Fantastic, Gerwoman! Thank you very much for your help. Nice regex. The catalog is now available, we only have to wait for the creation of the property if it is accepted. A doubt: when the property will be ready, I now that I can edit the catalog to add it, but would it be added to all the items matched before he creation of the property? Regards, Ivanhercaz (Talk) 22:46, 10 July 2019 (UTC)
- Yes, we can do a sync. --Gerwoman (talk) 15:05, 11 July 2019 (UTC)
- Fantastic, Gerwoman! Thank you very much for your help. Nice regex. The catalog is now available, we only have to wait for the creation of the property if it is accepted. A doubt: when the property will be ready, I now that I can edit the catalog to add it, but would it be added to all the items matched before he creation of the property? Regards, Ivanhercaz (Talk) 22:46, 10 July 2019 (UTC)
- Hi Ivanhercaz, for the second way you can try this regex:
Ciência ID Mix'n'match catalog
editHi Gerwoman,
I want to propose again the Ciência ID, but I wanted to create a Mix'n'match catalog before to propose it again. However, in my opinion it would be very complicated due to the format of the url (e.g., https://www.cienciavitae.pt/portal/en/A11B-38F7-A1B1
). The identifier isn't consecutive and there isn't any page with all the authors to scrape the data or the possibility to make a general search nor a basic search by the main letter of a surname.
Have you any idea about how could be scraped this website?
Regards, Ivanhercaz (Talk) 10:44, 13 July 2019 (UTC)
- Sorry Ivanhercaz, I don't know how to do that. Perhaps you can ask for access to the API. --Gerwoman (talk) 18:14, 13 July 2019 (UTC)
- Nice idea. I will ask for access to the API. Regards, Ivanhercaz (Talk) 10:41, 15 July 2019 (UTC)
New Mix'n'match catalogs (2)
editHi! Could you create a catalog for Musisque Deoque author ID (P6999)? Thank you very much, --Epìdosis 17:43, 13 July 2019 (UTC)
- Scraping now. Let see. --Gerwoman (talk) 18:14, 13 July 2019 (UTC)
Mix'n'match catalogue for Media Art Database
editHi, could you create a catalogue for P3231 (P3231)? Thank you very much --Lombres (talk) 13:19, 17 July 2019 (UTC)
- the main language is japanese, you won't find any English names in the database --Lombres (talk) 13:40, 17 July 2019 (UTC)
- Have a look at this: catalog 2698. --Gerwoman (talk) 15:46, 17 July 2019 (UTC)
Barnstar
edit¡Buen trabajo colgando cruces benéficas! Un saludo.--Asqueladd (talk) 16:37, 21 July 2019 (UTC)
Hello Gerwoman, now that Paris Foreign Missions Society ID (P7077) has been created, could you linked it in Mix'n'Match please? Thanks. Ayack (talk) 10:56, 24 July 2019 (UTC)
- Done. --Gerwoman (talk) 15:07, 24 July 2019 (UTC)
Mix'n'Match for some new properties
editHi, User:Epìdosis told me to ask you directly. I have proposed some new properties such as Pinakes and he asked you the Mix'n Match. There is another one I proposed recently I would like to complete the insertion, it's the database of Max Planck Society. Can you help me? No hurry!--Alexmar983 (talk) 12:52, 30 July 2019 (UTC)
- Sorry Alexmar983, I don't know how to get the list of items in that site. Perhaps Magnus Manske can help us. --Gerwoman (talk) 15:28, 30 July 2019 (UTC)
- ok. If it's too complicated/not possible it will be a site we do by hand. Not the end of the world, it's an important ID, we won't forget.--Alexmar983 (talk) 19:52, 30 July 2019 (UTC)
New catalog for The Latin Library
editHi! Could you create a catalog for The Latin Library author ID (P7042)? Thank you very much! --Epìdosis 10:55, 12 August 2019 (UTC)
- Done. 2730. --Gerwoman (talk) 12:00, 15 August 2019 (UTC)
New catalogs
editHi, could you please create Mix'n'Match scrapers for the following encyclopedias? Thanks!
- Encyclopedia of Indigenous Peoples in Brazil
- Cambridge Encyclopedia of Anthropology
- Encyclopedia on Early Childhood Development
- Holocaust Encyclopedia
--Adam Harangozó (talk) 13:37, 15 August 2019 (UTC)
- Hi Adam Harangozó, I've created 2746, 2747, and 2748. I don't manage to create the one for the first one. --Gerwoman (talk) 15:57, 23 August 2019 (UTC)
- Thanks a lot! Could you somehow share some examples of filled "Add new scraper" forms? I really want to learn how to add new ones but it never works when I try. Thanks! --Adam Harangozó (talk) 12:46, 6 September 2019 (UTC)
Hi Adam Harangozó. One example for this request: Bursa Malaysia stock code.
- add range level from 0 to 0
- URL pattern: http://www.bursamalaysia.com/market/listed-companies/list-of-companies/plc-profile.html
- RegEx entry: <option value="(.+?)">(.+?) \(.+?\)</option>
- id: $1
- name: $2
- url: http://www.bursamalaysia.com/market/listed-companies/list-of-companies/plc-profile.html?stock_code=$1
Hope this helps. --Gerwoman (talk) 15:34, 6 September 2019 (UTC)
Adjust catalog 2186's formatter URL
editIt turns out that the West Bengal Council of Higher Secondary Education (Q7984436)'s website only accepts HTTPS requests and bungles HTTP ones, so that the outbound links from mix'n'match in the catalog titling this section do not work at all. Could the formatter URL be adjusted so that HTTPS is used? Mahir256 (talk) 22:12, 16 August 2019 (UTC)
- Sorry Mahir256, I can't modify that. Please ask Magnus Manske. Best regards. --Gerwoman (talk) 16:07, 23 August 2019 (UTC)
Arkivportalen Contributor Error
editHello! I am having some errormessages when using Mix 'n' Match catalogue 2348 can you help? What happens is that when trying to match an item manualy I end up With tis Message from Bing in Norwegian; Kontroller at du har riktig nettadresse(check correct net-adress: http://live.arkivportalen.no and Søk etter (search for) "http://live.arkivportalen.no" på Bing. Breg Pmt (talk) 10:48, 20 August 2019 (UTC)
- Sorry Pmt, I can't modify that. Please ask Magnus Manske. Best regards. --Gerwoman (talk) 15:53, 23 August 2019 (UTC)
- Okay, I will. Thank you anyway. As I have nowbeen told it is about changing http://live.arkivportalen.no to http://arkivportalen.no in all entries in catalogue 2348. Breg Pmt (talk) 16:33, 23 August 2019 (UTC)
Mix 'n' Match visualation tool
editGood afternoon! I have been using the tool you made for Arkivportalen Contributor M 'n' M cat. no 2348. https://tools.wmflabs.org/mix-n-match/visual_match.html#catalog=2348&auto_advance=1 . Can you help tell me where I can find the tool for creating it for another catalogue? Breg Pmt (talk) 19:02, 4 September 2019 (UTC)ogue
- Yes Pmt, go to https://tools.wmflabs.org/mix-n-match/#/scraper/new --Gerwoman (talk) 14:32, 5 September 2019 (UTC)
- Thank you! Breg Pmt (talk) 14:40, 5 September 2019 (UTC)
Community Insights Survey
editShare your experience in this survey
Hi Gerwoman,
The Wikimedia Foundation is asking for your feedback in a survey about your experience with Wikidata and Wikimedia. The purpose of this survey is to learn how well the Foundation is supporting your work on wiki and how we can change or improve things in the future. The opinions you share will directly affect the current and future work of the Wikimedia Foundation.
Please take 15 to 25 minutes to give your feedback through this survey. It is available in various languages.
This survey is hosted by a third-party and governed by this privacy statement (in English).
Find more information about this project. Email us if you have any questions, or if you don't want to receive future messages about taking this survey.
Sincerely,
Reminder: Community Insights Survey
editShare your experience in this survey
Hi Gerwoman,
A couple of weeks ago, we invited you to take the Community Insights Survey. It is the Wikimedia Foundation’s annual survey of our global communities. We want to learn how well we support your work on wiki. We are 10% towards our goal for participation. If you have not already taken the survey, you can help us reach our goal! Your voice matters to us.
Please take 15 to 25 minutes to give your feedback through this survey. It is available in various languages.
This survey is hosted by a third-party and governed by this privacy statement (in English).
Find more information about this project. Email us if you have any questions, or if you don't want to receive future messages about taking this survey.
Sincerely,
New encyclopedia
editHi, could you please make a Mix'n'Match scraper for the Croatian Encyclopedia? Thank you! --Adam Harangozó (talk) 17:29, 22 September 2019 (UTC)
- Done in 2 catalogs: 2811 and 2812. --Gerwoman (talk) 17:40, 23 September 2019 (UTC)
P7334 mix'n'match catalogs
editHi, can you link catalogs 2799, 2800, and 2801 to Vudu video ID (P7334)? Thanks. Trivialist (talk) 21:41, 22 September 2019 (UTC)
- Done. --Gerwoman (talk) 17:46, 23 September 2019 (UTC)
Thanks for creating that one!
(I was kind-of hoping to use it as exercise for a workshop I’m making in two weeks, but ah well − there is no monopoly on good ideas ;-))
Cheers, Jean-Fred (talk) 17:56, 3 October 2019 (UTC)
- No problem, Jean-Fred, I can disable it so you can create again in your workshop. This was easy to create although took a while to scrape and match. --Gerwoman (talk) 18:12, 3 October 2019 (UTC)
- Thanks for offering! It should be okay though − I have a couple more on my todo (please don’t make one for sixpackfilmdata film ID (P7340) and sixpackfilmdata person ID (P7341)! ;-D) (in particular video-game related ones).
- Also, attendees are supposed to come with their own catalogue to import, so I only need these for backup.
- Jean-Fred (talk) 18:46, 3 October 2019 (UTC)
Q63954904
editHi there!
Could you please add some IDs to Jean Dumortier (Q63954904)? I searched but unfortunately found nothing.
Best, Nomen ad hoc (talk) 20:36, 5 October 2019 (UTC).
- Done! --Gerwoman (talk) 16:54, 18 October 2019 (UTC)
Catholic Hierarchy person 1
editHello Gerwoman! Unfortunately the Mix'n'match for Catholic Hierarchy person 1 seems to have stopped working. I guess that is because the website is not longer reachable via https while http still works. But the Mix'n'match iframe for visual matches tries to load the Website via https. Could you look at this problem, please. Thank you very much. --Looperz (talk) 02:02, 6 October 2019 (UTC)
- Sorry, Looperz, I can't change that. Please ask Magnus Manske or use the mobile matching. --Gerwoman (talk) 17:03, 18 October 2019 (UTC)
New page for catalogues
editHi, I created a new page where I started collecting sites that could be added to Mix'n'match and I plan to expand it with the ones that already have scrapers by category. Feel free to use, expand. Best, --Adam Harangozó (talk) 09:54, 17 October 2019 (UTC)
mix'n'match: Austrian Parliament 1848-1918, please add Property:P7491
editPlease add https://www.wikidata.org/wiki/Property:P7491 to https://tools.wmflabs.org/mix-n-match/#/catalog/2896 if that is still possible. thnx --PeterTheOne (talk) 19:18, 30 October 2019 (UTC)
- Done. --Gerwoman (talk) 19:26, 30 October 2019 (UTC)
Sächsische Biografie
editHey, my bad but it turns out that the Sächsische Biografie already had a catalogue (https://tools.wmflabs.org/mix-n-match/#/catalog/103), sorry about that. Can you delete the scraper? Best, --Adam Harangozó (talk) 17:24, 3 November 2019 (UTC)
- Done --Gerwoman (talk) 17:59, 3 November 2019 (UTC)
Polish theatre at Mix'n'match
editHi! I've noticed that you uploaded to Mix'n'match two sets of records from the Polish Theatre Institute: e-teatr.pl people (Encyklopedia Teatru Polskiego person ID (P5058)) and, more recently, The Polish Theatre Encyclopedia. Do you think you could do something similar for Encyklopedia Teatru Polskiego play ID (P6679), which also uses e-teatr.pl records? Thank you in advance. Powerek38 (talk) 16:46, 19 November 2019 (UTC)
- Done 3031 --Gerwoman (talk) 08:40, 21 November 2019 (UTC)
Disney+ mix'n'match
editHi, could you link these catalogs?
Thanks as always. :) Trivialist (talk) 15:01, 24 November 2019 (UTC)
- Done --Gerwoman (talk) 15:14, 24 November 2019 (UTC)
Overdrive publisher ID mix'n'match
editAnother catalog link request: 3040 → OverDrive publisher ID (P7639). Thanks. Trivialist (talk) 15:53, 29 November 2019 (UTC)
- Done but there is a problem with some entries, e.g. Piter (Q4363832). Did you select UTF-8 encoding? --Gerwoman (talk) 16:54, 29 November 2019 (UTC)
- I don't think I selected UTF-8, unfortunately. Also, could you link 3041 and OverDrive series ID (P7648)? Thanks. Trivialist (talk) 18:40, 6 December 2019 (UTC)
- Done There is a problem with the Cyrillic script (Q8209), for example in The Hunger Games (Q11679). --Gerwoman (talk) 18:59, 6 December 2019 (UTC)
- I had been leaving that unchecked when creating catalogs, but I'll pay more attention to that in the future. Trivialist (talk) 19:43, 6 December 2019 (UTC)
- Done There is a problem with the Cyrillic script (Q8209), for example in The Hunger Games (Q11679). --Gerwoman (talk) 18:59, 6 December 2019 (UTC)
- I don't think I selected UTF-8, unfortunately. Also, could you link 3041 and OverDrive series ID (P7648)? Thanks. Trivialist (talk) 18:40, 6 December 2019 (UTC)
Mix 'n' Match catalogues
editGood evening! A general question regardig Mix 'n' Match and adding values when matching. If I am using the Visual tool and matchin an item. Is it then a way that i can add other P-values. I.e for https://tools.wmflabs.org/mix-n-match/#/catalog/3063 can I edit and add for instance P-values like country (P17), located in the administrative territorial entity (P131) or maximum size or capacity (P3559) in the Visual Tools rigth window?. Breg Pmt (talk) 17:02, 30 November 2019 (UTC)
- I don't think so. But you can click in the Q number and edit the item in a new window. --Gerwoman (talk) 17:21, 30 November 2019 (UTC)
mix'n'match catalogs 3104, 3105, 3141, 802
editA few more mix'n'match catalogs to link to properties:
Thanks. Trivialist (talk) 23:11, 30 December 2019 (UTC)
- I add Mille Anni di Scienza in Italia ID (P7744) - 802. Thank you as always! --Epìdosis 11:22, 31 December 2019 (UTC)
Done --Gerwoman (talk) 12:20, 31 December 2019 (UTC)
New catalog for P7753
editHi! Could you create a catalog for Projekt Gutenberg-DE author ID (P7753)? Thank you very much and happy 2020, --Epìdosis 11:24, 31 December 2019 (UTC)
- Done Happy New Year! --Gerwoman (talk) 13:28, 31 December 2019 (UTC)
Atlas Obscura mix'n'match
editCould you link 3146 and Atlas Obscura ID (P7772)? Thanks. Trivialist (talk) 22:08, 1 January 2020 (UTC)
- Done! --Gerwoman (talk) 11:12, 2 January 2020 (UTC)
EDIT16 author
editHi! Could you create a catalog for EDIT16 catalogue author ID (P5492)? The values are all numbers from 1 to 30077; however, at least for now only values with "Stato: Massimo" should be imported, because other values are not sufficiently stable; if it is not possible to apply this restriction, it is better doing no import at all for now. Thank you very much as always, --Epìdosis 17:52, 12 January 2020 (UTC)
- Done --Gerwoman (talk) 18:41, 13 January 2020 (UTC)
RI_OPAC
editIs it fair to say that everyone in this catalog is occupation (P106):historian (Q201788)? If so, I can add that to all the entries, and auto-sync it to matched items that don't have an occupation. --Magnus Manske (talk) 08:41, 15 January 2020 (UTC)
- Yes, if they don't have an occupation I think it's safe to say that they are historians. Thank you. --Gerwoman (talk) 18:24, 15 January 2020 (UTC)
Connect catalog
editHi! https://tools.wmflabs.org/mix-n-match/#/catalog/2470 should be connected to National Film Board of Canada director ID (P6891). Thanks! --Epìdosis 15:49, 19 January 2020 (UTC)
- Thank you Epì. The problem is that the catalog is duplicated: 3251. --Gerwoman (talk) 16:22, 19 January 2020 (UTC)
- @Gerwoman: OK: 3251 contains more entries, so 2470 should be deactivated, is it correct @Magnus Manske:? --Epìdosis 16:37, 19 January 2020 (UTC)
Bibliotheca Hagiographica Latina
editMany thanks for creating the Mix'n'match import for BHL! Very few of the works in there will already be in Wikidata (though many are in VIAF), but I am hoping to try importing them en masse.
A challenge is that the titles as BHL represents them are not as one would expect: e.g. for BHL 8889 it gives 'Wilfridus ep. Eboracensis' under 'dossier' and 'Vita auct. Eddio Stephano' as the title. I've aimed to show in the Vita sancti Wilfrithi entry how one would ideally import all this information (let me know if it can be improved), using main subject (P921), author (P50), first line (P1922), and last line (P3132). I realize that this will require a large amount of manual editing, and I am still working out how to do this. I was planning to use OpenRefine then convert it all into QuickStatements, but let me know if you know a better way of going about it. AndrewNJ (talk) 10:44, 4 February 2020 (UTC)
- Hi AndrewNJ. I think that your approach is correct. It's a challenge indeed. But I have no experience with OpenRefine I'm afraid. Perhaps Epì and Bargioni could help you. Best --Gerwoman (talk) 18:50, 13 February 2020 (UTC)
- @AndrewNJ: I'm not sure if you need something like this command (very raw)
- @AndrewNJ: I'm not sure if you need something like this command (very raw)
curl -s 'http://bhlms.fltr.ucl.ac.be/Nquerysainttitre.cfm?code_bhl=8889' | grep -E 'Dossier:|Titre:|Incipit:|Desinit:' | sed 's/<[^>]*>//g'
to extract data from BHLms. If so, perhaps it could be improved, up to obtain CREATE commands for QuickStatements. --Bargioni (talk) 08:27, 17 February 2020 (UTC)
- @Bargioni: Brilliant – thank you! AndrewNJ (talk) 10:16, 17 February 2020 (UTC)
PUST author ID
editHi, since you added PUST author ID https://tools.wmflabs.org/mix-n-match/?#/catalog/1268 in mix'n'match, I'd suggest to change the link to PUST records from (e.g.)
https://pust.urbe.it/cgi-bin/koha/opac-authoritiesdetail.pl?authid=306898
to
https://pust.urbe.it/cgi-bin/koha/opac-authoritiesdetail.pl?marc=1&authid=306898. @Epìdosis: agrees with me. Thx a lot. --Bargioni (talk) 14:22, 6 February 2020 (UTC)
- Hi, @Epìdosis, Bargioni: you can change it in Angelicum ID (P5731). As the identifier is the same, there is no need to change it in the catalog, and I cannot change it there anyway. --Gerwoman (talk) 19:31, 6 February 2020 (UTC)
- OK, changed in the property. --Epìdosis 19:35, 6 February 2020 (UTC)
- @Epìdosis, Gerwoman: Thx to you both. --Bargioni (talk) 21:32, 13 February 2020 (UTC)
- OK, changed in the property. --Epìdosis 19:35, 6 February 2020 (UTC)
Catalan
editHi! I saw you imported Gran Enciclopèdia de la Música ID (P6412) to Mix'n'match. Do you think you could do the same with Diccionari de la Literatura Catalana ID (P7357) and Enciclopèdia de l'Esport Català ID (P5513)? It should be a similar structure. --Davidpar (talk) 14:25, 9 February 2020 (UTC)
- Done --Gerwoman (talk) 19:04, 11 February 2020 (UTC)
UvA identifiers with missing leading zero's
editHi, in the mix n match of the university of Amsterdam https://tools.wmflabs.org/mix-n-match/#/catalog/2510 we have identifiers without leading zeros to match with. The statistics and matching is becoming incorrect. Is it possible to change the numbers wih the zeros? --Hannolans (talk) 22:16, 11 February 2020 (UTC)
- Hi Hannolans. I don't know what happened with that catalog. I've created a new one with all the items and the correct identifiers: 3377. --Gerwoman (talk) 18:20, 13 February 2020 (UTC)
- Thanks! If possible, one improvement could be to place the dob, now in the description, in the year column for automatic matching. --Hannolans (talk) 13:42, 14 February 2020 (UTC)
- Asked Magnus Manske to do that. Be patient. --Gerwoman (talk) 16:13, 14 February 2020 (UTC)
- Thanks! If possible, one improvement could be to place the dob, now in the description, in the year column for automatic matching. --Hannolans (talk) 13:42, 14 February 2020 (UTC)
Auxiliary data
editHi! Could you explain me and @Bargioni: how "auxiliary data" work on Mix'n'match when importing a new catalog? Thank you very much! --Epìdosis 14:12, 13 February 2020 (UTC)
- Hi Epì, Bargioni. I think that Magnus has to add a new field with the data. For example ORCID ID in: https://tools.wmflabs.org/mix-n-match/#/entry/87330610 --Gerwoman (talk) 18:38, 13 February 2020 (UTC)
- OK. So the auxiliary data are added after the catalog has been imported, right? --Epìdosis 19:59, 13 February 2020 (UTC)
- Not only added, in my opinion, but also formatted in some way. E.g.: VIAF vvvvv|ISNI 0000-0000-1234-5678 (if more than one is allowed). --Bargioni (talk) 21:37, 13 February 2020 (UTC)
- At the moment, only ID, name and description can be automatically added with the available tools: import or scraping. All the rest is done by Magnus, although he is adding continuously more and more functionalities to M&M. I usually write the auxiliary data to the description field, hoping it to be useful in the near future. --Gerwoman (talk) 16:28, 14 February 2020 (UTC)
- Not only added, in my opinion, but also formatted in some way. E.g.: VIAF vvvvv|ISNI 0000-0000-1234-5678 (if more than one is allowed). --Bargioni (talk) 21:37, 13 February 2020 (UTC)
- OK. So the auxiliary data are added after the catalog has been imported, right? --Epìdosis 19:59, 13 February 2020 (UTC)
SIUSA people (bis)
editHi! In SIUSA there are now 3911 pages (not only 3587); I tried autoscrape in https://tools.wmflabs.org/mix-n-match/#/catalog/2395, but IDs remained 3587. Would it maybe be necessary to re-import the catalog? Thanks! --Epìdosis 17:41, 3 March 2020 (UTC)
- Perhaps Magnus can extend the number of pages escraped. --Gerwoman (talk) 18:08, 3 March 2020 (UTC)
New catalogs - bis
editHi! Could you create a catalog for BNCF Thesaurus ID (P508)? The list of terms starts here: please consider only terms in bold. Ask me if you have doubts. Thanks as always! --Epìdosis 23:26, 4 March 2020 (UTC)
Could you also scrape this? Thanks, --Epìdosis 13:29, 5 March 2020 (UTC)- For ToposText we can wait a bit, since here I was informed that a new upload with some changes will be soon online. --Epìdosis 13:39, 5 March 2020 (UTC)
- Hi Epì. Perhaps in this case is better to download the data from https://thes.bncf.firenze.sbn.it/thes-dati_eng.htm I don't manage to scrape this web. --Gerwoman (talk) 17:50, 5 March 2020 (UTC)
- OK, I will try. Thanks, --Epìdosis 17:52, 5 March 2020 (UTC)
Mix'n'match catalogs
editCan you link these catalogs to their properties?
Thanks. Trivialist (talk) 23:37, 4 March 2020 (UTC)
- Someone already did. --Gerwoman (talk) 16:58, 5 March 2020 (UTC)
- Oops, sorry! :) I do have one that doesn't appear to have been linked yet: FandangoNow ID (P7970) and catalog 3268. Trivialist (talk) 22:04, 16 March 2020 (UTC)
- Linked now! --Gerwoman (talk) 16:05, 17 March 2020 (UTC)
Poeti d'Italia in lingua latina
editHi! Could you create a catalog for new Poeti d'Italia in lingua latina author ID (P7992)? It has the same structure as Musisque Deoque author ID (P6999) (= https://tools.wmflabs.org/mix-n-match/#/catalog/2692). Thank you very much! --Epìdosis 21:54, 19 March 2020 (UTC)
- Done https://tools.wmflabs.org/mix-n-match/#/catalog/3447 --Gerwoman (talk) 12:19, 20 March 2020 (UTC)
- Hi! @Bargioni: and I have just noticed that in this catalog there are many problematic entries, where the name of the entry on MnM doesn't coincide with the name of the entry in the website (e.g. https://mix-n-match.toolforge.org/#/entry/90161998 and https://mix-n-match.toolforge.org/#/entry/90162037). Could you rescrape the catalog? Thanks, --Epìdosis 14:31, 28 July 2020 (UTC)
- Hi @Bargioni:, @Epìdosis:. I've created a new catalog, https://mix-n-match.toolforge.org/#/catalog/3710 , but I think that the IDs aren't stable. They have added, removed and changed some entries. --Gerwoman (talk) 16:36, 28 July 2020 (UTC)
- @Gerwoman: @Epìdosis: Thx for the new catalog. Let's wait some days to verify IDs stability. Otherwise, we may write to some contact, listed in http://mizar.unive.it/poetiditalia/public/index/collaboratori. -- Bargioni 🗣 21:06, 28 July 2020 (UTC)
- Hi @Bargioni:, @Epìdosis:. I've created a new catalog, https://mix-n-match.toolforge.org/#/catalog/3710 , but I think that the IDs aren't stable. They have added, removed and changed some entries. --Gerwoman (talk) 16:36, 28 July 2020 (UTC)
- Hi! @Bargioni: and I have just noticed that in this catalog there are many problematic entries, where the name of the entry on MnM doesn't coincide with the name of the entry in the website (e.g. https://mix-n-match.toolforge.org/#/entry/90161998 and https://mix-n-match.toolforge.org/#/entry/90162037). Could you rescrape the catalog? Thanks, --Epìdosis 14:31, 28 July 2020 (UTC)
Mix'n'Match DBE
editHola!
Estoy usando bastante el Mix'n'Match del DBE, y quería hacerte una consulta. ¿Cómo se hace para que sugiera enlaces automáticos? Hace una semana había más de 2500 y los reduje a 200, y creo que sería útil hacer otro barrido para que haga nuevas búsquedas. Cuando asocio de forma manual, veo que bastantes fichas del DBE se corresponden bastante con Qs existentes, por lo que creo que se podría adelantar trabajo. No sé si es posible, vaya, pero creo que sería interesante.
Muchas gracias! Un saludo! --Estevoaei (talk) 00:26, 22 March 2020 (UTC)
- Buen trabajo, Estevoaei. He lanzado Jobs -> automatch by search, a ver qué tal. Tardará un rato. --Gerwoman (talk) 09:58, 22 March 2020 (UTC)
- No ha encontrado nada. El "automatch from other catalogs" unos 40 nuevos. --Gerwoman (talk) 11:26, 22 March 2020 (UTC)
- El automatch "normal" tampoco encuentra nada nuevo. Prueba con Names in other catalogs. --Gerwoman (talk) 11:47, 22 March 2020 (UTC)
- No ha encontrado nada. El "automatch from other catalogs" unos 40 nuevos. --Gerwoman (talk) 11:26, 22 March 2020 (UTC)
Encyklopedia Solidarności
editHi,
I started syncing the catalog of Encyklopedia Solidarności with its new property but it's adding wrong characters instead of the Polish ones and the identifiers are not working. Could you check it please? Thanks! --Adam Harangozó (talk) 15:23, 8 April 2020 (UTC)
- Hi Adam Harangozó, I've created a new catalog https://tools.wmflabs.org/mix-n-match/#/catalog/3490. I think it's ok now. --Gerwoman (talk) 16:21, 8 April 2020 (UTC)
- Thank you! --Adam Harangozó (talk) 16:47, 8 April 2020 (UTC)
Brno
editHi, there is already a catalog for the Encyclopedia of Brno: 3048. But there are other Czech encyclopedias in the list. :) --Adam Harangozó (talk) 18:59, 9 April 2020 (UTC)
- I feel that I am beginning to repeat myself... --Gerwoman (talk) 08:54, 10 April 2020 (UTC)
MxM for article items
editHi Gerwoman,
As you are an active user of Mix'n'Match, I was wondering what you think of the approach outlined at Help:Add_main_subject_with_Mix-n-Match, including Wikidata:Property_proposal/MxM_xref.
The idea is to be able to match up items for entries of biographical dictionaries (and other publications with items), e.g. the ones at Help:Import_NBD_from_enwikisource/lists/all_entries. I actually tried it in the mentioned catalogue and it went fairly well.
To move ahead with others, Wikidata:Property_proposal/MxM_xref would be most helpful. --- Jura 11:19, 13 April 2020 (UTC)
Connecting mix'n'match catalog to property
editCould you connect catalog 3200 to Fandango person ID (P8125)? Thanks! Trivialist (talk) 22:55, 24 April 2020 (UTC)
- Done --Gerwoman (talk) 08:31, 25 April 2020 (UTC)
Check BVPB
editHi! I write you because today @Jahl de Vautban: reported me that in many cases the BVPB authority ID (P4802) I added using Mix'n'match where in fact inexistent. Could you have a quick look at BVPB catalog and eventually reimport it if necessary, in order to clean such entries? Thank you very much as always! Bye, --Epìdosis 15:56, 26 April 2020 (UTC)
- Yes Epì, this was discused here. I've unlinked the property from the catalog and the catalog from the property. And changed the name of the catalog to avoid confusion. Thank you for your advice. --Gerwoman (talk) 16:42, 26 April 2020 (UTC)
Re-scrape P7993
editHi! When you have time, could you re-scrape Treccani's Dizionario di Filosofia ID (P7993)? I'm sure my scraper missed some entries. Thank you very much! – The preceding unsigned comment was added by Epìdosis (talk • contribs).
- Thank you very much, I've matched the entries I forgot in the previous catalog and I've requested Magnus to delete that old catalog. Bye, --Epìdosis 11:08, 30 May 2020 (UTC)
Connect catalog
edithttps://mix-n-match.toolforge.org/?#/catalog/3581 > Liszt Academy Lexikon person ID (P8281). Thank you as always, --Epìdosis 07:16, 4 June 2020 (UTC)
- Done --Gerwoman (talk) 15:42, 4 June 2020 (UTC)
New Mix'n'Match catalouges
editGood afternoon! Do you have time to create Mix'n'Match catalogues for both ISCO-88 occupation class (P952) and ISCO-08 occupation class (P8283) based upon https://www.ilo.org/public/english/bureau/stat/isco/isco08/index.htm ? Breg Pmt (talk) 14:50, 4 June 2020 (UTC)
- Hi Pmt, there is already a catalog https://mix-n-match.toolforge.org/#/catalog/148 Not sure if it's correct. --Gerwoman (talk) 15:44, 4 June 2020 (UTC)
Hello and thank you for your answer. Indeed not your problem, but I can not see that it is functioning. It uses ISCO-88 occupation class (P952) but the name says International Standard Classification of Occupations 08 So that must be checked and corrected by the creator of the catalouge maybe. In addition I can see that User:Magnus Manske have made some redirections for the URL (?). But anyway today the ISCO-08 occupation class (P8283) have been created, so if possible can you create a Mix'n'Match for that one? Breg Pmt (talk) 19:05, 4 June 2020 (UTC)
Worlds Without End author ID (P8287) mix'n'match linking
editHi, could you link Worlds Without End author ID (P8287) and catalog 2934? Thanks! Trivialist (talk) 22:06, 4 June 2020 (UTC)
- Done --Gerwoman (talk) 15:44, 5 June 2020 (UTC)
HDI 2019
editGood morning! Is it possible to have a Mix'n'Match for the HDI 2019 created based upon http://hdr.undp.org/en/content/human-development-index-hdi. An exel spreadsheet is available. Breg Pmt (talk) 09:59, 25 June 2020 (UTC)
- Hi Breg. It seems that the last available data is for 2018. I don't know if this is what you need: https://mix-n-match.toolforge.org/#/catalog/3651
Best regards. --Gerwoman (talk) 15:51, 25 June 2020 (UTC)
- Thank you very much, exactly what was needed. I was I suppose confused by this sentence Download 2019 Human Development Data All Tables and Dashboards I should clearly be able to produce shuch catalouge my self, but again thank you for your help and assistance. B(est) reg(ards) ;) Pmt (talk) 16:05, 25 June 2020 (UTC)
PHI Latin Texts work ID (P8311)
editHi! When you have time, could you please set up the Mix'n'Match catalogue for the PHI Latin Texts work ID (P8311)? i would like to work on it during the next two months. Thanks!--Alexmar983 (talk) 20:27, 14 July 2020 (UTC)
Catalog for Archivio Ricordi
editHi! I've seen you have just created a MnM catalog for Archivio Storico Ricordi person ID (P8290). I remember @Marco Chemello (WMIT): said he was working on it two months ago, so maybe he didn't succeed in creating it. Anyway, I've noticed that in your catalog there aren't birth/death dates, which can always be useful for matches - I would ask @Bargioni: to create a new catalog extracting also them, if no one wants to anticipate us. Bye! --Epìdosis 09:51, 3 August 2020 (UTC)
- @Epìdosis: We are already working to create a new Mix'n'Match catalog; it's not yet finished, but I hope we can publish it within a month. --Marco Chemello (WMIT) (talk) 10:07, 3 August 2020 (UTC)
- @Marco Chemello (WMIT): OK, thanks for feedback! So we will wait. --Epìdosis 10:13, 3 August 2020 (UTC)
CPhCl
editHi! I think that the formatter URL of https://mix-n-match.toolforge.org/#/catalog/3732 should be corrected in http://www.aristarchus.unige.net/CPhCl/it-IT/Database/CardExport?cardId=$id. Thanks, --Epìdosis 16:41, 5 August 2020 (UTC)
- Thank you Epìdosis, but I think that the URL is not working because the method should be a post, not a get. Try, for example, the cardId=5144 that should run for Zofia Abramowiczówna. --Gerwoman (talk) 16:53, 5 August 2020 (UTC)
- Yes, you are probably right. --Epìdosis 16:56, 5 August 2020 (UTC)
MnM 2861
editHello,
as you have been really helpful in a lot of MnM-related issues and I’m new to this, I decided to ask you: I want to MnM the “Frauen in Bewegung“ database of the Austrian National Library (and then apply for a Property). There is a MnM catalogue 2861, but the links are invalid. The people behind the project assured me that the new format will be permanent.
I managed to extract date from the homepage (585 persons). I now have a CSV file similar to this one:
Link | Name | BirthDate | BirthPlace | DeathDate | DeathPlace | JobTitles |
---|---|---|---|---|---|---|
https://fraueninbewegung.onb.ac.at/node/10 | Lucy Weissel | 1849-10-20 | 1920-03-06 | Wien | Vereinsfunktionärin | |
https://fraueninbewegung.onb.ac.at/node/11 | Helene Bondy | 1868-03-28 | Wien | 1954-08-30 | Wien | Erzieherin |
https://fraueninbewegung.onb.ac.at/node/12 | Cölestine Truxa | 1858-08-04 | Verona | ? | Zeitungsherausgeberin, Vereinsfunktionärin | |
https://fraueninbewegung.onb.ac.at/node/776 | Hermine von Friese | ? | ? | Vereinsfunktionärin | ||
… | … | … | … | … | … | … |
Can I import this to a MnM catalogue myself or do I need help? I’m unsure because of the birth/death data … Thank you in advance! --Emu (talk) 17:50, 11 August 2020 (UTC)
- Hi Emu. You can use the mix-n-match importer. Just put the birth/death data together in the 3rd column. --Gerwoman (talk) 18:25, 11 August 2020 (UTC)
- Thank you! I did it! --Emu (talk) 12:26, 12 August 2020 (UTC)
Solicitud de borrado relacionada con BNE Temas
editHola:
¿Puedes darme tu opinión acerca de la solicitud de borrado relacionada con mi aportación a BNE Temas en Mix'n'match? Me daría pena que se desperdiciase el trabajo pero tampoco estoy seguro de cuál debería ser la argumentación correcta. Gracias de antemano. Olea (talk) 20:34, 15 August 2020 (UTC)
- Hola Olea. No creo que eso tenga mucho futuro. En cambio, pienso que podrías proponer una propiedad, como identificador externo, para añadir a los items que ya existen en WD. Hay más de 1000 fully matched en el catálogo. Un saludo y gracias por tu trabajo. --Gerwoman (talk) 15:31, 17 August 2020 (UTC)
Spanish Artists from the Fourth to the Twentieth Century ID
editHi! Thanks so much for setting up the Mix'n'Match for the Spanish Artists from the Fourth to the Twentieth Century and attaching the newly created property.
I was wondering if you could update the name of the Mix'n'Match catalog (3777)? The Frick has other Mix'n'Match projects (2895 and 3556), so using the database's name itself for this particular catalog would help disambiguate it and be consistent with the others.
- Name: change "Frick" → "Spanish Artists from the Fourth to the Twentieth Century"
- Description: change "Spanish artists from the fourth to twentieth century: a critical dictionary" → "identifier from the Spanish Artists from the Fourth to the Twentieth Century: A Critical Dictionary"
And lastly, it looks like you scraped the live site for the 3777 Mix'n'Match catalog label and descriptions, is that right? I had been working on a TSV to import to Mix'n'Match from the dictionary's raw data export by tidying and parsing out specific fields such as birth and death date. Not sure if it would be helpful at this point to use that, but let me know if it would be and if you have any thoughts on how to format it/share it with you if you want to refresh the catalog.
Thanks again! --Infopetal (talk) 17:21, 1 September 2020 (UTC)
- Hi Infopetal. Name and description changed. And yes, I scraped the live site for labels and descriptions. As many items have the birth/death dates in the description, this is usually enough for using, for example, the visual tool to match entries. But, of course, if you did the job of exporting and parsing the row data, it could be better to create a new catalog. For the correct format you can contact Magnus Manske who can import the fields to the catalog and run an automatch by dates, or other automatching by auxiliary data. Thanks for your job. --Gerwoman (talk) 18:00, 1 September 2020 (UTC)
- Thank you, Gerwoman! As I've been working with the catalog in Mix'n'Match, I think the way you were able to parse the data works for now. Once the catalog is reconciled, I can go back and enhance reconciled items with OpenRefine using the raw data. --Infopetal (talk) 19:21, 4 September 2020 (UTC)
Hello. Unfortunately, this catalog doesn't work. And i can't edit it's title. Better is, for example, Virtual Shtetl - Biographies. Have a nice day! Matlin (talk) 18:42, 23 September 2020 (UTC)
- Hi Matlin. The name is now changed. Let's see if Magnus can do something for the scraping to work: https://www.wikidata.org/wiki/Topic:Vmsfxm2dj81rfwut Best regards. --Gerwoman (talk) 16:50, 24 September 2020 (UTC)
We sent you an e-mail
editHello Gerwoman,
Really sorry for the inconvenience. This is a gentle note to request that you check your email. We sent you a message titled "The Community Insights survey is coming!". If you have questions, email surveys@wikimedia.org.
You can see my explanation here.
MediaWiki message delivery (talk) 18:46, 25 September 2020 (UTC)
ALLCAPS SURNAMES
editHi Gerwoman,
Wikidata:Database_reports/Complex_constraint_violations/P8631 finds a few items with labels that capitalize surnames. The items seem to be created based on a mix'n'match catalog you set up: https://mix-n-match.toolforge.org/#/catalog/2850 and more will eventually be created. I suppose you are aware of Help:Label. If you need help with fixing the labels, please make a request at Wikidata:Bot requests. --- Jura 06:39, 2 October 2020 (UTC)
- Hi Jura, the items seem to be created by Reinheitsgebot, not me. Best regards. --Gerwoman (talk) 11:44, 2 October 2020 (UTC)
- I think you created the (malformed) catalog it's using. --- Jura 11:55, 2 October 2020 (UTC)
- Why malformed? The original has the same format, and M&M has no option to transform into lowercase. --Gerwoman (talk) 12:04, 2 October 2020 (UTC)
- For some others, Magnus transformed them (no idea how). Alternatively, it needs to be handled on import manually or after item creation in some other way. --- Jura 12:12, 2 October 2020 (UTC)
- Let's ask Magnus. --Gerwoman (talk) 12:14, 2 October 2020 (UTC)
- Ok, I let you try, but he seems quite busy elsewhere recently. In any case, I think it's unlikely that he will fix your data already added to Wikidata. How do you plan to go about it? --- Jura 19:37, 2 October 2020 (UTC)
- Fixed. Let's ask Amqui if he wants to create the rest. --Gerwoman (talk) 09:35, 3 October 2020 (UTC)
- Ok, I let you try, but he seems quite busy elsewhere recently. In any case, I think it's unlikely that he will fix your data already added to Wikidata. How do you plan to go about it? --- Jura 19:37, 2 October 2020 (UTC)
- Let's ask Magnus. --Gerwoman (talk) 12:14, 2 October 2020 (UTC)
- For some others, Magnus transformed them (no idea how). Alternatively, it needs to be handled on import manually or after item creation in some other way. --- Jura 12:12, 2 October 2020 (UTC)
- Why malformed? The original has the same format, and M&M has no option to transform into lowercase. --Gerwoman (talk) 12:04, 2 October 2020 (UTC)
- I think you created the (malformed) catalog it's using. --- Jura 11:55, 2 October 2020 (UTC)
Norwegian professors at University of Oslo
editGood afternoon! I came across a file listing all scientific personal (i.e.professors and docents) of University of Oslo containing lastname, firstname, academic degree year of creation and faculty from 1813 to 1984 This file exist boht as Exel, Plain text and JSON. As per definition all those persons should be notabel within wikipedia. Many of these persons already do have an item on Wikidata, but will be missing their faculty, and year of service and academic profession. The JSON is said to be even more detailed. So then, how can I have this dumped to Wikidata? There is no "public" identificator as far as I can see for each person. Breg Pmt (talk) 19:58, 12 October 2020 (UTC)
Emojipedia ID MnM
editCould you connect MnM 3392 to Emojipedia ID (P8711)? Much appreciated, as always. Trivialist (talk) 19:53, 25 October 2020 (UTC)
- Done --Gerwoman (talk) 18:14, 26 October 2020 (UTC)
Could you also link MnM 2430 to P8748? Thank you! Emu (talk) 18:46, 26 October 2020 (UTC)
- Done --Gerwoman (talk) 19:35, 26 October 2020 (UTC)
MnM catalogue 2644
editHello, I wonder if you could help me with a large catalogue #2644. A large number of IDs were recently imported into Wikidata independently of Mix'n'Match. How difficult would it be to get it in sync with Wikidata? I tried it but the database is probably too big... Thank you for your help on this, Vojtěch Dostál (talk) 06:39, 3 December 2020 (UTC)
- Excuse me for the intromission, but I think I can give a (negative) answer. @Vojtěch Dostál: Unfortunately it's a known problem, it's impossible to sync big catalogs (e.g. also NUKAT). You can find the problem listed in this summary topic. Anyway, I use this occasion to thank you for the great import you have done. Let's wait news from Magnus. --Epìdosis 11:21, 3 December 2020 (UTC)
- @Epìdosis: :-( but thanks for the answer and the praise Vojtěch Dostál (talk) 12:33, 3 December 2020 (UTC)
@Epìdosis, Gerwoman: I have one more question if you don't mind. Is there any way to import a new dataset, but already with automatic matches? I have some reconciliations from OpenRefine which are not import-ready and need to be verified item by item. Vojtěch Dostál (talk) 19:25, 8 December 2020 (UTC)
- @Vojtěch Dostál: Unfortunately no way as of now, as far as I know. --Epìdosis 20:28, 8 December 2020 (UTC)
DeCS
editHola, Gerwoman, estuve trasteando con los diputados de España y he visto que controlas bastante de Mix'n'match, ojalá puedas materializar esta idea que tengo desde hace un tiempo. ¿Crees posible crear un catálogo para los elementos del tesauro Health Sciences Descriptors (Q5690673)? La url es así: https://decs.bvsalud.org/ths/resource/?id=$1 (https://decs.bvsalud.org/es/ths/resource/?id=$1 si lo quieres en español) y dicen tener 34118 descriptores y calificadores. Quizá pudiera crearse alguna propiedad similar a las de Medical Subject Headings (Q199897), con el que comparte bastantes cosas. Saludos. -sasha- (talk) 20:55, 8 January 2021 (UTC)
- Hola -sasha-. He hecho un par de intentos, 4099 y 4100, pero el número de resultados en las búsquedas está limitado. No sé si te vale para empezar. Un saludo. --Gerwoman (talk) 17:53, 13 January 2021 (UTC)
- Sí, muchísimas gracias, ahí hay trabajo para bastante tiempo, una lástima que no hayas podido exportar el catálogo completo. Estoy intentando aprender sobre las herramientas semiautomáticas porque editarlo todo a mano es muy cansado, a ver qué tal... Saludos. -sasha- (talk) 19:29, 13 January 2021 (UTC)
Another link
editHey, since you helped me last time by adding a link to Silesian Wikipedia article about animals, could you now add another one (also from "szl"), this time to Q36732? The article's name is Krōlestwo (biologijŏ). Also, is there any way for me to add links to protected pages myself or do I need some special rights? --Psiŏczek (talk) 17:38, 12 January 2021 (UTC)
- I think you need to reach 100 editions in wikidata to be allowed. Regards.
- Done --Gerwoman (talk) 17:50, 13 January 2021 (UTC)
MnM
editCould you deactivate 3970? At first it didn’t work, then a working scraper was added, but it only scrapes 500 irrelevant items … thank you!--Emu (talk) 21:49, 23 January 2021 (UTC)
- Done --Gerwoman (talk) 09:24, 24 January 2021 (UTC)
Could you also rename 4143? I managed to get fetch biographies with a scraper. I would suggest:
- Wien Geschichte Wiki – Biographien
- Vienna History Wiki – biographies
Thank you! --Emu (talk) 11:42, 8 February 2021 (UTC)
- Done Good job! --Gerwoman (talk) 18:22, 8 February 2021 (UTC)
P9256 - Diccionari de la traducció catalana
editHi! Diccionari de la traducció catalana ID (P9256) has just been created and I have imported MnM matches through QS. But I've found a little problem: IDs on MnM have the final ".html" which I have excluded from my regex because it is always present. Could you rescrape the catalog omitting the final ".html" from the IDs? Thank you very much as always, --Epìdosis 08:55, 5 March 2021 (UTC)
- Done new catalog 4257. --Gerwoman (talk) 17:38, 5 March 2021 (UTC)
Herder Conceptos
editHi! I've just noticed there are some problems in the encode of non-ASCII characters (e.g. here); could you re-import this catalog with the correct encode, if possible? Thank you very much, --Epìdosis 09:47, 11 March 2021 (UTC)
- Sorry Epì, I've tried to import it twice, one with UTF-8 encode and one without it, and the result seems the same. --Gerwoman (talk) 18:23, 11 March 2021 (UTC)
DeCS (bis)
editHola, ya se creó una nueva propiedad para los catálogos que pedí que crearas. ¿Podrías enlazar 4099 y 4100 a DeCS ID (P9272)? Lo intenté yo mismo y no pude, ¿a lo mejor solo puede hacerlo quien lo creó? Gracias de antemano. -sasha- (talk) 22:22, 16 March 2021 (UTC)
- Done Solo algunos "privilegiados" podemos hacer esas cosas. --Gerwoman (talk) 18:11, 17 March 2021 (UTC)
Thanks for the MnM catalog for the Dictionary of Archives Terminology!
editI happened across it, and enjoyed doing various matching. I noticed it doesn't have an item (or property) yet. Are you interested in shepherding that thru? JesseW (talk) 19:05, 20 March 2021 (UTC)
- JesseW, If you propose this property, I will support it. --Gerwoman (talk) 11:22, 4 April 2021 (UTC)
- OK, I'll try and get to it (eventually). :-) BTW, I haven't been able to get the MnM scraper functionality to work -- have you had better luck? JesseW (talk) 17:33, 4 April 2021 (UTC)
Scrape encyclopedias
editHi! Could you try scraping YIVO Encyclopedia of Jews in Eastern Europe ID (P8569) and Polignosi ID (P9407)? Thank you very much as always, --Epìdosis 17:59, 3 April 2021 (UTC) Done Your requests are always a challenge! Buona Pasqua. --Gerwoman (talk) 11:19, 4 April 2021 (UTC)
- Thank you! Excuse me for disturbing you again: I see that unfortunately Polski Słownik Judaistyczny ID (P8759) has renewed its website, now it has numeric IDs and all old IDs don't work anymore. Could you rescrape the catalog? I will at the same time delete the old IDs from Wikidata. Thank you very much, --Epìdosis 23:02, 4 April 2021 (UTC)
-
- I've also noticed that Visuotinė lietuvių enciklopedija ID (P7666) hasn't been scraped yet. Could you have a look? No haste at all, of course. Thanks! --Epìdosis 17:32, 14 April 2021 (UTC)
- Done 4376 --Gerwoman (talk) 16:06, 16 April 2021 (UTC)
Akadem
editHello,
I saw you created a Mix'n'match catalog so hope you can help me with my project. I would like to create a new catalog to be associated with Akadem person ID (former scheme) (P5378). The list of URLs to capture is found on https://akadem.org/index_list.php?tb=1. Would you be able to give me some insight on how to achieve this? Regards Moumou82 (talk) 11:39, 4 April 2021 (UTC)
- Unfortunately the scraping tool is not working. This would be the easy way to create the catalog. I've created manually a file and uploaded to the catalog 4352. --Gerwoman (talk) 17:34, 4 April 2021 (UTC)
- Wonderful, thank you! Moumou82 (talk) 12:57, 5 April 2021 (UTC)
Catálogo eDBA 2 en Mix'n'Match
editHola:
Estoy observando que en la importación del Biographical Dictionary of Almería (Q100743334) (catálogo 2765) están faltando entradas disponibles en el original. El caso es que no estoy familiarizado con esta parte de Mix'n'Match y no sé si debo crear un catálogo nuevo (espero que no) o si yo mismo puedo forzar la actualización, que tampoco sé si debo preparar un listado csv actualizado que subir. ¿Puedes darme alguna indicación. Estoy particularmente interesado en mantener al día este catálogo, aunque tampoco creo que esté hirviendo de actividad. Chasgracias. Olea (talk) 14:10, 14 April 2021 (UTC)
- Hola Olea. En este caso me parecía más rápido hacerlo que explicarlo. Faltaban del 732 al 783 o así. He lanzado el "scraper" con ese rango y el mismo número de catálogo. Si echas en falta alguno más me dices. Saludos. --Gerwoman (talk) 17:33, 14 April 2021 (UTC)
- Gracias, majo :-)
- (de todas formas tengo claro que es cuestión de tiempo que me empape de la herramienta :-) ) Olea (talk) 17:50, 14 April 2021 (UTC)
Norsk biografisk leksikon completeness
editI just created an item Arne Jensen (Q106622485) which includes the statement: Arne Jensen (Q106622485)Norsk biografisk leksikon ID (P5080)Arne_Jensen. However, I noticed that the Mix'n'match catalog you created at https://mix-n-match.toolforge.org/#/catalog/1187 reports that it is 100% complete. Since Norsk biografisk leksikon ID (P5080)property constraint (P2302)distinct-values constraint (Q21502410) and I'm not seeing a constraint violation warning, is the Norsk biografisk leksikon collection actually complete? Might new articles have been added since you created the catalog? Do you have any other ideas what is happening here?
OpenWeatherMap city ID
editHi! It is possible to remove the entry type from catalog 4489? I think it's doing more harm than good. NMaia (talk) 02:04, 30 May 2021 (UTC)
- Hi NMaia. I'm not sure to understand what is the problem. Anyway, I cannot change the entry type. Please ask in User_talk:Magnus_Manske if anyone can help. --Gerwoman (talk) 09:10, 30 May 2021 (UTC)
- Hi! Thanks for replying. Actually, it seems it all worked out in the end, so no worries. NMaia (talk) 09:40, 30 May 2021 (UTC)
EAS Fellows on mix n match
editHi, Gerwoman. Thanks very much for all the great stuff you've added to mixm match. It looks to me as though the eas-et.org links at EAS Fellows are broken - e.g. that for Abraham Aseffa (Q42431793) links through to Gunnar Aksel. I don't know if the EAS site links have changed since you imported, or where there was an error in import. Either way, I think we've added some confused data to Wikidata. Dsp13 (talk) 17:07, 13 August 2021 (UTC)
Mix'n'Match problems?
editHello, is Mix'n'Match importer working for you? I tried to import CSV files several times, using different formats, and even attempted to import from a Google Spreadsheet, but I'm stuck at "Test is running..." How long should that run? Longer than overnight? :) Thanks for help, Vojtěch Dostál (talk) 20:55, 8 November 2021 (UTC)
- @Vojtěch Dostál: I noticed the problem 6 days ago; in fact, it is presently impossible to upload new catalogs or to edit existing catalogs through the CSV/TSV interface. I have reported it to @Magnus Manske:, I will retry. --Epìdosis 23:06, 8 November 2021 (UTC)
- @Epìdosis Thanks! Good to know that the problem is not on my side. Vojtěch Dostál (talk) 07:19, 9 November 2021 (UTC)
- Not working for me either. Thanks for reporting. --Gerwoman (talk) 09:03, 9 November 2021 (UTC)
- @Epìdosis Thanks! Good to know that the problem is not on my side. Vojtěch Dostál (talk) 07:19, 9 November 2021 (UTC)
Should be fixed now. --Magnus Manske (talk) 09:49, 11 November 2021 (UTC)
MmM 4433: biografiA ID (P10085)
editHi, could you link MmM 4433 to biografiA ID (P10085)? Thank you! --Emu (talk) 12:13, 22 November 2021 (UTC)
- Done Emu You can manually sync the catalog if you want. --Gerwoman (talk) 17:48, 22 November 2021 (UTC)
Hola, he visto que en la web de la Leopoldina han cambiado los identificadores y ahora todos los enlaces están rotos (por ejemplo, ahora https://www.leopoldina.org/mitglieder/mitgliederverzeichnis/mitglieder/member/Member/show/7055 es https://www.leopoldina.org/mitgliederverzeichnis/mitglieder/member/Member/show/rudolf-virchow/). ¿Conoces alguna forma de arreglarlo sin mucho esfuerzo, quizá aprovechando el catálogo de Mix'n'match? Un saludo. -sasha- (talk) 22:34, 14 January 2022 (UTC)
- No se me ocurre nada más que empezar de cero: borrar la propiedad en todos los items, crear un nuevo catálogo, modificar la propiedad... Aunque ahora, como todos tienen member of (P463): Q543804 supongo que será más rápido el cotejo. Quizá a User:Magnus Manske se le ocurre algo mejor. Gerwoman (talk) 11:43, 15 January 2022 (UTC)
WikiProject Nonprofit Organizations
editHello Gerwoman, I created a new WikiProject Nonprofit Organizations. If that's something interesting for you, I would be happy if you join the project. Best --Newt713 (talk) 20:59, 19 February 2022 (UTC)
Africultures
editHello,
You have been helping me with the creation of a Mix'n'match catalog last year so hope you can help me with my new project. I would like to create a new catalog (or two new catalogs probably) to be associated with Africultures movie ID (P4513) and Africultures person ID (P4514). The lists of URLs to capture is found on http://www.spla.pro/liste.film.html and http://www.spla.pro/liste.personne.html. Is there a way catalogs can be extracted from this? Moumou82 (talk) 23:08, 9 March 2022 (UTC)
- Done Moumou82: 5084 and 5083 --Gerwoman (talk) 18:09, 11 March 2022 (UTC)
- This is great, thank you so much! Moumou82 (talk) 20:47, 11 March 2022 (UTC)
Help replacing Category 3092 to convert from string to numeric ID for P2190
editHi, I've noticed you are familiar with Mix'n'match, and I believe the scraping configuration. I'm struggling to follow the documentation which clarification on a configuration with more examples would help, but I'm reaching out to see if you would be interested and have time to configure a replacement for catalog 3092 as the C-SPAN person ID (P2190) ID format has changed from string to numeric for more reliability, but the catalog inserts strings to Wikidata.
I've already uploaded new numeric IDs where matches were available to strings prior to 26 Feb 2022, but to prevent more strings being added the mix'n'match catalog would need to be replaced. Category 3092 only contains 9563 entries while C-SPAN persons show 154,909 and could be created from https://www.c-span.org/search/?searchtype=People&sort=Alphabetical
I'm coordinating the string to numeric transition and am working on cleaning up EN Wikipedia's use of the C-SPAN Template 11,576 have 11,015 strings needing converted to numeric format before full enforcement of numeric IDs there.
A complementary catalog which does not exist is C-SPAN organizations C-SPAN organization ID (P4725) with 90,261 entries which would be a nice addition could be created from https://www.c-span.org/search/?searchtype=Organizations&sort=Alphabetical
Could you help with the replacement Mix'n'match category? If you feel so inclined, also create the organization category? Thank you for your consideration. P.S. I also mentioned to Magnus Manske Wolfgang8741 (talk) 13:09, 16 March 2022 (UTC)
- At the moment I've unlinked P2190 in Catalog3092 to avoid new inclusions of strings ID's in Wikidata. I'm sorry but I don't know how to create these new catalogs you proposed. Gerwoman (talk) 18:11, 16 March 2022 (UTC)
- Thanks for unlinking the property. I believe the scrape could be done by stepping through the following urls which display 100 items per page.
- The number of times needing to step can be calculated by the number of records listed.
- For people (P2190) 154928/100=1550 steps starting at page 1 with last page being 1550 due to rounding up. URL: https://www.c-span.org/search/?searchtype=People&sort=Alphabetical&show100=&searchtype=People&sort=Alphabetical&ajax&page=1
- regex for the name and id would probably be - <a href='\/\/www\.c-span\.org\/person\/\?(\d+)\/([a-zA-Z0-9]*)'>(.+?)<\/span><\/a>
- For organizations (P4725) 90270/100=903 start on page 1 and end on page 903 rounding up. URL: https://www.c-span.org/search/?searchtype=People&sort=Alphabetical&show100=&searchtype=Organizations&sort=Alphabetical&ajax&page=1
- For people (P2190) 154928/100=1550 steps starting at page 1 with last page being 1550 due to rounding up. URL: https://www.c-span.org/search/?searchtype=People&sort=Alphabetical&show100=&searchtype=People&sort=Alphabetical&ajax&page=1
- When I run I keep getting after the first attempt which runs successfully and showed 100 matches from the retrieved result as expected, then second test results in: "ERROR: Test has failed, for reasons unknown. Could be the server to be scraped is too slow?" which is not descriptive to know if the server load is too high or if there is an issue with the configuration, but I don't see a way to save the scraper config for handoff or trying later.
- Either way, thanks for unlinking. Wolfgang8741 (talk) 23:37, 17 March 2022 (UTC)
MnM Linking
editHi, could you link
- MnM 4897 to Salzburger Literatur Netz ID (P10618) and
- MnM 4898 to Literatur Netz Oberösterreich ID (P10620)?
Thank you! --Emu (talk) 12:08, 15 April 2022 (UTC)
- Done Gerwoman (talk) 12:19, 15 April 2022 (UTC)
- And also 5145 to Musik und Gender im Internet ID (P10670)? Thanks! --Emu (talk) 12:38, 21 April 2022 (UTC)
Jeu de Paume in Mix'n'Match
editHi Gerwoman, do you think you can add Database of Art Objects at the Jeu de Paume ID (P10750) to Mix'n'Match? A search for * seems to return all entries. I'm planning to import the paintings at some point and would be nice to not have too many duplicates. Multichill (talk) 12:04, 15 May 2022 (UTC)
- Done 5229 --Gerwoman (talk) 11:16, 28 May 2022 (UTC)
- Thanks! Multichill (talk) 13:44, 28 May 2022 (UTC)
- Looks like you missed some? The mix'n'match catalog has 26145 entries. When I query the database for all records using https://www.errproject.org/jeudepaume/card_show.php?Query=%2A&StartDoc=1 , I get 41347 results. You can just iterate over this url's to https://www.errproject.org/jeudepaume/card_show.php?Query=%2A&StartDoc=41347 . Each url will redirect to a card.
- I'm currently importing a lot of missing paintings with MCCP ID (P10760). I'm also adding missing Database of Art Objects at the Jeu de Paume ID (P10750) links. Multichill (talk) 12:55, 4 June 2022 (UTC)
- Hi Multichill. I don't know how obtain the CardId from the StarDoc number, using M&M. So I iterated by CardId until 60K or so. And then grep the Artist, Title and Description from every page. That's why there is a difference. If you know a better method please explain or try to do it yourself. Best. Gerwoman (talk) 14:34, 4 June 2022 (UTC)
- In the robot I just follow the redirect, but I have no clue how to do that in Mix'n'Match.
- Maybe just rerun it for the higher numbers? Based on https://w.wiki/5EaT , 76526 is currently the highest number. Probably wait until I complete this part of the import to see how much more it increases? Multichill (talk) 14:57, 4 June 2022 (UTC)
- Thanks! Multichill (talk) 13:44, 28 May 2022 (UTC)
John Flower, and John E. Flower
editHi, @Gerwoman:, I may have done something wrong: this British professor [2] had is Q item overtaken by a cricketer Q6233497 of the same name [3]. I cleaned up the cricketer entry and created a new one for the academic : John E. Flower (Q113230953) - Could you please check if it is going to be OK (Viaf is still confused). Thanks DDupard (talk) 15:40, 23 July 2022 (UTC)
- Hi User:DDupard, I think that "our" WD is OK now. In VIAF they usually make corrections from time to time, so don't worry. Gerwoman (talk) 16:44, 23 July 2022 (UTC)
- Thanks a mil Gerwoman, I am so happy now to know it's all right.--DDupard (talk) 18:08, 23 July 2022 (UTC)
Hi, I noticed that the links in MnM 5349 do not work as there seems to be a 256 char limit on URLs. Could you try to fix this? The formatter URL should be https://www.dgkj.de/die-gesellschaft/geschichte/juedische-kinderaerztinnen-und-aerzte-1933-1945/suchergebnis-der-datenbank?tx_dgkjpaediatristsnsera_searchpaediastrists%5Baction%5D=show&tx_dgkjpaediatristsnsera_searchpaediastrists%5Bcontroller%5D=PaediatristNSEra&tx_dgkjpaediatristsnsera_searchpaediastrists%5BpaediatristNSEra%5D=$1
. Thanks! --Emu (talk) 05:30, 4 August 2022 (UTC)
- Sorry Emu, I don't know how to change that. --Gerwoman (talk) 18:40, 4 August 2022 (UTC)
- No problem, thanks anyway! --Emu (talk) 19:27, 4 August 2022 (UTC)
- I managed to import it again as I found a way to shorten the URL, could you deactivate MnM 5349? Thanks! --Emu (talk) 20:10, 4 August 2022 (UTC)
- No problem, thanks anyway! --Emu (talk) 19:27, 4 August 2022 (UTC)
Linking MnM to property
editCould you like Jewish Pediatricians 1933-1945 ID (P10958) to 5354 and deactivate 5349? Thank you! --Emu (talk) 11:06, 21 August 2022 (UTC)
- Done by someone else.
Spam pages in womenwhoknowhistory
editHi there, I noticed that MixnMatch for womenwhoknowhistory contains a huge number of spam pages (in fact, most items are spam from pages 60 to 273, i.e. over 10K spam items). Most of these should be easy to autoidentify and exclude because they contain urls, which the genuine names & descriptions don't. The website itself has how blanked these pages, so presumably the web scraping has picked them up before they were reviewed and removed. Is there any way to clean these up? Dsp13 (talk) 02:12, 30 September 2022 (UTC)
Africultures (2)
editHello,
You have been helping me with the creation of two Mix'n'match catalogs last year so hope you can help me with my new project. I would like to create a new catalog to be associated with Africultures structure ID (P11462). The lists of URLs to capture is found on http://www.spla.pro/list.organization.html. Is there a way a catalog can be extracted from this? Moumou82 (talk) 10:13, 5 January 2023 (UTC)
Property:P9051
editHi, could you please import this CCD cultural heritage ID catalogue, linked to this Property:P9051, on mix n match? I think that in en.wiki, but not only in it.wiki, it is very convenient and enhances relevance to have the link of the entry directly on the site among the external links. Thank you in advance for your attention. Threecharlie (talk) 10:54, 2 May 2023 (UTC)
TMDB Mix-N-Match
editHi, It looks like you were the one to import the IDs to Mix-N-Match for TMDB Movie, TV series, and person. Would you be interested to update these with the ID dumps from https://developer.themoviedb.org/docs/daily-id-exports? (note there are two ID dumps to get all the IDs split by adult and non adult content) There is an discussion thread at TMDB of how to improve the TMDB and Wikidata ID integration process at https://www.themoviedb.org/talk/5e639fe63e01ea0015e99824 if you're interested in that too. Wolfgang8741 (talk) 14:02, 11 July 2023 (UTC)
Conor
editHi. Any chance you can recreate [4] and [5] as autoscrape stopped working some years ago after website changed? Sporti (talk) 08:54, 20 March 2024 (UTC)
Clavis Clavium
editHi Gerwoman, I was wondering if you could take the time to create a Mix'n'Match catalogue for the Clavis Clavium ID (P7908). I tried setting it up with a scraper, but I could not understand the instructions for the Levels.
Formatter URL and regex format of IDs can be found in the property.
Herzliche Grüße, Jonathan Groß (talk) 13:40, 6 May 2024 (UTC)
Q14012761
editHola! Cómo va todo?
Voy a eliminar de Q14012761 la distinción Q28861961. Debieron dársela a otra persona con el mismo nombre, porque en 1973 esta persona aún no tendría méritos. No logro referencias para su fecha de nacimiento, pero en las fotos de las entrevistas parece nacido en la década de 1960. Un saludo! --Estevoaei (talk) 07:01, 6 August 2024 (UTC)
Update Mix'n'match Lobbyregister
editMoin Gerwoman, magst du das Mix'n'match zum Lobbyregister noch einmal updaten? Oder kann ich das auch? Dann müsste ich kurz wissen, wie die ursprüngliche CSV aufgebaut ist. Schöne Grüße NGOgo (talk) 15:27, 9 October 2024 (UTC)