Commons talk:Structured data

SpBot archives all sections tagged with {{Section resolved|1=~~~~}} after 7 days.

Maintenance category broken?

Quality images missing SDC Commons quality assessment currently has over 14,000 files in it. In a random sample of 15 files from this category, however, I found that 9 already include the statement. Clearing the category's cache didn't help, which makes the category not really helpful any more. Any ideas what could cause this problem?

@Mike Peel, @Dschwen: could QICbot possibly set Commons quality assessment (P6731) to Wikimedia Commons quality image (Q63348069) upon promotion and make this maintenance category obsolete? MB-one (talk) 10:34, 11 March 2024 (UTC)

This was being done by @Multichill's bot; I'm not sure why that stopped. I haven't figured out how to set SDC values using pywikibot or the API yet. It's on my to-do list, but I'm not sure when I'll get to it. Thanks. Mike Peel (talk) 11:27, 11 March 2024 (UTC)
@Mike Peel Maybe this helps you get started: User:SchlurcherBot/commonsapiaddclaimsoauth. Let me know if you need anything else. --Schlurcher (talk) 06:55, 23 May 2024 (UTC)
Indeed https://commons.wikimedia.org/w/index.php?sort=last_edit_desc&search=-haswbstatement%3AP6731+hastemplate%3AQualityImage+filetype%3Abitmap&title=Special%3ASearch&profile=advanced&fulltext=1&advancedSearch-current=%7B%22fields%22%3A%7B%22hastemplate%22%3A%5B%22QualityImage%22%5D%2C%22filetype%22%3A%22bitmap%22%7D%7D&ns6=1 shows far fewer results and no false positives so far. MB-one (talk) 18:46, 11 March 2024 (UTC)
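For anyone who wants to run the same check from a script, here is a minimal sketch of the equivalent request parameters for the MediaWiki search API (`action=query`, `list=search`). The search string is the one from the link above; the function name and the default limit are illustrative assumptions, and the snippet only builds the URL without sending any request.

```python
from urllib.parse import urlencode

API_URL = "https://commons.wikimedia.org/w/api.php"


def build_missing_qi_search(limit=50):
    """Build search API parameters for quality images lacking a P6731 statement.

    The function name and default limit are hypothetical choices for this sketch.
    """
    return {
        "action": "query",
        "list": "search",
        # Same search string as in the Special:Search link above.
        "srsearch": "-haswbstatement:P6731 hastemplate:QualityImage filetype:bitmap",
        "srnamespace": 6,  # File: namespace
        "srlimit": limit,
        "format": "json",
    }


params = build_missing_qi_search()
# Print the URL only; fetching and paging through results is left out of this sketch.
print(API_URL + "?" + urlencode(params))
```

The number of hits from this query should match the real backlog more closely than the category count does, since the search index does not depend on the template's categorisation being refreshed.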
@MB-one and Mike Peel: Jobs User:BotMultichillT used to run have been disabled by the Wikimedia Foundation, see phab:T319912. Didn't really have the motivation to fix it yet. Multichill (talk) 17:40, 15 March 2024 (UTC)
@Multichill: so do I understand correctly that you decided that with this change of technical environment, you are abandoning all your bot tools and have no interest in migrating them? Are you willing to work at all with some other user who might be interested in migrating them? I suspect there are a fair number of these someone would like not to lose. - Jmabel ! talk 18:40, 15 March 2024 (UTC)
I've had to convert several of my bot scripts over to the new system (e.g., setup script for the venv, then new job submission script, with the run script loading the venv), would be happy to help here if you need. Although, this is why I run most of my scripts on an environment I have full control over... Thanks. Mike Peel (talk) 18:59, 15 March 2024 (UTC)
Update: Multichill's bot BotMultichillT has thankfully started to tag images in Quality images. But Quality images missing SDC Commons quality assessment still lists about 5,000 files at any given time, although the actual number of files missing this tag hasn't exceeded around 150 in a month now. I wonder why. --MB-one (talk) 13:37, 22 May 2024 (UTC)
@MB-one: There seems to be something amiss due to Template:Quality image and its categorisation and the data it pulls from the WD ether. The metadata may need to be refreshed; however, it is not visible, and purging, viewing, or manipulating it is beyond the reach of ordinary users. -- billinghurst sDrewth 02:04, 23 May 2024 (UTC)
Based on my experience you have to perform a null edit, as a purge is not sufficient. I can do this early next week for the category. --Schlurcher (talk) 06:51, 23 May 2024 (UTC)
Yeah, I'm slowly recovering some of the bots. Category:Quality images missing SDC Commons quality assessment is empty now and looks like all quality images have the statement. Multichill (talk) 11:42, 24 May 2024 (UTC)
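On the question above of how to set SDC values through the API: the following is a rough sketch, not the code used by any of the bots mentioned here. It assumes the statement is written with the Wikibase `wbeditentity` action and that the file's MediaInfo ID is "M" followed by its page ID. The function name, the page ID 12345, and the token placeholder are hypothetical; the snippet only builds the request parameters and does not log in or send anything.

```python
import json


def build_quality_claim_request(page_id, csrf_token):
    """Build wbeditentity parameters that add P6731 = Q63348069 to one file.

    Hypothetical helper for illustration; authentication and the actual
    POST request are deliberately left out.
    """
    claim = {
        "mainsnak": {
            "snaktype": "value",
            "property": "P6731",  # Commons quality assessment
            "datavalue": {
                "type": "wikibase-entityid",
                "value": {
                    "entity-type": "item",
                    "numeric-id": 63348069,
                    "id": "Q63348069",  # Wikimedia Commons quality image
                },
            },
        },
        "type": "statement",
        "rank": "normal",
    }
    return {
        "action": "wbeditentity",
        "id": f"M{page_id}",  # MediaInfo ID derived from the page ID
        "data": json.dumps({"claims": [claim]}),
        "token": csrf_token,
        "format": "json",
    }


# 12345 is a made-up page ID; the token would come from a logged-in session.
request = build_quality_claim_request(12345, "CSRF-TOKEN-PLACEHOLDER")
print(request["id"])
```

A real bot would also want to check first that the file does not already carry the statement, to avoid creating duplicates.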

Tool to copy sdc from file to file

Is there one for problems like https://commons.wikimedia.org/w/index.php?title=Commons_talk:CropTool&oldid=867301107#Structured_Data ? We need a tool like this. RZuo (talk) 07:31, 12 April 2024 (UTC)

Maybe d:MediaWiki:Gadget-moveClaim.js could be adjusted to do this, @Matěj Suchánek, Melderick? --Marsupium (talk) 16:22, 15 April 2024 (UTC)

Aerial point of view

What would be the appropriate way to depict with structured data that a photograph was taken from an aerial point of view? Sdkbtalk 15:54, 16 April 2024 (UTC)

Commons_talk:Structured_data/Modeling/Location#Aerial_photographs has some discussion related to this. -- Zache (talk) 17:10, 16 April 2024 (UTC)
Thanks for the link! Let's centralize discussion there. Sdkbtalk 18:38, 16 April 2024 (UTC)

Proposal to add perceptual hashes to SDC

Just FYI, I made a Village pump proposal for adding pHash checksum (P9310) and Imagehash difference hash (P12563) values to Commons images. Currently, FinnaUploadBot has been adding these only to images uploaded from Finna.fi. --Zache (talk) 17:00, 17 May 2024 (UTC)
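For readers unfamiliar with perceptual hashes, here is a toy sketch of the idea behind a difference hash: one bit per pair of horizontally adjacent pixels, with similar images giving hashes that differ in only a few bits. It is a simplified illustration, not the exact algorithm or bit convention behind P12563. Real tools first shrink the image to a small grayscale grid (commonly 9x8), which is skipped here, and the pixel grids below are made-up values. The pHash in P9310 works differently and is not covered.

```python
def dhash_bits(gray_rows):
    """Return one bit per horizontally adjacent pixel pair in a grayscale grid.

    A bit is 1 when the right pixel is brighter than the left one
    (an assumed convention for this sketch).
    """
    bits = []
    for row in gray_rows:
        for left, right in zip(row, row[1:]):
            bits.append(1 if right > left else 0)
    return bits


def hamming_distance(bits_a, bits_b):
    """Count the positions where two equal-length hashes differ."""
    if len(bits_a) != len(bits_b):
        raise ValueError("hashes must have the same length")
    return sum(a != b for a, b in zip(bits_a, bits_b))


# Made-up 2x3 grayscale grids standing in for already-downscaled images.
original = [[10, 20, 15], [5, 5, 9]]
brighter = [[30, 40, 35], [25, 25, 29]]  # same picture, every pixel +20
different = [[20, 10, 15], [9, 5, 5]]

print(hamming_distance(dhash_bits(original), dhash_bits(brighter)))
print(hamming_distance(dhash_bits(original), dhash_bits(different)))
```

Because only the brightness gradient between neighbours is recorded, a uniform brightness change leaves the hash untouched, which is what makes such values useful for spotting near-duplicate uploads.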

So this is essentially a front-end to Wikidata?

Do I understand this correctly: every file on Commons is also a node (item?) on Wikidata, in the sense that it is assigned a QID, and thus you can start making statements about it, so the structured data tab is essentially a front-end to the Wikidata database? The main difference from editing statements on Wikidata appears to be that this interface does not show the subject QID anywhere! Or are you using other URIs as resource identifiers? (Being explicit about such technical details would have helped me understand what this was all about.)

Do you have any examples of statements relating one file to another? (I didn't encounter anything like that, browsing a few of the examples in the list of properties.) Practical application: relating different BSicons to each other. 88.129.117.158 14:59, 7 July 2024 (UTC)

No, each image does not have a QID. Each media item instead has an identifier consisting of "M" followed by its page ID, but as you say the interface mostly hides this. I don't think there's any way to relate files to one another directly; I've certainly never seen one. Notably, when Wikidata wants to refer to Commons files, it does it by name and not by using the "M" IDs. --bjh21 (talk) 15:31, 7 July 2024 (UTC)
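To make the identifier scheme concrete, a small sketch: given a file's page ID, the MediaInfo ID is "M" plus that number, and the entity's JSON can be looked up through Special:EntityData. The page ID 12345 is a made-up example and the function names are illustrative; the snippet builds the URL but does not fetch it.

```python
def media_info_id(page_id):
    """Return the MediaInfo entity ID for a Commons file page ID."""
    return f"M{page_id}"


def entity_data_url(page_id):
    """Return the Special:EntityData URL that serves the entity as JSON."""
    return (
        "https://commons.wikimedia.org/wiki/Special:EntityData/"
        f"{media_info_id(page_id)}.json"
    )


# 12345 is a hypothetical page ID used only for illustration.
print(media_info_id(12345))
print(entity_data_url(12345))
```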

Bot to remove duplicate statements

Hi folks, sometimes we end up with duplicate statements on files. This often happens when bots edit the same file twice. I set up a bot to clean that up about once a month (code). It goes through the latest dump at https://dumps.wikimedia.org/commonswiki/entities/ (the copy on Toolforge) and removes any duplicate statements. It hashes each statement for quick processing, so it doesn't (yet) catch cases like this one where the qualifiers are in a different order. Multichill (talk) 20:29, 7 July 2024 (UTC)