Commons talk:Structured data/Modeling/Works without Wikidata item

From Wikimedia Commons, the free media repository
Jump to navigation Jump to search

Trying to reach consensus about this data modeling proposal

[edit]

Because of a recent discussion in the SDC Telegram group, I am taking the initiative to try to come to a consensus about this!

It would be absolutely great if we can agree on this, because this specific situation occurs so frequently. It would really help (batch) upload tools and would help many batch uploaders around the world.

Pinging a few people who will have opinions - @Multichill, @Jarekt, @Schlurcher who edited the page, and also @Seav who came up with a proposal in the Telegram group.

I have created a proposal in this file, where there is no Wikidata item yet: https://commons.wikimedia.org/w/index.php?title=File:Marlborough-duke-first.jpg&oldid=906350755

@Seav proposed a different approach, see for instance this file, and here's a WCQS query to see all examples of his proposed data modeling.

My own key recommendation would be to do the following: describe the properties of the artwork and the file together in the SDC and distinguish them with an agreed-upon uniform qualifier.

My thoughts:

  • It would be great if @Jarekt could update the {{Artwork}} template to use SDC to support the data model we eventually agree upon. This template allows the creation of a Wikidata item, which is awesome, and I can imagine it would be great if it can already take some SDC to help create that Wikidata item. This is why I proposed this approach with qualifiers: the file on Commons is described with the same properties as it would have as an artwork on Wikidata, but with one single uniform qualifier to designate that that data belongs to the artwork, not the file. The template could then "grab" these and move these to Wikidata.
    • This means I (sympathetically) disagree with this edit by @Schlurcher. I argue to not use different properties on Wikimedia Commons between the work and the file, but to follow the Wikidata data modeling conventions for artworks. This is to make sure that artwork items on Wikidata created from Wikimedia Commons follow the same data modeling guidelines as on Wikidata, and do not do it differently.
  • Which qualifier to use though? I have heard @Multichill say informally that he has doubts about applies to part, aspect, or form (P518) and would prefer something like object of statement has role (P3831) (?). I have no very strong opinion on this, but I have observed that many people find the Subject/Object distinction very abstract and confusing (including myself!).

Thanks all! Spinster (talk) 07:55, 4 August 2024 (UTC)[reply]

@Spinster, thanks for starting this topic. My main problem with your proposal of using standard distinguishers to separate statements about the file and about the object/artwork is that this breaks down if there are multiple depicted objects in the file. You no longer have any feasible way to distinguish which statement applies to which analog work (Q112134971). To give a concrete example, see the following file for which I have added a SDC statement for each of the three currently identified objects (a person, a plaque, and a statue) in the photo: File:Jose Rizal statue and historical marker (Campbelltown, NSW).jpg. As you can see I can add qualifiers to each depicts statement to describe each object separately without issue. (For now, ignore the fact that the statue is probably notable enough to have its own Wikidata item. Let's assume that we're talking about the situation where some depicted objects still don't have Wikidata yet but we want to describe them via SDC now.)
Some other photos that I found where others have gone a similar route:
seav (talk) 09:41, 4 August 2024 (UTC)[reply]
@Spinster: Yes, thank you for trying to nail down some standards. Years ago I created several subpages of Commons:Structured data/Modeling page with some standards I observed and which I was supporting in Module:Information and {{Information}}. Module:Artwork which supports {{Artwork}}, {{Book}} and {{Photograph}} currently relies much less on SDC, mostly due to inconsistent standards and whatever standard we come up with should make sense in context of all those infoboxes. To me the biggest issue with SDC is lack of standard for cases when a file is suppose to pull all the metadata from Wikidata. digital representation of (P6243) when it was proposed was supposed to cover 2D artworks, 3D artwork, movies, music, etc. however when it was approved the scope changed to 2D artwork only. I tried again to have a way to connect 3D artworks, movies, music etc. to Wikidata with d:Wikidata:Property proposal/Infobox based on, but it was rejected. User:Multichill changed Module:Artwork to connect some of non-2D files with wikidata using main subject (P921), and it works fine for most cases but it is a huge source of files connected to incorrect wikidata items as main subject (P921) often is used to indicate main subject of the artwork and not the photo. I like your "applies to part: analog work" to clarify some of this confusion.
However, your question was about cases when there is no matching Wikidata item. In my opinion the best solution for files like File:Marlborough-duke-first.jpg is to create one and upload metadata there. I do not see a point placing metadata in SDC in case someone create Wikidata item so it can be moved to the right place. However, if for some reason it is easier to put it in SDC than I think "applies to part analog work" seems like a great approach to distinguish, for example inception (P571) date of the photo from the inception (P571) date of analog work (Q112134971). That seems quite good approach for photographs which are clearly associated with a single artwork. User:Seav's examples are all concentrating on SDC properties for images depicting multiple artworks and objects. Such images are often not well served by {{Artwork}} and should probably be better off using {{Information}}. The SDC metadata for those files seem perfectly reasonable.
I am ok with modifying Module:Artwork to support the data model we agree on, but SDC's data model for artworks, books, etc. should be the same as Wikidata's as long as we specify that given property "applies to analog part". I am a little weary of complicating the code which is already at ~3000 lines of lua and divided among multiple modules. An easier solution could be to just write a bot to move some of those properties from SDC to Wikidata, although I still think that putting them on Wikidata in a first place is a better solution. SDC is lacking many tools I use a lot when editing: Duplicate Item tool (Q108311191), Duplicate References (Q97500668), moveClaim (Q110793966) and many others. Without those tools I find SDC very cumbersome to work with. --Jarekt (talk) 03:29, 5 August 2024 (UTC)[reply]
Sitting att a pretty big round table at Wikimania discussing this now. This suggestions are:
/ André Costa (WMSE) (talk) 09:43, 10 August 2024 (UTC)[reply]

Use case(s) for not creating a Wikidata item (yet)

[edit]

@Lokal Profil mentioned in his talk page that it would be good to explain the use case where/why one would upload files of artworks without a Wikidata item. I approach it very pragmatically: for many uploaders, creating Wikidata items is an extra complexity that may pose a too high barrier. Wikidata items can also be created afterwards. Spinster (talk) 09:05, 10 August 2024 (UTC)[reply]