[go: nahoru, domu]

Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Use batch mode for Wikidata reconciliation #6

Open
wetneb opened this issue Dec 3, 2020 · 3 comments
Open

Use batch mode for Wikidata reconciliation #6

wetneb opened this issue Dec 3, 2020 · 3 comments

Comments

@wetneb
Copy link
wetneb commented Dec 3, 2020

Thanks a lot for this!

I just wanted to note that the Wikidata reconciliation API that you query can be used in batch mode: you can supply multiple reconciliation queries in a single request. This should speed up the resolution of these queries.
https://reconciliation-api.github.io/specs/latest/#sending-reconciliation-queries-to-a-service

@shigapov
Copy link
Collaborator
shigapov commented Dec 4, 2020

Thank you, Antonin @wetneb! We could also optimise our SPARQL-requests to Wikidata using the query hints of the Blazegraph (https://github.com/blazegraph/database/wiki/QueryHints). But I do not know whether the Wikidata-community would be fine with that...

@wetneb
Copy link
Author
wetneb commented Dec 4, 2020

I don't think it would cause any harm to optimize queries that you are already running anyway, right?

@thadguidry
Copy link

@shigapov Also, you might also think about optimizations and subqueries that might involve analysis of https://www.wikidata.org/wiki/Property:P1963 and gaps or differences between objects and statements that include or do not include defined "properties for this type", where many are commonly referred to as disambiguting properties. Those are the closest thing to what we had in Freebase when it was operational https://web.archive.org/web/20151002083332/http://wiki.freebase.com/wiki/Disambiguation
But are only 1 signal, and where for any particular type, those properties could certainly even be derived with machine learning, and not only hard coded as they are with P1963 inside Wikidata. Food for thought :-)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants