Materials for Preference-Based Offline Evaluation Tutorial at WSDM 2023

claclark/wsdm2023-tutorial

Preference-Based Offline Evaluation Tutorial at WSDM 2023

Charles Clarke, University of Waterloo
Fernando Diaz, Google, Montréal
Negar Arabzadeh, University of Waterloo

A core step in production model research and development involves the offline evaluation of a system before production deployment. Traditional offline evaluation of search, recommender, and other systems involves gathering item relevance labels from human editors. These labels can then be used to assess system performance using offline evaluation metrics. Unfortunately, this approach does not work when evaluating highly effective ranking systems, such as those emerging from the advances in machine learning. Recent work demonstrates that moving away from pointwise item and metric evaluation can be a more effective approach to the offline evaluation of systems. This tutorial, intended for both researchers and practitioners, reviews early work in preference-based evaluation and covers recent developments in detail.

Slides
