screening-stopping

Code to evaluate screening stopping algorithms applied to multiple rankings. It calls tar_eval once per ranking and averages the results.

Metrics used:

Recall
Cost (== proportion of documents examined)
Loss (== amount by which achieved recall is below target recall)
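
As a rough illustration of the three metrics for a single ranking, the sketch below follows the wording of the definitions above. The function name, its parameters and the example counts are hypothetical, and the loss formula (target recall minus achieved recall, floored at zero) is an assumption based on the description; the values reported by the script come from tar_eval and may be computed differently.

```python
def screening_metrics(num_examined, num_relevant_found, num_docs,
                      num_relevant, target_recall):
    """Return (recall, cost, loss) for one ranking.

    Hypothetical helper: formulas are assumed from the README wording,
    not taken from tar_eval.py.
    """
    # Recall: share of the relevant documents found before stopping
    recall = num_relevant_found / num_relevant
    # Cost: share of the whole collection that had to be examined
    cost = num_examined / num_docs
    # Loss: shortfall against the target recall (zero if the target was met)
    loss = max(0.0, target_recall - recall)
    return recall, cost, loss


# Made-up counts: stopped after 500 of 1000 documents, found 38 of 40 relevant
recall, cost, loss = screening_metrics(
    num_examined=500, num_relevant_found=38,
    num_docs=1000, num_relevant=40, target_recall=0.8)
print(f"Recall: {recall:.3f}  Cost: {cost:.3f}  Loss: {loss:.3f}")
```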

Results are averaged across runs (with standard deviation reported in brackets).
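
The averaging step can be sketched as follows. This is a minimal example, not the script's own code: the helper name and the per-run values are made up, and the use of the population standard deviation (statistics.pstdev) is an assumption, since the README does not say whether the population or sample form is reported.

```python
import statistics


def summarise(values):
    """Format the mean and standard deviation of one metric across runs.

    Hypothetical helper; pstdev (population form) is an assumption.
    """
    mean = statistics.mean(values)
    sd = statistics.pstdev(values)
    # Mean first, standard deviation in brackets, tab-separated
    return f"{mean:.3f}\t({sd:.3f})"


# Made-up recall values, one per run
recall_per_run = [1.0, 3.0]
print("Recall:", summarise(recall_per_run))
```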

Prerequisite

The script assumes that tar_eval.py (and its associated scripts) is available at ./tar-master/scripts/tar_eval.py

Download it from https://github.com/CLEF-TAR/tar and uncompress it in the same directory as eval_multiple_runs.py
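
To confirm the layout before running, a quick check along the following lines may help. It is only a sketch: the function name is hypothetical and the check is not part of eval_multiple_runs.py; the relative path is the one given above.

```python
from pathlib import Path


def tar_eval_available(base_dir="."):
    """Return True if tar_eval.py exists at the location the script expects.

    Hypothetical helper; base_dir should be the directory that holds
    eval_multiple_runs.py.
    """
    expected = Path(base_dir) / "tar-master" / "scripts" / "tar_eval.py"
    return expected.is_file()


if not tar_eval_available("."):
    print("tar_eval.py not found under ./tar-master/scripts/")
```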

Running script

Syntax:

  % python eval_multiple_runs.py -f DIR -t TARGET_RECALL -q QRELS_FILE [-v]

where

  DIR is the directory containing the runs
  TARGET_RECALL is the target recall provided to the stopping algorithm (required for loss metric)
  QRELS_FILE is the location of the qrels file
  -v flag produces verbose output (results printed for each topic)

  % python eval_multiple_runs.py -h

prints the command line help.

Example:

  % python eval_multiple_runs.py -f ./run_outputs/ -t 0.8 -q ./qrels.txt

Example output:

  Runs: 4
  Recall: 0.974	(0.002)
  Cost:   0.579	(0.005)
  Loss:   0.003	(0.001)
