ENH Added warning for `ndcg_score` when used w/ negative `y_true` values #23461

Micky774 · 2022-05-25T13:42:40Z

Reference Issues/PRs

Resolves #22710
Fixes #17639

What does this implement/fix? Explain your changes.

Adds warning for ndcg_score when used w/ negative y_true values

Any other comments?

deprecation warning for negative ndcg

…earn into fix/ndcg-score

…nto scikit-learn-main

sklearn/metrics/_ranking.py

doc/whats_new/v1.2.rst

thomasjpfan

I have not looked into the literature. Is there a valid use case for y_true < 0?

sklearn/metrics/_ranking.py

doc/whats_new/v1.2.rst

Micky774 · 2022-06-07T15:42:55Z

I have not looked into the literature. Is there a valid use case for y_true < 0?

There was a bit of discussion about this in the original issue #17639. I'm largely basing this off of Wang et. al. 2013 which has ~250 citations (details) and affirms the expectation that NDCG is contained within $[0,1]$. More directly, this survey paper on information-retrieval methods also uses relevance values (y_true) no less than 0.

Indeed, with the idea of relevance in document retrieval where NDCG was first formulated, it makes more sense for an additional document to offer no information and have no relevance, than to remove information or relevance from the overall query. From what I've read so far, including this additional survey paper talking about IR methods overall, cumulative gain (CG) is assumed to be monotone non-decreasing, so I don't think that y_true<0 is a valid use case, at the very least not for what it was designed to do.

Co-authored-by: Thomas J. Fan <thomasjpfan@gmail.com>

thomasjpfan

Minor nit, otherwise LGTM.

sklearn/metrics/_ranking.py

Co-authored-by: Thomas J. Fan <thomasjpfan@gmail.com>

jeremiedbb

LGTM. I just fixed the what's new entry that was not in the appropriate section.
Thanks @Micky774

…ues (scikit-learn#23461) Co-authored-by: trinhcon <conroy.trinh@mail.utoronto.ca> Co-authored-by: Victor Ko <vk07275@gmail.com> Co-authored-by: Thomas J. Fan <thomasjpfan@gmail.com> Co-authored-by: Jérémie du Boisberranger <34657725+jeremiedbb@users.noreply.github.com>

trinhcon and others added 17 commits February 27, 2022 16:47

fix: DeprecationWarning for negative y_true values in ndcg_score

5a6df33

fixed for checking np.ndarrays and regular arrays

689e72d

deprecation warning for negative ndcg

b628cb0

Merge pull request #2 from trinhcon/fix/ndcg-score-with-test

07a2447

deprecation warning for negative ndcg

updated the docstrings and changed exception to warning

ebce452

Merge branch 'fix/ndcg-score' of https://github.com/trinhcon/scikit-l…

a32da19

…earn into fix/ndcg-score

Created a changelog entry

7e2d9bb

Edited a changelog entry with pr number

7bb74cd

Fixed changelog entry formatting

36eafd3

Fixed changelog entry formatting

0e23aa0

Merge branch 'main' of https://github.com/scikit-learn/scikit-learn i…

9483840

…nto scikit-learn-main

Merge branch 'scikit-learn-main' into fix/ndcg-score

20fcffa

Moved changelog entry to appropriate location

c49cf4f

Fixed changelog entry formatting

e559817

Merge branch 'main' into ndcg-score

850bdf6

Updated emitted warning, fixed changelogs, updated tests

0bfd0a1

Reimplemented deprecation warning and added TODO comments

910fd43

github-actions bot added the module:metrics label May 25, 2022

Micky774 and others added 4 commits May 25, 2022 09:51

Added missing PR number

548371a

Streamline test

5a38ce4

Merge branch 'main' into ndcg-score

986f91c

Merge branch 'main' into ndcg-score

df1b3dc

thomasjpfan reviewed May 31, 2022

View reviewed changes

sklearn/metrics/_ranking.py Show resolved Hide resolved

Micky774 and others added 3 commits May 31, 2022 12:49

Merge branch 'main' into ndcg-score

da6295f

Merge branch 'main' into ndcg-score

a19312c

Merge branch 'main' into ndcg-score

a2ce07e

thomasjpfan reviewed Jun 4, 2022

View reviewed changes

sklearn/metrics/_ranking.py Show resolved Hide resolved

sklearn/metrics/_ranking.py Outdated Show resolved Hide resolved

sklearn/metrics/_ranking.py Outdated Show resolved Hide resolved

doc/whats_new/v1.2.rst Outdated Show resolved Hide resolved

Micky774 added 2 commits June 6, 2022 13:17

Merge branch 'main' into ndcg-score

88118e9

Incorporated review feedback

273ac2b

thomasjpfan reviewed Jun 7, 2022

View reviewed changes

sklearn/metrics/_ranking.py Outdated Show resolved Hide resolved

doc/whats_new/v1.2.rst Outdated Show resolved Hide resolved

Micky774 and others added 3 commits June 7, 2022 11:43

Apply suggestions from code review

e9b5d62

Co-authored-by: Thomas J. Fan <thomasjpfan@gmail.com>

Merge branch 'main' into ndcg-score

180724c

Updated test

b65e7b2

thomasjpfan approved these changes Jun 7, 2022

View reviewed changes

sklearn/metrics/_ranking.py Outdated Show resolved Hide resolved

thomasjpfan added the Quick Review For PRs that are quick to review label Jun 7, 2022

Update sklearn/metrics/_ranking.py

85f1639

Co-authored-by: Thomas J. Fan <thomasjpfan@gmail.com>

glemaitre self-requested a review June 13, 2022 18:35

Update v1.2.rst

5261413

jeremiedbb approved these changes Jun 22, 2022

View reviewed changes

jeremiedbb merged commit 9bb2098 into scikit-learn:main Jun 22, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ENH Added warning for `ndcg_score` when used w/ negative `y_true` values #23461

ENH Added warning for `ndcg_score` when used w/ negative `y_true` values #23461

ENH Added warning for ndcg_score when used w/ negative y_true values #23461

ENH Added warning for ndcg_score when used w/ negative y_true values #23461

Conversation

Reference Issues/PRs

What does this implement/fix? Explain your changes.

Any other comments?

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

ENH Added warning for `ndcg_score` when used w/ negative `y_true` values #23461

ENH Added warning for `ndcg_score` when used w/ negative `y_true` values #23461