[go: nahoru, domu]

Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[MRG] Add a function to plot the taxonomy output #1607

Open
wants to merge 148 commits into
base: latest
Choose a base branch
from

Conversation

taylorreiter
Copy link
Contributor

Given an abundance csv generated by sourmash tax summarize --output-format lineage_summary,
plot a heatmap that is clustered on the lineage axis. Output as a png file.
tmp

Some potential improvement in the viz:

  • hierarchically cluster the samples as well
  • make labelled boxes optional

Some design choices that aren't considered here:

  • unlike above, reading in data and plotting data are not split
  • unlike plot_composite_matrix(), the saving of the plot is not split out into a different command.

So given these things, the exact function I included here might not fit with sourmash, but this PR at leasts preserves the code and gives us a jumping off point. I think the viz improvements can be addressed in a later PR also.

@luizirber @bluegenes ready for review/feedback merge etc.

ctb and others added 30 commits May 21, 2021 07:47
Base automatically changed from add-taxonomy to latest June 23, 2021 13:58
@codecov
Copy link
codecov bot commented Jun 25, 2021

Codecov Report

Merging #1607 (ab812ea) into latest (baf38c1) will increase coverage by 7.09%.
The diff coverage is 10.52%.

@@            Coverage Diff             @@
##           latest    #1607      +/-   ##
==========================================
+ Coverage   84.66%   91.76%   +7.09%     
==========================================
  Files         131      100      -31     
  Lines       15512    11274    -4238     
  Branches     2210     2218       +8     
==========================================
- Hits        13134    10346    -2788     
+ Misses       2085      635    -1450     
  Partials      293      293              
Flag Coverage Δ
python 91.76% <10.52%> (-0.28%) ⬇️
rust ?

Flags with carried forward coverage won't be shown. Click here to find out more.

Impacted Files Coverage Δ
src/sourmash/fig.py 53.57% <10.52%> (-35.56%) ⬇️
src/core/src/index/bigsi.rs
src/core/src/sketch/nodegraph.rs
src/core/src/storage.rs
src/core/tests/test.rs
src/core/src/sketch/minhash.rs
src/core/src/from.rs
src/core/src/index/sbt/mhbt.rs
src/core/src/cmd.rs
src/core/src/signature.rs
... and 22 more

📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more

@bluegenes
Copy link
Contributor

exploring here: https://github.com/bluegenes/2021-sourmash-taxonomy-hackathon/blob/main/plot-lineage.ipynb
current status: split into functions, but still needs some work

@ctb
Copy link
Contributor
ctb commented Aug 20, 2022

@taylorreiter @bluegenes does anyone remember why this didn't get reviewed & merged?

@taylorreiter
Copy link
Contributor Author

I don't, but looking at the notebook in the previous comment, maybe something to do with squished labels not being worked out when there are many samples? @bluegenes do you remember?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

4 participants