how to proceed #1

flying-sheep · 2017-02-04T16:39:48Z

make things work*
find out how to store PCA and friends in/with the AnnData 1cec418#commitcomment-20744162
determine how to read/write AnnData. maybe fields named var_* in the HDF5 will be var metadata and so on?

*apart from things still crashing, especially the group plotting should be fixed (should probably be transformed to one scatter call with a list of all groups)

The text was updated successfully, but these errors were encountered:

falexwolf · 2017-02-05T09:23:57Z

Good! So, I'd really like to jump in and work on ann_matrix as well, if you think this is efficient. Of course, I don't want to mess up what you had in mind.

yes, that's important - can i help?
that's easy, simply put it in smp as a multicolumn object
should be very easy as well, maybe recarray can directly be written with a single key, if not, one has to make the separation between str and float columns -> shall I attack that? see this for how it was done with the ddata using its 'rowcat' attribute. should be straightforwardly adapted, right?*

*sorry, I simply forgot to add readwrite.py on thursday night, which caused master to be non-working since then, of course. with readwrite.py added, master now works just fine. I guess the only change you made to utils.py was adding the AnnData.from_dict(...) in the function read()? so one could use readwrite.py from master within ann_matrix. or just create readwrite.py again by cutting out everything related to reading/writing from utils and pasting it into the new module readwrite.py.

falexwolf · 2017-02-05T09:44:08Z

Generally: What shall I do in order to merge ann_matrix as quickly as possible with the master branch? Starting from tomorrow, fiona would like to work on one tool using the nestorowa16 case i mentioned before. So if you allow me, I'll try to get everything running and polished tonight.
PS: During the day, I'll be offline.

flying-sheep · 2017-02-05T17:53:03Z

sure, go ahead, i’m occupied today preparing my mitarbeitergespräch :D

falexwolf · 2017-02-05T19:41:05Z

damn, I'm not fit enough to make ann_matrix work tonight. so, in order to get figures, analysis and a barebone code for fiona ready (we have a skype conference with fabian and the group in cambridge tomorrow at 11am, and fabian is quite pushy), i'll use the working master branch.

let's discuss merging with ann_matrix in person during the next days.

Updated read_10x_h5: - Renamed the original `read_10x_h5` as `_read_legacy_10x_h5`; - Added `_read_v3_10x_h5` to read the new Cell Ranger output format; - The new `read_10x_h5` determines the version of HDF5 input by the presence of the matrix key, and wraps the above two functions. In addition, it takes a `gex_only` argument which filters out feature barcoding counts from the outcome object when it is True (default). Otherwise, the full matrix will be retained. - For CR-v3, `feature_types` and `genome` were added into the outcome object as new attributes. Updated read_10x_mtx: - Renamed the original `read_10x_mtx` as `_read_legacy_10x_mtx`; - Added `_read_v3_10x_mtx` to read the new Cell Ranger output format; - The new `read_10x_mtx` determines the version of matrix input by the presence of the `genes.tsv` file under the input directory, and wraps the above two functions. In addition, it takes a `gex_only` argument which filters out feature barcoding counts from the outcome object when it is `True` (default). Otherwise, the full matrix will be retained. - For CR-v3, `feature_types` was added into the outcome object as a new attribute. Added test data and code for the revised functions. Note for the genome argument: - There is a genome argument in Scanpy's `read_10x_h5` function but not in `read_10x_mtx` as the genome was already specified by the path of input directory. The outcome object of the two functions should be the same which always take one genome at a time. - In this PR, when there are multiple genomes (e.g. Barnyard), `read_10x_mtx` always read them all, whereas `read_10x_h5` always need to specify one of them (mm10 by default). However, when `gex_only == False`, the `genome` argument will be ignored and the whole matrix will be read.

Let Scanpy read from Cell Ranger 3.0 outputs (#1)

Merge with master

update to match upstream

falexwolf closed this as completed Feb 8, 2017

worker000000 mentioned this issue Sep 29, 2018

unable to install scanpy #276

Closed

falexwolf added a commit that referenced this issue Oct 29, 2018

Merge pull request #334 from 10XGenomics/master

1284599

Let Scanpy read from Cell Ranger 3.0 outputs (#1)

ivirshup pushed a commit that referenced this issue Apr 8, 2019

Merge pull request #1 from theislab/master

c2b777c

Merge with master

flying-sheep pushed a commit that referenced this issue Jun 28, 2019

Merge pull request #1 from theislab/master

1c0a56c

update to match upstream

flying-sheep mentioned this issue Oct 24, 2019

mnn_correct() ValueError: not enough values to unpack (expected 3, got 1) #757

Closed

This was referenced Feb 28, 2022

TypingError: Failed in nopython mode pipeline (step: nopython frontend) #1652

Closed

Bug on scanpy, sc.pp.neighbors function #2160

Open

mdbabumiamssm mentioned this issue Nov 17, 2022

LoweringError: Failed in nopython mode pipeline (step: nopython mode backend) #1756

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

how to proceed #1

how to proceed #1

how to proceed #1

how to proceed #1

Comments