Chromatogram fdata #290

jorainer · 2017-12-18T10:51:50Z

Add a featureData slot to the Chromatograms object.
Add mz,Chromatograms, precursorMz,Chromatograms and productMz,Chromatograms.
Add all related unit tests and documentation.

Short description: each row of a Chromatograms object should contain chromatogram data for the same ion or m/z range (and possibly the rt range). Storing this information in a featureData gives users a quick way to access it.

- Add featureData slot to Chromatograms class.
- Add getter/setter method for featureData.
- Add precursorMz, productMz and mz,Chromatograms method.
- Add related unit tests and documentation.

sgibb

All in all a fine PR.

sgibb · 2017-12-19T10:32:42Z

R/functions-Chromatograms.R

+#'
+#' @noRd
+.mz_chromatograms <- function(x, mz = "mz") {
+    mz <- match.arg(mz, c("mz", "precursorMz", "productMz"))


This is just my personal preference. I like to see all choices for an argument in the definition of a function (especially for an exported and documented function):

.mz_chromatograms <- function(x, mz = c("mz", "precursorMz", "productMz")) {
    mz <- match.arg(mz) ## will automatically use "mz" as default
    ## ...
}
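A minimal sketch of how match.arg behaves when the choices are listed in the signature, as suggested above. The helper `pick_mz` is a hypothetical stand-in written for this illustration, not a function of the package.

```r
## Hypothetical helper: the allowed choices live in the signature.
pick_mz <- function(mz = c("mz", "precursorMz", "productMz")) {
    ## With no value supplied, match.arg() returns the first choice.
    match.arg(mz)
}

default_choice <- pick_mz()
explicit_choice <- pick_mz("productMz")
## Partial matching is supported as long as the prefix is unambiguous.
partial_choice <- pick_mz("prec")
## A value outside the listed choices raises an error.
invalid_choice <- tryCatch(pick_mz("rt"), error = function(e) "error")
```

The choices then also show up in the usage section of the generated documentation, which is the main appeal for exported functions.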

sgibb · 2017-12-19T10:35:24Z

R/functions-Chromatograms.R

+    ## If we've got the values in the featureData, use these.
+    if (mz %in% c("precursorMz", "productMz"))
+        vl <- rep(paste0(sub(mz, pattern = "Mz", replacement = ""),
+                         "IsolationWindowTargetMZ"), 2)


paste0 is superfluous here: vl <- rep(sub("Mz", "IsolationWindowTargetMZ", mz), 2)
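A quick check, using only base R, that the shorter form yields the same column names as the original paste0 construct for both method names:

```r
mz <- c("precursorMz", "productMz")

## Original form: strip "Mz", then append the suffix.
with_paste0 <- paste0(sub(mz, pattern = "Mz", replacement = ""),
                      "IsolationWindowTargetMZ")
## Suggested form: replace "Mz" with the suffix in one step.
with_sub <- sub("Mz", "IsolationWindowTargetMZ", mz)

same_names <- identical(with_paste0, with_sub)
```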

sgibb · 2017-12-19T10:45:57Z

R/functions-Chromatograms.R

+        vl <- c("mzmin", "mzmax")
+    if (all(vl %in% fvarLabels(x))) {
+        ## Want to return a matrix, not a data.frame
+        cbind(mzmin = fData(x)[, vl[1]], mzmax = fData(x)[, vl[2]])


Why not explicitly convert into a matrix? The return value of cbind depends on its input; it would be a numeric matrix in our use case, but who knows ...

m <- as.matrix(fData(x)[, vl])
dimnames(m) <- list(NULL, c("mzmin", "mzmax"))
m

Or in one line:

as.matrix(setNames(fData(x)[, vl], c("mzmin", "mzmax")), rownames.force=FALSE)
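A small sketch comparing the two approaches on a toy data.frame. Here `fd` stands in for the result of fData(x), and its columns are assumptions made for this illustration.

```r
## Toy stand-in for fData(x); the columns are made up for this example.
fd <- data.frame(mzmin = c(100.1, 200.2),
                 mzmax = c(100.3, 200.4),
                 polarity = c(1L, 1L))
vl <- c("mzmin", "mzmax")

## Column-wise extraction combined with cbind().
via_cbind <- cbind(mzmin = fd[, vl[1]], mzmax = fd[, vl[2]])
## Explicit conversion of the two-column subset.
via_as_matrix <- as.matrix(setNames(fd[, vl], c("mzmin", "mzmax")),
                           rownames.force = FALSE)

## For numeric columns both are numeric matrices with the same content.
same_result <- isTRUE(all.equal(via_cbind, via_as_matrix))
```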

I'll check - I think there was a reason for using cbind here (performance-wise).

cbind is slightly faster than as.matrix. Not that it matters here; I just prefer using cbind and extracting individual columns from a data.frame over anything that accesses multiple columns of a data.frame at once, as that can involve copying the data, while accessing a single column of a data.frame never copies the data.

Oh, cool. I wasn't aware of that. You are right:

library("microbenchmark")
set.seed(2017)
n <- 1e5
d <- data.frame(a=sample(n), b=sample(n), c=sample(n))
f1 <- function(x, vl=c("a", "b"))
    cbind(mzmin=x[, vl[1L]], mzmax=x[, vl[2L]])
f2 <- function(x, vl=c("a", "b"))
    matrix(c(x[, vl[1L]], x[, vl[2L]]), ncol=2L,
           dimnames=list(NULL, c("mzmin", "mzmax")))
all.equal(f1(d), f2(d))
# [1] TRUE
microbenchmark(f1(d), f2(d))
# Unit: microseconds
#  expr     min       lq     mean   median      uq      max neval
# f1(d) 112.814 122.4080 203.0495 126.7465 139.109 3693.441   100
# f2(d) 505.302 523.0355 638.2103 526.0990 536.168 4133.861   100

sgibb · 2017-12-19T14:41:49Z

R/functions-Chromatograms.R

+        ## the values in one row are not identical
+        mzr <- matrix(nrow = nrow(x), ncol = 2,
+                      dimnames = list(NULL, c("mzmin", "mzmax")))
+        for (i in 1:nrow(mzr)) {


Use seq_len(nrow(mzr)) instead of 1:nrow(mzr): https://bioconductor.org/developers/how-to/efficient-code/#avoid--style-iterations
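A short illustration of why this matters, using a matrix with zero rows (the edge case where the two forms differ):

```r
## A matrix with zero rows, e.g. from an empty Chromatograms object.
empty <- matrix(numeric(0), nrow = 0, ncol = 2)

## 1:nrow(empty) is 1:0, i.e. c(1, 0), so the loop body runs twice.
n_colon <- 0L
for (i in 1:nrow(empty)) n_colon <- n_colon + 1L

## seq_len(0) is an empty vector, so the loop body never runs.
n_seq_len <- 0L
for (i in seq_len(nrow(empty))) n_seq_len <- n_seq_len + 1L
```

With `1:nrow(mzr)` any indexing inside the loop would hit row 1 of an empty matrix and fail with a subscript error.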

sgibb · 2017-12-19T14:44:09Z

R/methods-Chromatograms.R

+#' @rdname Chromatograms-class
+#'
+#' @description \code{fData}: return the feature data as a \code{data.frame}.
+setMethod("fData", "Chromatograms", function(object) pData(object@featureData))


Is pData correct (instead of fData)?

This is correct, although confusing. pData,AnnotatedDataFrame accesses the adf@data slot, but pData is also the accessor for the object@phenoData@data slot, where object is an eSet-type instance.
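A minimal sketch of the first case, assuming the Bioconductor package Biobase is installed (the block is skipped otherwise). The toy columns are made up for this illustration.

```r
## Assumption: Biobase is available; otherwise nothing is run.
if (requireNamespace("Biobase", quietly = TRUE)) {
    adf <- Biobase::AnnotatedDataFrame(
        data.frame(mzmin = c(100.1, 200.2), mzmax = c(100.3, 200.4)))
    ## pData() on an AnnotatedDataFrame returns the content of its data slot.
    same_as_slot <- identical(Biobase::pData(adf), adf@data)
}
```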

lgatto

Looks good to me - @sgibb did all the work already anyway :-).

lgatto · 2017-12-19T16:00:00Z

@jotsetung - is this something that needs to be pushed to Bioc quickly?

(sorry, initially commented on wrong PR)

jorainer · 2017-12-19T17:50:30Z

Nope, no need to push to Bioc now. This is some preparatory work for the future readSRMData function to read mzML files with chromatogram data - but that depends on the related pull request in mzR (sneumann/mzR#142).

I'm confused - did you already merge? Then I'll make the requested minor changes above in the master branch?

lgatto · 2017-12-19T17:55:25Z

Yes, I already merged, sorry.

What about readMRMData rather than readSRMData, or having both, doing the same thing? (SRMs and MRMs are essentially the same thing, as far as I know.)

jorainer · 2017-12-19T17:57:24Z

Yes, I believe SRM and MRM are the same thing - with SRM being the correct term. Having a readSRMData with an alias readMRMData could be OK. Hope it's not confusing to the user.

jorainer added 4 commits December 15, 2017 11:20

Add featureData slot to Chromatograms class

6994dec

Add featureData slot to Chromatograms class (issue #289)

3526729

Merge branch 'master' into chromatogram_fdata

fb101c1

Update NEWS

50c281c

jorainer requested review from sgibb and lgatto December 18, 2017 10:51

sgibb reviewed Dec 19, 2017

lgatto approved these changes Dec 19, 2017

Merge branch 'master' into chromatogram_fdata

588a1e2

lgatto merged commit f620044 into master Dec 19, 2017

jorainer added a commit that referenced this pull request Dec 20, 2017

Address comments from @sgibb in pull request #290

8a73f06

jorainer deleted the chromatogram_fdata branch December 20, 2017 06:33
