These functions perform Non-negative Matrix Factorization on data stored in a
TreeSummarizedExperiment
object.
getNMF(x, ...)
addNMF(x, ...)
# S4 method for class 'SummarizedExperiment'
getNMF(x, k = 2, assay.type = "counts", eval.metric = "evar", ...)
# S4 method for class 'SummarizedExperiment'
addNMF(
x,
k = 2,
assay.type = "counts",
eval.metric = "evar",
name = "NMF",
...
)
a
TreeSummarizedExperiment
object.
optional arguments passed to nmf::NMF
.
numeric vector
. A number of latent vectors/topics.
(Default: 2
)
Character scalar
. Specifies which assay to use for
NMF ordination. (Default: "counts"
)
Character scalar
. Specifies the evaluation metric
that will be used to select the model with the best fit. Must be one of the
following options: "evar"
(explained variance; maximized),
"sparseness.basis"
(degree of sparsity in the basis matrix;
maximized), "sparseness.coef"
(degree of sparsity in the coefficient
matrix; maximized), "rss"
(residual sum of squares; minimized),
"silhouette.coef"
(quality of clustering based on the coefficient
matrix; maximized), "silhouette.basis"
(quality of clustering based
on the basis matrix; maximized), "cophenetic"
(correlation between
cophenetic distances and original distances; maximized), "dispersion"
(spread of data points within clusters; minimized). (Default: "evar"
)
Character scalar
. The name to be used to store the result
in the reducedDims of the output. (Default: "NMF"
)
For getNMF
, the ordination matrix with feature loadings matrix
as attribute "loadings"
.
For addNMF
, a
TreeSummarizedExperiment
object is returned containing the ordination matrix in
reducedDims(x, name)
with the following attributes:
"loadings" which is a matrix containing the feature loadings
"NMF_output" which is the output of function nmf::NMF
"best_fit" which is the result of the best fit if k is a vector of integers
The functions getNMF
and addNMF
internally use nmf::NMF
compute the ordination matrix and
feature loadings.
If k is a vector of integers, NMF output is calculated for all the rank
values contained in k, and the best fit is selected based on
eval.metric
value.
data(GlobalPatterns)
tse <- GlobalPatterns
# Reduce the number of features
tse <- agglomerateByPrevalence(tse, rank = "Phylum")
# Run NMF and add the result to reducedDim(tse, "NMF").
tse <- addNMF(tse, k = 2, name = "NMF")
#> Loading required package: registry
#> Loading required package: rngtools
#> Loading required package: cluster
#> NMF - BioConductor layer [OK] | Shared memory capabilities [NO: bigmemory] | Cores 2/2
#> To enable shared memory capabilities, try: install.extras('
#> NMF
#> ')
#>
#> Attaching package: ‘NMF’
#> The following object is masked from ‘package:S4Vectors’:
#>
#> nrun
#> The following object is masked from ‘package:generics’:
#>
#> fit
# Extract feature loadings
loadings_NMF <- attr(reducedDim(tse, "NMF"), "loadings")
head(loadings_NMF)
#> [,1] [,2]
#> AD3 8.396228e-04 1.297661e-07
#> Acidobacteria 3.115335e-02 1.264605e-05
#> Actinobacteria 9.323458e-02 2.379576e-02
#> Armatimonadetes 4.523057e-04 2.220446e-16
#> BRC1 6.070034e-05 6.996023e-08
#> Bacteroidetes 8.244977e-02 1.795354e-01
# Estimate models with number of topics from 2 to 4. Perform 2 runs.
tse <- addNMF(tse, k = c(2, 3, 4), name = "NMF_4", nrun = 2)
#> NMF - BioConductor layer [OK] | Shared memory capabilities [NO: bigmemory] | Cores 2/2
#> To enable shared memory capabilities, try: install.extras('
#> NMF
#> ')
#>
#> Attaching package: ‘NMF’
#> The following object is masked from ‘package:S4Vectors’:
#>
#> nrun
#> The following object is masked from ‘package:generics’:
#>
#> fit
# Extract feature loadings
loadings_NMF_4 <- attr(reducedDim(tse, "NMF_4"), "loadings")
head(loadings_NMF_4)
#> [,1] [,2] [,3] [,4]
#> AD3 5.577926e-04 2.220446e-16 2.220446e-16 1.204598e-07
#> Acidobacteria 1.909888e-02 2.974441e-06 2.673443e-03 2.220446e-16
#> Actinobacteria 6.220831e-02 1.412575e-02 1.362639e-02 2.220446e-16
#> Armatimonadetes 1.929561e-04 2.220446e-16 1.792145e-04 2.220446e-16
#> BRC1 3.286524e-05 2.220446e-16 1.246815e-05 5.688422e-08
#> Bacteroidetes 1.385933e-02 1.478853e-02 1.800624e-02 2.848152e-01