Open
Conversation
Defined 2 new typedefs one for R datastructures and one for internal datastructures that use floats.
performance improvement attempt for huge datasets
- reduced iterations of kmeans to 4 - kmeans++ seeding centers now done according to efficient weighted sampling - mt19937 random() - uniform_real_distribution 0.01,0.99: Avoid bias towards high random numbers to bias towards high-acceptance range [DEV] kmeans centroids initialization - Low memory/ batch by cluster operation - Verbose for logging progress [FIX] remove existing centroids from kmeans++ init centroids [BUG] replace fill::arma::randu with stdlib::rand() Rcpp armadillo's rand function does not work with randu [FIX] add elements to set during centroid initialization If element exists already then backtrack and retry
Parameterize the batch proportion cutoff
unshuffle and then return
- also print some messages
- When covariate has one trivial level after subsetting it is dropped altogether
Flat dense matrix, not the most memory efficient but better performance
These is just for archival purposes
A cell may belong to several different batches when different covariates exist. The design was assuming that all cells MUST have covariates+1 entries in Phi. However, if for a cell only one batch was dropped but the other covariate has support, this is not true.
- Reproducible clusters by the R set.seed for the same embedding set
…us-integration
- null hypothesis gives a ratio close to 1 - Added pseudocount for cases when E is close to 1
RcppArmadillo/#438
- Fixed vignettes to interface correctly with the new API change - New Getter that gets all lambdas for each cluster
added times
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
No description provided.