Latent Dirichlet Allocation (LDA), Random Projections (RP), Hierarchical Dirichlet Process (HDP) or word2vec deep learning.ĭistributed computing: can run Latent Semantic Analysis and Latent Dirichlet Allocation on a cluster of computers.Įxtensive documentation and Jupyter Notebook tutorials. the corpus size (can process input larger than RAM, streamed, out-of-core)Įasy to plug in your own input corpus/datastream (simple streaming API)Įasy to extend with other Vector Space algorithms (simple transformation API)Įfficient multicore implementations of popular algorithms, such as online Latent Semantic Analysis (LSA/LSI/SVD), FeaturesĪll algorithms are memory-independent w.r.t. ![]() ![]() Target audience is the natural language processing (NLP) and information retrieval (IR) community. Gensim is a Python library for topic modelling, document indexing and similarity retrieval with large corpora.
0 Comments
Leave a Reply. |
Details
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |