
Gensim

Gensim is an open-source library for unsupervised topic modeling, document indexing, retrieval by similarity, and other natural language processing functionalities, using modern statistical machine learning.

Original author(s): Radim Řehůřek
Developer(s): RARE Technologies Ltd.
Initial release: 2009
Stable release: 4.3.2[1] / 24 August 2023
Repository: github.com/RaRe-Technologies/gensim
Written in: Python
Operating system: Linux, Windows, macOS
Type: Information retrieval
License: LGPL
Website: radimrehurek.com/gensim/

Gensim is implemented in Python and Cython for performance. Gensim is designed to handle large text collections using data streaming and incremental online algorithms, which differentiates it from most other machine learning software packages that target only in-memory processing.
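
The streaming idea can be illustrated without Gensim itself. The following is a minimal sketch in plain Python, not Gensim's own code: the class name `StreamedCorpus`, the helper `update_document_frequencies`, and the three-line sample corpus are all hypothetical. It reads one document at a time from disk and folds each into running counts, so memory use depends on the vocabulary size rather than on the number of documents.

```python
import os
import tempfile
from collections import Counter


class StreamedCorpus:
    """Iterate over a text file one document (one line) at a time."""

    def __init__(self, path):
        self.path = path

    def __iter__(self):
        # Re-open the file on every pass so the corpus can be iterated repeatedly.
        with open(self.path, encoding="utf-8") as handle:
            for line in handle:
                yield line.lower().split()


def update_document_frequencies(doc_freq, documents):
    """Fold new documents into running document-frequency counts (online update)."""
    seen = 0
    for tokens in documents:
        doc_freq.update(set(tokens))  # count each word at most once per document
        seen += 1
    return seen


# Write a tiny hypothetical corpus to disk: one document per line.
with tempfile.NamedTemporaryFile(
    "w", suffix=".txt", delete=False, encoding="utf-8"
) as tmp:
    tmp.write("human machine interface\n")
    tmp.write("graph of trees\n")
    tmp.write("machine learning on graph data\n")
    corpus_path = tmp.name

doc_freq = Counter()
num_docs = update_document_frequencies(doc_freq, StreamedCorpus(corpus_path))
os.remove(corpus_path)  # clean up the temporary file
```

Because `StreamedCorpus` re-opens the file in `__iter__`, it can be iterated several times, which multi-pass algorithms need; a bare generator would be exhausted after one pass.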

Main Features

Gensim includes streamed parallelized implementations of fastText,[2] word2vec and doc2vec algorithms,[3] as well as latent semantic analysis (LSA, LSI, SVD), non-negative matrix factorization (NMF), latent Dirichlet allocation (LDA), tf-idf and random projections.[4]
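
Of the listed methods, tf-idf is the easiest to show in a few lines. The sketch below is plain Python written for illustration and is not Gensim's implementation; the function names, the sample documents, the base-2 logarithm and the unit-length normalization are assumptions. It weights each word by its in-document count times the log of its inverse document frequency, then compares documents by cosine similarity, which is the basis of retrieval by similarity.

```python
import math
from collections import Counter

# Hypothetical, pre-tokenized sample documents.
documents = [
    ["human", "machine", "interface"],
    ["graph", "of", "trees"],
    ["machine", "learning", "on", "graph", "data"],
]


def tfidf_vectors(docs):
    """Return one sparse {word: weight} dict per document, scaled to unit length."""
    num_docs = len(docs)
    doc_freq = Counter()
    for tokens in docs:
        doc_freq.update(set(tokens))  # number of documents containing each word

    vectors = []
    for tokens in docs:
        term_freq = Counter(tokens)
        weights = {
            word: count * math.log2(num_docs / doc_freq[word])
            for word, count in term_freq.items()
        }
        norm = math.sqrt(sum(value * value for value in weights.values()))
        if norm > 0:
            weights = {word: value / norm for word, value in weights.items()}
        vectors.append(weights)
    return vectors


def cosine(vec_a, vec_b):
    """Cosine similarity of two unit-length sparse vectors (their dot product)."""
    return sum(value * vec_b.get(word, 0.0) for word, value in vec_a.items())


vectors = tfidf_vectors(documents)
similarity_shared_word = cosine(vectors[0], vectors[2])  # both contain "machine"
similarity_no_overlap = cosine(vectors[0], vectors[1])   # no words in common
```

Words that occur in every document get a weight of zero under this scheme, so they do not contribute to similarity scores.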

Some of the novel online algorithms in Gensim were also published in the 2011 PhD dissertation "Scalability of Semantic Analysis in Natural Language Processing" by Radim Řehůřek, the creator of Gensim.[5]

Uses of Gensim

Gensim has been used and cited in over 1,400 commercial and academic applications as of 2018,[6] in disciplines ranging from medicine to insurance claim analysis to patent search.[7] The software has been covered in several news articles, podcasts and interviews.[8][9][10]

Free and Commercial Support

The open-source code is developed and hosted on GitHub,[11] and public support channels are maintained on Google Groups[12] and Gitter.[13]

Gensim is commercially supported by the company behind rare-technologies.com, which also provides student mentorships and academic thesis projects for Gensim through its Student Incubator programme.[14]

References

  1. "Release 4.3.2". 24 August 2023. Retrieved 18 September 2023.
  2. Scalable *2vec training
  3. Deep learning with word2vec and Gensim
  4. Řehůřek, Radim; Sojka, Petr (2010). "Software framework for topic modelling with large corpora". Proc. LREC Workshop on New Challenges for NLP Frameworks.
  5. Řehůřek, Radim (2011). "Scalability of Semantic Analysis in Natural Language Processing" (PDF). Retrieved 27 January 2015. Quote: "my open-source gensim software package that accompanies this thesis".
  6. Gensim academic citations
  7. Commercial adopters of Gensim
  8. Podcast.__init__ episode #71 on Gensim
  9. Interview with Radim Řehůřek, creator of Gensim
  10. "DecisionStats Interview Radim Řehůřek Gensim #python". 8 December 2015.
  11. Gensim source code on GitHub
  12. Gensim mailing list on Google Groups
  13. Gensim chat room on Gitter
  14. Gensim open source Incubator

External links

  • Official website

