Skip to content

Latest commit

 

History

History
22 lines (18 loc) · 1.27 KB

File metadata and controls

22 lines (18 loc) · 1.27 KB

LDA on Covid Open Research Dataset

Coronavirus has been a disease that has impact the world like no other virus has done in modern times, creating a worlwide emergencysituation. This work aims to help the classification of information by obtaining the topic papers are talking about as well as classify this topics in principal domains like vacciones and therapeutics this done possible using tools like Latent Dirichlet Allocation and dictionaries with terms that can help to improve information detection.

Python packages

  • Gensim - Topic modelling library
  • Matplotlib - 2D graphics library
  • Nltk - Natural language processing with python
  • Numpy - Scientific computation package
  • pyLDAvis - Topic modelling visualization
  • Scipy - Mathematics and science environment

More