The amount of scientific papers published every day is daunting and constantly increasing. Keeping up with literature represents a challenge. If one wants to start exploring new topics it is hard to have a big picture without reading lots of articles. Furthermore, as one reads through literature, making mental connections is crucial to ask new questions which might lead to discoveries. In this work, I present a web tool which uses a Text Mining strategy to transform large collections of unstructured biomedical articles into structured data. Generated results give a quick overview on complex topics which can possibly suggest not explicitly reported information. In particular, I show two Data Science analyses. First, I present a literature based rare diseases network build using this tool in the hope that it will help clarify some aspects of these less popular pathologies. Secondly, I show how a literature based analysis conducted with PubSqueezer results allows the describe of known facts about SARS-CoV-2. In one sentence, data generated with PubSqueezer make it easy to use scientific literate in any computational analysis such as machine learning, natural language processing etc.
Availability: this http URL
[1]
Alberto Calderone,et al.
Comparing Alzheimer’s and Parkinson’s diseases networks using graph communities structure
,
2016,
BMC Systems Biology.
[2]
Jaiminkumar P. Patel,et al.
Clinical Perspective on 2019 Novel Coronavirus Pneumonia: A Systematic Review of Published Case Reports
,
2020,
Cureus.
[3]
Ž. Vlaisavljević,et al.
Diabetes and COVID-19: A systematic review on the current evidences
,
2020,
Diabetes Research and Clinical Practice.
[4]
Sophia Ananiadou,et al.
SciLite: a platform for displaying text-mined annotations as a means to link research articles with biological data
,
2017,
Wellcome open research.
[5]
Anushya Muruganujan,et al.
PANTHER version 14: more genomes, a new PANTHER GO-slim and improvements in enrichment analysis tools
,
2018,
Nucleic Acids Res..
[6]
Gemma C. Garriga,et al.
Permutation Tests for Studying Classifier Performance
,
2009,
2009 Ninth IEEE International Conference on Data Mining.
[7]
Hiroyuki Ogata,et al.
KEGG: Kyoto Encyclopedia of Genes and Genomes
,
1999,
Nucleic Acids Res..
[8]
Nick Cramer,et al.
Automatic Keyword Extraction from Individual Documents
,
2010
.