McGill University, Facebook & Mila Introduces MeDAL: A NLP Pre-training Dataset for Medical Abbreviation Disambiguation With 14M Articles

  • by
McGill University, Facebook & Mila Introduces MeDAL: A NLP Pre-training Dataset for Medical Abbreviation Disambiguation With 14M Articles

At EMNLP 2020 (Empirical Methods in Natural Language Processing) Conference, a Montreal-based research team introduced a large medical text dataset designed to improve medical abbreviation disambiguation.

Correct terminology and related deep learning models for various tasks have a significant role in medicine and healthcare. However, there has been a lack of publicly available pre-training data in this field due to privacy restrictions and an overabundance of non-standard abbreviations. The patient-safety organization, Institute for Safe Medical Practices (ISMP), has listed more than 55,000 medical abbreviations that may not be interpreted correctly.

The researchers from McGill University, Facebook CIFAR AI Chair, and Mila – Quebec Artificial Intelligence Institute recently introduced MeDAL. MeDAL: Medical Dataset for Abbreviation Disambiguation for Natural Language Understanding helps resolve all the contradictory, ambiguous, and potentially dangerous abbreviations in the medical and healthcare field. An example of what it does is shown below.

Summary: https://www.marktechpost.com/2020/12/01/mcgill-university-facebook-mila-introduces-medal-a-nlp-pre-training-dataset-for-medical-abbreviation-disambiguation-with-14m-articles/

Github: https://github.com/BruceWen120/medal

Paper: https://www.aclweb.org/anthology/2020.clinicalnlp-1.15.pdf

https://preview.redd.it/k5wo0pdujp261.png?width=696&format=png&auto=webp&s=2850095931ba51bdc700387125bf291fd7609a00

submitted by /u/ai-lover
[link] [comments]

Leave a Reply

Your email address will not be published. Required fields are marked *