no code implementations • JEP/TALN/RECITAL 2021 • Adrien Bazoge
This article presents a literature review that examines, through publications on PubMed, the ACL Anthology and Google Scholar, the interaction between two topics: biomedical data warehouses and natural language processing.
1 code implementation • 26 Feb 2024 • Adrien Bazoge, Emmanuel Morin, Beatrice Daille, Pierre-Antoine Gourraud
Recently, pretrained language models based on BERT have been introduced for the French biomedical domain.
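As a minimal sketch of how such a model is typically used, the snippet below loads a French biomedical BERT checkpoint through the Hugging Face transformers library; the checkpoint identifier is an assumption for illustration and may differ from the one released with the paper.

```python
# Minimal sketch: loading a pretrained French biomedical BERT model.
# The checkpoint id "Dr-BERT/DrBERT-7GB" is an assumption for illustration;
# substitute the identifier of the model you actually intend to use.
from transformers import AutoTokenizer, AutoModelForMaskedLM

model_name = "Dr-BERT/DrBERT-7GB"  # hypothetical checkpoint id
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForMaskedLM.from_pretrained(model_name)

text = "Le patient présente une hypertension artérielle sévère."
inputs = tokenizer(text, return_tensors="pt")
outputs = model(**inputs)  # masked-LM logits over the model's vocabulary
```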
no code implementations • 22 Feb 2024 • Yanis Labrak, Adrien Bazoge, Beatrice Daille, Mickael Rouvier, Richard Dufour
Subword tokenization has become the prevailing standard in the field of natural language processing (NLP) over recent years, primarily due to the widespread utilization of pre-trained language models.
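For readers unfamiliar with the mechanism, the sketch below shows how a subword tokenizer splits rare words into smaller units while leaving frequent words whole; the multilingual BERT tokenizer is used only as a readily available example, and the token splits shown in comments are illustrative.

```python
# Minimal sketch of subword (WordPiece-style) tokenization.
# "bert-base-multilingual-cased" is used only as an easily available example.
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-multilingual-cased")

# Rare or domain-specific terms tend to be split into several subword units,
# whereas frequent words usually remain a single token.
print(tokenizer.tokenize("pneumonie"))  # e.g. ['p', '##neu', '##mon', '##ie']
print(tokenizer.tokenize("patient"))    # e.g. ['patient']
```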
1 code implementation • 20 Feb 2024 • Yanis Labrak, Adrien Bazoge, Oumaima El Khettari, Mickael Rouvier, Pacome Constant dit Beaufils, Natalia Grabar, Beatrice Daille, Solen Quiniou, Emmanuel Morin, Pierre-Antoine Gourraud, Richard Dufour
This limitation hampers the evaluation of the latest French biomedical models, as they are either assessed on a minimal number of tasks with non-standardized protocols or evaluated using general downstream tasks.
no code implementations • 15 Feb 2024 • Yanis Labrak, Adrien Bazoge, Emmanuel Morin, Pierre-Antoine Gourraud, Mickael Rouvier, Richard Dufour
This marks the first large-scale multilingual evaluation of LLMs in the medical domain.
Ranked #8 on Few-Shot Learning on MedConceptsQA
1 code implementation • LOUHI 2022 • Yanis Labrak, Adrien Bazoge, Richard Dufour, Mickael Rouvier, Emmanuel Morin, Béatrice Daille, Pierre-Antoine Gourraud
This paper introduces FrenchMedMCQA, the first publicly available Multiple-Choice Question Answering (MCQA) dataset in French for the medical domain.
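As a hedged sketch, an MCQA dataset of this kind can be loaded with the Hugging Face datasets library; the hub identifier and split name below are assumptions for illustration, not necessarily the ones under which FrenchMedMCQA is published.

```python
# Minimal sketch: loading an MCQA dataset with Hugging Face datasets.
# The hub id "qanastek/frenchmedmcqa" and the "train" split are assumptions
# made for illustration only.
from datasets import load_dataset

dataset = load_dataset("qanastek/frenchmedmcqa")  # hypothetical hub id
example = dataset["train"][0]
print(example)  # a question, its answer options, and the correct answer(s)
```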
no code implementations • 3 Apr 2023 • Yanis Labrak, Adrien Bazoge, Richard Dufour, Mickael Rouvier, Emmanuel Morin, Béatrice Daille, Pierre-Antoine Gourraud
In recent years, pre-trained language models (PLMs) have achieved the best performance on a wide range of natural language processing (NLP) tasks.