Learner corpora around the world

This page offers an overview of learner corpora — electronic collections of continuous written or spoken data produced by foreign or second language learners. Our goal is to provide a comprehensive resource that includes essential information about learners, tasks, and corpus characteristics, supporting both findability and accessibility.

Keeping such a resource accurate and up to date is challenging given the rapid developments in the field. We therefore warmly invite your contributions. If you spot missing or incorrect information, or if you would like to suggest the inclusion of a new corpus, please contact us at cecl@uclouvain.be.

When suggesting a new corpus, please supply the information that corresponds to the table columns: corpus name, target language, first language, medium, text type/task type, proficiency level, size (in words), project director, and availability (ideally with a permanent identifier if the corpus is available online).

Your updates will help us keep this overview as complete and useful as possible for the community.

To refer to this list:

Centre for English Corpus Linguistics (date of access): Learner Corpora around the World. Louvain-la-Neuve: Université catholique de Louvain. https://uclouvain.be/en/research-institutes/ilc/cecl/learner-corpora-around-the-world.html