Show simple item record

dc.contributor.authorAhmadi, Sina
dc.contributor.authorDaudert, Tobias
dc.identifier.citationAhmadi, Sina, & Daudert, Tobias. (2019). CoFiF: A corpus of financial reports in French language. Paper presented at the The First Workshop on Financial Technology and Natural Language Processing (FinNLP), Macao, China, 12 August,
dc.description.abstractIn an era when machine learning and artificial intelligence have huge momentum, the data demand to train and test models is steadily growing. We introduce CoFiF, the first corpus comprising company reports in the French language. It contains over 188 million tokens in 2655 reports, covering reference documents, annual, semestrial and trimestrial reports. Our main focus is on the 60 largest French companies listed in France s main stock indices CAC40 and CAC Next 20. The corpus spans over 20 years, ranging from 1995 to 2018. To evaluate this novel collection of organizational writing, we use CoFiF to generate two character-level language models, a forward and a backward one, which we use to demonstrate the corpus potential on business, economics, and management research in the French language.en_IE
dc.publisherNUI Galwayen_IE
dc.relation.ispartofThe First Workshop on Financial Technology and Natural Language Processing (FinNLP)en
dc.subjectfinancial reportsen_IE
dc.subjectFrench languageen_IE
dc.titleCoFiF: A corpus of financial reports in French languageen_IE
dc.typeWorkshop paperen_IE
dc.contributor.funderScience Foundation Irelanden_IE
dc.contributor.funderEuropean Regional Development Funden_IE
dc.local.contactSina Ahmadi, The Insight Centre For Data Analytics, National University Of Ireland, Galway , The Deri Building . Email:
dcterms.projectinfo:eu-repo/grantAgreement/SFI/SFI Research Centres/12/RC/2289/IE/INSIGHT - Irelands Big Data and Analytics Research Centre/en_IE

Files in this item

Attribution-NonCommercial-NoDerivs 3.0 Ireland
This item is available under the Attribution-NonCommercial-NoDerivs 3.0 Ireland. No item may be reproduced for commercial purposes. Please refer to the publisher's URL where this is made available, or to notes contained in the item itself. Other terms may apply.

The following license files are associated with this item:


This item appears in the following Collection(s)

Show simple item record