Show simple item record

dc.contributor.authorAhmadi, Sina
dc.contributor.authorDaudert, Tobias
dc.date.accessioned2019-07-19T10:33:35Z
dc.date.issued2019-08-12
dc.identifier.citationAhmadi, Sina, & Daudert, Tobias. (2019). CoFiF: A corpus of financial reports in French language. Paper presented at the The First Workshop on Financial Technology and Natural Language Processing (FinNLP), Macao, China, 12 August, https://doi.org/10.13025/zjf2-fn10en_IE
dc.identifier.urihttp://hdl.handle.net/10379/15276
dc.description.abstractIn an era when machine learning and artificial intelligence have huge momentum, the data demand to train and test models is steadily growing. We introduce CoFiF, the first corpus comprising company reports in the French language. It contains over 188 million tokens in 2655 reports, covering reference documents, annual, semestrial and trimestrial reports. Our main focus is on the 60 largest French companies listed in France s main stock indices CAC40 and CAC Next 20. The corpus spans over 20 years, ranging from 1995 to 2018. To evaluate this novel collection of organizational writing, we use CoFiF to generate two character-level language models, a forward and a backward one, which we use to demonstrate the corpus potential on business, economics, and management research in the French language.en_IE
dc.formatapplication/pdfen_IE
dc.language.isoenen_IE
dc.publisherNUI Galwayen_IE
dc.relation.ispartofThe First Workshop on Financial Technology and Natural Language Processing (FinNLP)en
dc.rightsAttribution-NonCommercial-NoDerivs 3.0 Ireland
dc.rights.urihttps://creativecommons.org/licenses/by-nc-nd/3.0/ie/
dc.subjectCoFiFen_IE
dc.subjectfinancial reportsen_IE
dc.subjectFrench languageen_IE
dc.subjectCorpusen_IE
dc.titleCoFiF: A corpus of financial reports in French languageen_IE
dc.typeWorkshop paperen_IE
dc.date.updated2019-07-13T19:37:58Z
dc.identifier.doi10.13025/zjf2-fn10
dc.local.publishedsourcehttps://doi.org/10.13025/zjf2-fn10
dc.description.peer-reviewedpeer-reviewed
dc.contributor.funderScience Foundation Irelanden_IE
dc.contributor.funderEuropean Regional Development Funden_IE
dc.description.embargo2019-08-10
dc.internal.rssid16784803
dc.local.contactSina Ahmadi, The Insight Centre For Data Analytics, National University Of Ireland, Galway , The Deri Building . Email: s.ahmadi1@nuigalway.ie
dc.local.copyrightcheckedYes
dc.local.versionPUBLISHED
dcterms.projectinfo:eu-repo/grantAgreement/SFI/SFI Research Centres/12/RC/2289/IE/INSIGHT - Irelands Big Data and Analytics Research Centre/en_IE
nui.item.downloads141


Files in this item

Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record

Attribution-NonCommercial-NoDerivs 3.0 Ireland
Except where otherwise noted, this item's license is described as Attribution-NonCommercial-NoDerivs 3.0 Ireland