dc.contributor.author | Ahmadi, Sina | |
dc.contributor.author | Daudert, Tobias | |
dc.date.accessioned | 2019-07-19T10:33:35Z | |
dc.date.issued | 2019-08-12 | |
dc.identifier.citation | Ahmadi, Sina, & Daudert, Tobias. (2019). CoFiF: A corpus of financial reports in French language. Paper presented at the The First Workshop on Financial Technology and Natural Language Processing (FinNLP), Macao, China, 12 August, https://doi.org/10.13025/zjf2-fn10 | en_IE |
dc.identifier.uri | http://hdl.handle.net/10379/15276 | |
dc.description.abstract | In an era when machine learning and artificial intelligence have huge momentum, the data demand to train and test models is steadily growing. We introduce CoFiF, the first corpus comprising company reports in the French language. It contains over 188 million tokens in 2655 reports, covering reference documents, annual, semestrial and trimestrial reports. Our main focus is on the 60 largest French companies listed in France s main stock indices CAC40 and CAC Next 20. The corpus spans over 20 years, ranging from 1995 to 2018. To evaluate this novel collection of organizational writing, we use CoFiF to generate two character-level language models, a forward and a backward one, which we use to demonstrate the corpus potential on business, economics, and management research in the French language. | en_IE |
dc.format | application/pdf | en_IE |
dc.language.iso | en | en_IE |
dc.publisher | NUI Galway | en_IE |
dc.relation.ispartof | The First Workshop on Financial Technology and Natural Language Processing (FinNLP) | en |
dc.rights | Attribution-NonCommercial-NoDerivs 3.0 Ireland | |
dc.rights.uri | https://creativecommons.org/licenses/by-nc-nd/3.0/ie/ | |
dc.subject | CoFiF | en_IE |
dc.subject | financial reports | en_IE |
dc.subject | French language | en_IE |
dc.subject | Corpus | en_IE |
dc.title | CoFiF: A corpus of financial reports in French language | en_IE |
dc.type | Workshop paper | en_IE |
dc.date.updated | 2019-07-13T19:37:58Z | |
dc.identifier.doi | 10.13025/zjf2-fn10 | |
dc.local.publishedsource | https://doi.org/10.13025/zjf2-fn10 | |
dc.description.peer-reviewed | peer-reviewed | |
dc.contributor.funder | Science Foundation Ireland | en_IE |
dc.contributor.funder | European Regional Development Fund | en_IE |
dc.description.embargo | 2019-08-10 | |
dc.internal.rssid | 16784803 | |
dc.local.contact | Sina Ahmadi, The Insight Centre For Data Analytics, National University Of Ireland, Galway , The Deri Building . Email: s.ahmadi1@nuigalway.ie | |
dc.local.copyrightchecked | Yes | |
dc.local.version | PUBLISHED | |
dcterms.project | info:eu-repo/grantAgreement/SFI/SFI Research Centres/12/RC/2289/IE/INSIGHT - Irelands Big Data and Analytics Research Centre/ | en_IE |
nui.item.downloads | 141 | |