Browsing Data Science Institute (Conference Papers) by Author "|~|SFI|~|"
Now showing items 1-20 of 26
-
The ACL RD-TEC: A Dataset for Benchmarking Terminology Extraction and Classification in Computational Linguistics
QasemiZadeh, Behrang; Handschuh, Siegfried (2014)This paper introduces ACL RD-TEC: a dataset for evaluating the extraction and classification of terms from literature in the domain of computational linguistics. The dataset is derived from the Association for Computational ... -
Analyzing Social Behavior of Software Developers Across Different Communication Channels
Iqbal, Aftab (2013)Software developers use different project repositories (i.e., mailing list, bug tracking repositories, discussion forums etc.) to interact with each other or to solve software related problems. The growing interest in the ... -
Benchmarking Domain-Specific Expert Search Using Workshop Program Committees
Bordea, Georgeta; Buitelaar, Paul (ACM, 2013)Traditionally, relevance assessments for expert search have been gathered through self-assessment or based on the opinions of co-workers. We introduce three benchmark datasets1 for expert search that use conference workshops ... -
Developing a Dataset for Technology Structure Mining
QasemiZadeh, Behrang; Buitelaar, Paul; Monaghan, Fergal (IEEE, 2010)This paper describes steps that have been taken to construct a development dataset for the task of Technology Structure Mining. We have defined the proposed task as the process of mapping a scientific corpus into a ... -
Domain-independent term extraction through domain modelling
Bordea, Georgeta; Buitelaar, Paul; Polajnar, Tamara (10th International Conference on Terminology and Artificial Intelligence, 2013-09-11)Extracting general or intermediate level terms is a relevant problem that has not received much attention in literature. Current approaches for term extraction rely on contrastive corpora to identify domain-specific terms, ... -
An Eigenvalue-Based Measure for Word-Sense Disambiguation
Hulpus, Ioana; Hayes, Conor; Greene, Derek (FLAIRS 2012, 2012)Current approaches for word-sense disambiguation (WSD) try to relate the senses of the target words by optimizing a score for each sense in the context of all other words' senses. However, by scoring each sense separately, ... -
A Formal Investigation of Semantic Interoperability of HCLS Systems
Sahay, Ratnesh (IGI Global, 2013)Semantic interoperability facilitates Health Care and Life Sciences (HCLS) systems in connecting stakeholders (e.g., patient, physician, pharmacy) at various levels as well as ensure seamless use of healthcare resources ... -
GenomeSnip: Fragmenting the Genomic Wheel to augment discovery in cancer research
Kamdar, M; Iqbal, A; Saleem, M; Deus, H; Decker, S (2014)Cancer genomics researchers have greatly benefited from high-throughput technologies for the characterization of genomic alterations in patients. These voluminous genomics datasets when supplemented with the appropriate ... -
Hot Topics and Schisms in NLP: Community and Trend Analysis with Saffron on ACL and LREC Proceedings
Buitelaar, Paul; Bordea, Georgeta; Coughlan, Barry (2014)In this paper we present a comparative analysis of two series of conferences in the field of Computational Linguistics, the LREC conference and the ACL conference. Conference proceedings were ... -
Investigating Context Parameters in Technology Term Recognition
QasemiZadeh, Behrang; Handschuh, siegfried (2014)We propose and evaluate the task of technology term recognition: a method to extract technology terms at a synchronic level from a corpus of scientific publications. The proposed method is built on the principles of ... -
Kanopy: Analysing the Semantic Network around Document Topics
Hulpus, Ioana; Hayes, Conor; Karnstedt, Marcel; Greene, Derek; Jozwowicz, Marek (Springer, 2013)External knowledge bases, both generic and domain specific, available on the Web of Data have the potential of enriching the content of text documents with structured information. We present the Kanopy system that makes ... -
On-the-Fly Adaptive Planning for Game-Based Learning
Hulpus, Ioana; Hayes, Conor (2010)In this paper, we present a model for competency development using serious games, which is underpinned by a hierarchical case-based planning strategy. In our model, a learner s objectives are addressed by retrieving a ... -
Querying Phenotype-Genotype Associations across Multiple Knowledge Bases using Semantic Web Technologies
Iqbal, Aftab (2013)Biomedical and genomic data are inherently heterogeneous and their recent proliferation over the Web has demanded innovative querying methods to help domain experts in their clinical and research studies. In this paper we ... -
Random Manhattan Indexing
QasemiZadeh, Behrang; Handschuh, Siegfried (2014)Vector space models (VSMs) are mathematically well-defined frameworks that have been widely used in text processing. In these models, high-dimensional, often sparse vectors represent text units. In an application, the ... -
Random Manhattan Integer Indexing: Incremental L1 Normed Vector Space Construction
QasemiZadeh, Behrang; Handschuh, Siegfried (2014)Vector space models (VSMs) are mathematically well-defined frameworks that have been widely used in the distributional approaches to semantics. In VSMs, high-dimensional vectors represent linguistic entities. In an ... -
A Roadmap for navigating the Life Sciences Linked Open Data Cloud
Hasnain, Ali; Sana e Zainab, Syeda; Kamdar, Maulik; Mehmood, Qaiser; Deus, Helena; Mehdi, Muntazir; Decker, Stefan (2014)Multiple datasets that add high value to biomedical research have been exposed on the web as a part of the Life Sciences Linked Open Data (LSLOD) Cloud. The ability to easily navigate through these datasets is crucial for ... -
A Roadmap for navigating the Life Sciences Linked Open Data Cloud
Mehmood, Qaiser (2014) -
Semantically Interlinked Notification System for Ubiquitous Presence Management
Mehmood, Qaiser; Ali, Muhammad Intizar; Mileo, Alessandra (Springer, 2013)Presence based notification systems play a pivotal role in any collaborative working environment by providing near real time information about the status, locality and presence of the collaborators. Instant Messaging (IM) ... -
SemStim at the Linked Open Data-enabled Recommender Systems 2014 challenge
Heitmann, Benjamin; Hayes, Conor (Springer, 2014-10-14)SemStim is a graph-based recommendation algorithm which is based on Spreading Activation and adds targeted activation and duration constraints. SemStim is not affected by data sparsity, the cold-start problem or data quality ... -
Towards Social Event Detection and Contextualisation for Journalists
Khare, Prashant; Heravi, Bahareh Rahmanzadeh (2014-08-23)Social media platforms have become an important source of information in course of a break- ing news event, such as natural calamity, political uproar, etc. News organisations and journal- ists are increasingly realising ...