2020
(2020) Intelligent Translation Memory Matching and Retrieval with Sentence Encoders, Proceedings of the 22nd Annual Conference of the European Association for Machine Translation, p. 175 - 184, pdf
(2020) TransQuest at WMT2020: Sentence-Level Direct Assessment, Proceedings ofthe 5th Conference on Machine Translation (WMT), p. 1047-1053, url
2019
(2019) Identifying signs of syntactic complexity for rule-based sentence simplification, Natural Language Engineering 25(1), p. 69-119, url, doi:10.1017/S1351324918000384
(2019) Sentence Simplification for Semantic Role Labelling and Information Extraction, Proceedings of Recent Advances in Natural Language Processing (RANLP2019), p. 285-294, pdf, doi:10.26615/978-954-452-056-4_033
(2019) Exploiting Data-Driven Hybrid Approaches to Translation in the EXPERT Project, Advances in Empirical Translation Studies, p. 198-216, Cambridge University Press, url, doi:10.1017/9781108525695.011
(2019) Automatic summarisation: 25 years On, Natural Language Engineering 25(6), p. 735-751, url, doi:10.1017/S1351324919000524
(2019) RGCL-WLV at SemEval-2019 Task 12: Toponym Detection, Proceedings ofthe 13th International Workshop on Semantic Evaluation (SemEval-2019), p. 1297-1301, url, doi:10.18653/v1/S19-2228
(2019) Toponym Detection in the Bio-Medical Domain: A Hybrid Approach with Deep Learning, Proceedings of Recent Advances in Natural Language Processing (RANLP2019), p. 912-921, pdf, doi:10.26615/978-954-452-056-4_106
(2019) RGCL at GermEval 2019: Offensive Language Detection with Deep Learning, Proceedings of the 15th Conference on Natural Language Processing (KONVENS 2019), p. 423 - 428, pdf
(2019) Large-scale Data Harvesting for Biographical Data, Proceedings of the International Conference on Biographical Data in a Digital World 2019(September), url
(2019) Semantic Textual Similarity with Siamese Neural Networks, Proceedings of Recent Advances in Natural Language Processing (RANLP2019), p. 1005-1012, pdf, doi:10.26615/978-954-452-056-4_116
(2019) RGCL at IDAT : Deep Learning models for Irony Detection in Arabic Language, Working Notes of the Forum for Information Retrieval Evaluation (FIRE 2019), p. 416 - 425, pdf
(2019) Enhancing Unsupervised Sentence Similarity Methods with Deep Contextualised Word Representations, Proceedings of Recent Advances in Natural Language Processing (RANLP 2019), p. 994-1003, pdf, doi:10.26615/978-954-452-056-4_115
(2019) A Survey of the Perceived Text Adaptation Needs of Adults with Autism, Proceedings of Recent Advances in Natural Language Processing (RANLP 2019), p. 1356-1363, pdf, doi:10.26615/978-954-452-056-4_155
(2019) Proceedings of the Human-Informed Translation and Interpreting Technology Workshop (HiT-IT 2019), pdf
2018
(2018) Detection of Stress and Relaxation Magnitudes for Tweets, Companion of the The Web Conference 2018 on The Web Conference 2018 - WWW '18, p. 1677 - 1684, New York, New York, USA: ACM Press, url, doi:10.1145/3184558.3191627
(2018) Intelligent Text Processing to Help Readers with Autism, Intelligent Natural Language Processing: Trends and Applications, K. Shaalan, A. Hassanien, F. Tolba (ed.), p. 713-740, Springer, url, doi:10.1007/978-3-319-67056-0_33
(2018) Aggressive Language Identification Using Word Embeddings and Sentiment Features, Proceedings of the First Workshop on Trolling, Aggression and Cyberbullying (TRAC-2018), p. 113 - 119, url
(2018) Trouble on the Road : Finding Reasons for Commuter Stress from Tweets, Proceedings ofthe Workshop on Intelligent Interactive Systems and Language Generation (2IS&NLG), p. 20 - 25, url, doi:10.18653/v1/W18-6705
(2018) What Makes You Stressed? Finding Reasons From Tweets, Proceedings ofthe 9th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, p. 266 - 272, url, doi:10.18653/v1/W18-6239
2017
(2017) Questing for Quality Estimation A User Study, The Prague Bulletin of Mathematical Linguistics 108, p. 343–354, pdf, doi:10.1515/pralin-2017-0032
(2017) Combining Multiple Corpora for Readability Assessment for People with Cognitive Disabilities, Proceedings of the 12th Workshop on Innovative Use of NLP for Building Educational Applications, p. 121 - 132, url, doi:10.18653/v1/W17-5013
2016
(2016) 1st Shared Task on Automatic Translation Memory Cleaning Preparation and Lessons Learned, Proceedings of the 2nd Workshop on Natural Language Processing for Translation Memories (NLP4TM 2016), p. 1-6
(2016) The first Automatic Translation Memory Cleaning Shared Task, Machine Translation 30(3-4), p. 145-166, url, doi:10.1007/s10590-016-9183-x
(2016) Semantic Textual Similarity in Quality Estimation, Baltic Journal of Modern Computing 4(2), p. 256 - 268, url
(2016) WOLVESAAR at SemEval-2016 Task 1: Replicating the Success of Monolingual Word Alignment and Neural Embeddings for Semantic Textual Similarity, Proceedings of SemEval-2016(1), p. 634-639, url, doi:10.18653/v1/S16-1096
(2016) A Dynamic Programming Approach to Improving Translation Memory Matching and Retrieval Using Paraphrases, Text, Speech and Dialogue, P. Sojka, A. Horák, I. Kopeček, K. Pala (ed.), p. 259 - 269, Brno, CZ: Springer, url, doi:10.1007/978-3-319-45510-5_30
(2016) Improving translation memory matching and retrieval using paraphrases, Machine Translation 30(1), p. 19 - 40, url, doi:10.1007/s10590-016-9180-0
(2016) The EXPERT Project: Training the Future Experts in Translation Technology, In Proceedings of the 19th Annual Conference of the EAMT: Projects/Products, p. 393, pdf
2015
(2015) The Role of Corpus Pattern Analysis in Machine Translation Evaluation, Proceedings of the The 7th International Conference of the Iberian Association of Translation and Interpreting Studies (AIETI)
(2015) Can Translation Memories afford not to use paraphrasing?, Proceedings of the 18th Annual Conference of the European Association for Machine Translation (EAMT), p. 35 - 42, pdf
(2015) Machine Translation Evaluation using Recurrent Neural Networks, Proceedings of the Tenth Workshop on Statistical Machine Translation(September), p. 380-384, pdf
(2015) ReVal: A Simple and Effective Machine Translation Evaluation Metric Based on Recurrent Neural Networks, Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing(September), p. 1066-1072, url, doi:10.18653/v1/D15-1124
(2015) The EXPERT Project: Advancing the State of the Art in Hybrid Translation Technologies, Proceedings of Translating and the Computer 37
2014
(2014) Incorporating Paraphrasing in Translation Memory Matching and Retrieval, Proceedings of the Seventeenth Annual Conference of the European Association for Machine Translation (EAMT2014), p. 3 - 10, pdf
(2014) Intelligent Translation Memory Matching and Retrieval Metric Exploiting Linguistic Technology, Proceedings of the Translating and Computer 36, p. 86-89, pdf
2009
(2009) Comparative Evaluation of Term-Weighting Methods for Automatic Summarization, Journal of Quantitative Linguistics 16(1), p. 67-95, Routledge, url, doi:10.1080/09296170802514187
2008
(2008) Evaluation of a Cross-lingual Romanian-English Multi-document Summariser, Proceedings of 6th Language Resources and Evaluation Conference (LREC2008), p. 2114 -2119, url
2007
(2007) Corpora for computational linguistics, Ilha do Desterro: A Journal of Language and Literature 52, p. 65-101, url
2003
(2003) An evolutionary approach for improving the quality of automatic summaries, Proceedings of the ACL 2003 workshop on Multilingual summarization and question answering, p. 37, url
2000
(2000) A hybrid method for clause splitting in unrestricted English texts, Proceedings of ACIDCA '2000, Corpora and Natural Language Processing, p. 129-134, pdf