default search action
Serge Sharoff
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c61]Nurbanu Aksoy, Nishant Ravikumar, Serge Sharoff:
Enhancing Image-to-Text Generation in Radiology Reports through Cross-modal Multi-Task Learning. LREC/COLING 2024: 5977-5985 - [c60]Adaeze Ohuoba, Serge Sharoff, Callum Walker:
Quantifying the Contribution of MWEs and Polysemy in Translation Errors for English-Igbo MT. EAMT (1) 2024: 537-547 - [c59]Horacio Saggion, John O'Flaherty, Thomas Blanchet, Serge Sharoff, Silvia Sanfilippo, Lian Muñoz, Martin Gollegger, Almudena Rascón, José L. Martí, Sandra Szasz, Stefan Bott, Volkan Sayman:
Making Democratic Deliberation and Participation more Accessible: The iDEM Project. SEPLN (Projects and Demonstrations) 2024: 71-76 - 2023
- [j15]Dmitri Roussinov, Serge Sharoff, Nadezhda Puchnina:
Fine-tuning language models to recognize semantic relations. Lang. Resour. Evaluation 57(4): 1463-1486 (2023) - [c58]Dmitri Roussinov, Serge Sharoff:
BERT Goes Off-Topic: Investigating the Domain Transfer Challenge using Genre Classification. EMNLP (Findings) 2023: 468-483 - [c57]Mikhail Lepekhin, Serge Sharoff:
FTD at SemEval-2023 Task 3: News Genre and Propaganda Detection by Comparing Mono- and Multilingual Models with Fine-tuning on Additional Data. SemEval@ACL 2023: 549-555 - [i11]Yuqian Dai, Serge Sharoff, Marc de Kamps:
GATology for Linguistics: What Syntactic Dependencies It Knows. CoRR abs/2305.13403 (2023) - [i10]Yuqian Dai, Serge Sharoff, Marc de Kamps:
Syntactic Knowledge via Graph Attention with BERT in Machine Translation. CoRR abs/2305.13413 (2023) - [i9]Nurbanu Aksoy, Serge Sharoff, Selcuk Baser, Nishant Ravikumar, Alejandro F. Frangi:
Beyond Images: An Integrative Multi-modal Approach to Chest X-Ray Report Generation. CoRR abs/2311.11090 (2023) - [i8]Dmitri Roussinov, Serge Sharoff:
BERT Goes Off-Topic: Investigating the Domain Transfer Challenge using Genre Classification. CoRR abs/2311.16083 (2023) - 2022
- [c56]Valeriy Lobov, Alexandra Ivoylova, Serge Sharoff:
Applying Natural Annotation and Curriculum Learning to Named Entity Recognition for Under-Resourced Languages. COLING 2022: 4468-4480 - [c55]Jose Sosa, Serge Sharoff:
Multimodal Pipeline for Collection of Misinformation Data from Telegram. LREC 2022: 1480-1489 - [c54]Mikhail Lepekhin, Serge Sharoff:
Estimating Confidence of Predictions of Individual Classifiers and TheirEnsembles for the Genre Classification Task. LREC 2022: 5974-5982 - [c53]Yuqian Dai, Marc de Kamps, Serge Sharoff:
BERTology for Machine Translation: What BERT Knows about Linguistic Difficulties for Translation. LREC 2022: 6674-6690 - [c52]Nouran Khallaf, Serge Sharoff, Rasha Soliman:
Towards Arabic Sentence Simplification via Classification and Generative Approaches. WANLP@EMNLP 2022: 43-52 - [i7]Nouran Khallaf, Serge Sharoff:
Towards Arabic Sentence Simplification via Classification and Generative Approaches. CoRR abs/2204.09292 (2022) - [i6]Jose Sosa, Serge Sharoff:
Multimodal Pipeline for Collection of Misinformation Data from Telegram. CoRR abs/2204.12690 (2022) - [i5]Mikhail Lepekhin, Serge Sharoff:
Estimating Confidence of Predictions of Individual Classifiers and Their Ensembles for the Genre Classification Task. CoRR abs/2206.07427 (2022) - 2021
- [c51]Nouran Khallaf, Serge Sharoff:
Automatic Difficulty Classification of Arabic Sentences. WANLP 2021: 105-114 - [e9]Reinhard Rapp, Serge Sharoff, Pierre Zweigenbaum:
Proceedings of the 14th Workshop on Building and Using Comparable Corpora, BUCC@RANLP 2021, Online, September 6, 2021. INCOMA Ltd. 2021, ISBN 978-954-452-076-2 [contents] - [i4]Nouran Khallaf, Serge Sharoff:
Automatic Difficulty Classification of Arabic Sentences. CoRR abs/2103.04386 (2021) - [i3]Mikhail Lepekhin, Serge Sharoff:
Experiments with adversarial attacks on text genres. CoRR abs/2107.02246 (2021) - 2020
- [j14]Serge Sharoff:
Finding next of kin: Cross-lingual embedding spaces for related languages. Nat. Lang. Eng. 26(2): 163-182 (2020) - [c50]Reinhard Rapp, Pierre Zweigenbaum, Serge Sharoff:
Overview of the Fourth BUCC Shared Task: Bilingual Dictionary Induction from Comparable Corpora. BUCC@LREC 2020: 6-13 - [c49]Dmitri Roussinov, Serge Sharoff, Nadezhda Puchnina:
Recognizing Semantic Relations: Attention-Based Transformers vs. Recurrent Models. ECIR (1) 2020: 561-574 - [c48]Yu Yuan, Serge Sharoff:
Sentence Level Human Translation Quality Estimation with Attention-based Neural Networks. LREC 2020: 1858-1865 - [c47]Serge Sharoff:
Know thy Corpus! Robust Methods for Digital Curation of Web corpora. LREC 2020: 2453-2460 - [c46]Dmitri Roussinov, Serge Sharoff, Nadezhda Puchnina:
Recognizing Semantic Relations by Combining Transformers and Fully Connected Models. LREC 2020: 5838-5845 - [e8]Reinhard Rapp, Pierre Zweigenbaum, Serge Sharoff:
Proceedings of the 13th Workshop on Building and Using Comparable Corpora, BUCC@LREC 2020, Marseille, France, May, 2020. European Language Resources Association 2020, ISBN 979-10-95546-42-9 [contents] - [i2]Yu Yuan, Serge Sharoff:
Sentence Level Human Translation Quality Estimation with Attention-based Neural Networks. CoRR abs/2003.06381 (2020) - [i1]Serge Sharoff:
Know thy corpus! Robust methods for digital curation of Web corpora. CoRR abs/2003.06389 (2020)
2010 – 2019
- 2019
- [c45]Maria Kunilovskaya, Serge Sharoff:
Towards Functionally Similar Corpus Resources for Translation. RANLP 2019: 583-592 - [p6]Reinhard Rapp, Vivian Xu, Michael Zock, Serge Sharoff, Richard S. Forsyth, Bogdan Babych, Chenhui Chu, Toshiaki Nakazawa, Sadao Kurohashi:
New Areas of Application of Comparable Corpora. Using Comparable Corpora for Under-Resourced Areas of Machine Translation 2019: 255-290 - 2018
- [c44]Serge Sharoff:
Language adaptation experiments via cross-lingual embeddings for related languages. LREC 2018 - [c43]Yu Yuan, Yuze Gao, Yue Zhang, Serge Sharoff:
Cross-lingual Terminology Extraction for Translation Quality Estimation. LREC 2018 - [c42]Yu Yuan, Serge Sharoff:
Investigating the Influence of Bilingual MWU on Trainee Translation Quality. LREC 2018 - [c41]Pierre Zweigenbaum, Serge Sharoff, Reinhard Rapp:
A Multilingual Dataset for Evaluating Parallel Sentence Extraction from Comparable Corpora. LREC 2018 - [c40]Pierre Zweigenbaum, Serge Sharoff, Reinhard Rapp:
Overview of the Third BUCC Shared Task: Spotting Parallel Sentences in Comparable Corpora. BUCC@LREC 2018 - 2017
- [c39]Serge Sharoff:
Toward Pan-Slavic NLP: Some Experiments with Language Adaptation. BSNLP@EACL 2017: 1-2 - [c38]Pierre Zweigenbaum, Serge Sharoff, Reinhard Rapp:
Overview of the Second BUCC Shared Task: Spotting Parallel Sentences in Comparable Corpora. BUCC@ACL 2017: 60-67 - [e7]Serge Sharoff, Pierre Zweigenbaum, Reinhard Rapp:
Proceedings of the 10th Workshop on Building and Using Comparable Corpora, BUCC@ACL 2017, Vancouver, Canada, August 3, 2017. Association for Computational Linguistics 2017, ISBN 978-1-945626-61-6 [contents] - 2016
- [j13]Noushin Rezapour Asheghi, Serge Sharoff, Katja Markert:
Crowdsourcing for web genre annotation. Lang. Resour. Evaluation 50(3): 603-641 (2016) - [j12]Reinhard Rapp, Serge Sharoff, Pierre Zweigenbaum:
Preface. Nat. Lang. Eng. 22(4): 497-500 (2016) - [j11]Reinhard Rapp, Serge Sharoff, Pierre Zweigenbaum:
Recent advances in machine translation using comparable corpora. Nat. Lang. Eng. 22(4): 501-516 (2016) - [j10]Miguel Rios, Serge Sharoff:
Language Adaptation for Extending Post-Editing Estimates for Closely Related Languages. Prague Bull. Math. Linguistics 106: 181-192 (2016) - [c37]Erika Dalan, Serge Sharoff:
Genre classification for a corpus of academic webpages. WAC@ACL 2016: 90-98 - [c36]Roger Evans, Alexander F. Gelbukh, Gregory Grefenstette, Patrick Hanks, Milos Jakubícek, Diana McCarthy, Martha Palmer, Ted Pedersen, Michael Rundell, Pavel Rychlý, Serge Sharoff, David Tugwell:
Adam Kilgarriff's Legacy to Computational Linguistics and Beyond. CICLing (1) 2016: 3-25 - [c35]Yu Yuan, Serge Sharoff, Bogdan Babych:
MoBiL: A Hybrid Feature Set for Automatic Human Translation Quality Assessment. LREC 2016 - 2015
- [j9]Serge Sharoff:
Web Corpus Construction Roland Schäfer and Felix Bildhauer (Freie Universität Berlin) Morgan & Claypool (Synthesis Lectures on Human Language Technologies, edited by Graeme Hirst, volume 22), 2013, 145 pages, paper-bound, ISBN 9781608459834, doi: 10.2200/S00508ED1V01Y201305HLT022. Comput. Linguistics 41(1): 161-163 (2015) - [c34]Anisya Katinskaya, Serge Sharoff:
Applying Multi-Dimensional Analysis to a Russian Webcorpus: Searching for Evidence of Genres. BSNLP@RANLP 2015: 65-74 - [c33]Miguel Rios, Serge Sharoff:
Obtaining SMT dictionaries for related languages. BUCC@ACL/IJCNLP 2015: 68-73 - [c32]Serge Sharoff, Pierre Zweigenbaum, Reinhard Rapp:
BUCC Shared Task: Cross-Language Document Similarity. BUCC@ACL/IJCNLP 2015: 74-78 - [c31]Miguel Angel Ríos Gaona, Serge Sharoff:
Large Scale Translation Quality Estimation. DMTW 2015: 81-88 - [e6]Pierre Zweigenbaum, Serge Sharoff, Reinhard Rapp:
Proceedings of the Eighth Workshop on Building and Using Comparable Corpora, BUCC@ACL/IJCNLP 2015, Beijing, China, July 30, 2015. Association for Computational Linguistics 2015, ISBN 978-1-941643-60-0 [contents] - 2014
- [j8]Richard S. Forsyth, Serge Sharoff:
Document dissimilarity within and across languages: A benchmarking study. Lit. Linguistic Comput. 29(1): 6-22 (2014) - [j7]Serge Sharoff, Stefania Spina, Sofie Johansson Kokkinakis:
Introduction to the special issue on Resources and Tools for Language Learners. Lang. Resour. Evaluation 48(1): 1-3 (2014) - [j6]Adam Kilgarriff, Frieda Charalabopoulou, Maria Gavrilidou, Janne Bondi Johannessen, Saussan Khalil, Sofie Johansson Kokkinakis, Robert Lew, Serge Sharoff, Ravikiran Vadlapudi, Elena Volodina:
Corpus-based vocabulary lists for language learners for nine languages. Lang. Resour. Evaluation 48(1): 121-163 (2014) - [c30]Marilena Di Bari, Serge Sharoff, Martin Thomas:
Multiple views as aid to linguistic annotation error analysis. LAW@COLING 2014: 82-86 - [c29]Reinhard Rapp, Serge Sharoff:
Extracting Multiword Translations from Aligned Comparable Documents. HyTra@EACL 2014: 87-95 - [c28]Noushin Rezapour Asheghi, Serge Sharoff, Katja Markert:
Designing and Evaluating a Reliable Corpus of Web Genres via Crowd-Sourcing. LREC 2014: 1339-1346 - [c27]Noushin Rezapour Asheghi, Katja Markert, Serge Sharoff:
Semi-supervised Graph-based Genre Classification for Web Pages. TextGraphs@EMNLP 2014: 39-47 - 2013
- [c26]Pavel Braslavski, Alexander Beloborodov, Maxim Khalilov, Serge Sharoff:
English-to-Russian MT evaluation campaign. ACL (2) 2013: 262-267 - [c25]Marilena Di Bari, Serge Sharoff, Martin Thomas:
SentiML: functional annotation for multilingual sentiment analysis. DH-CASE@DocEng 2013: 15:1-15:7 - [p5]Serge Sharoff, Reinhard Rapp, Pierre Zweigenbaum:
Overviewing Important Aspects of the Last Twenty Years of Research in Comparable Corpora. Building and Using Comparable Corpora 2013: 1-17 - [p4]Serge Sharoff:
Measuring the Distance Between Comparable Corpora Between Languages. Building and Using Comparable Corpora 2013: 113-130 - [e5]Serge Sharoff, Reinhard Rapp, Pierre Zweigenbaum, Pascale Fung:
Building and Using Comparable Corpora. Springer Berlin Heidelberg 2013, ISBN 978-3-642-20127-1 [contents] - [e4]Serge Sharoff, Pierre Zweigenbaum, Reinhard Rapp:
Proceedings of the Sixth Workshop on Building and Using Comparable Corpora, BUCC@ACL 2013, Sofia, Bulgaria, August 8, 2013. Association for Computational Linguistics 2013, ISBN 978-1-937284-60-2 [contents] - 2012
- [c24]Bogdan Babych, Kurt Eberle, Johanna Geiß, Mireia Ginestí-Rosell, Anthony Hartley, Reinhard Rapp, Serge Sharoff, Martin Thomas:
Design of a hybrid high quality machine translation system. ESIRMT/HyTra@EACL 2012: 101-112 - [c23]Reinhard Rapp, Serge Sharoff, Bogdan Babych:
Identifying Word Translations from Comparable Documents Without a Seed Lexicon. LREC 2012: 460-466 - [e3]Reinhard Rapp, Pierre Zweigenbaum, Serge Sharoff:
11th Workshop on Building and Using Comparable Corpora - Special Topic: Comparable Corpora for Asian Languages, BUCC@LREC 2018, Miyazaki, Japan, May 8, 2018. European Language Resources Association 2012, ISBN 979-10-95546-07-8 [contents] - 2011
- [p3]Marina Santini, Alexander Mehler, Serge Sharoff:
Riding the Rough Waves of Genre on the Web. Genres on the Web 2011: 3-30 - [p2]Serge Sharoff:
In the Garden and in the Jungle. Genres on the Web 2011: 149-166 - [p1]Marina Santini, Serge Sharoff, Alexander Mehler:
Any Land in Sight? Genres on the Web 2011: 351-354 - [e2]Alexander Mehler, Serge Sharoff, Marina Santini:
Genres on the Web. Text, Speech and Language Technology 42, Springer 2011, ISBN 978-90-481-9177-2 [contents] - [e1]Pierre Zweigenbaum, Reinhard Rapp, Serge Sharoff:
Proceedings of the 4th Workshop on Building and Using Comparable Corpora: Comparable Corpora and the Web, BUCC@ACL 2011, Portland, OR, USA, June 24, 2011. Association for Computational Linguistics 2011, ISBN 9781937284015 [contents] - 2010
- [j5]Yun-hua Qu, Tian-jiong Tao, Serge Sharoff, Narisong Jin, Ruoyuan Gao, Nan Zhang, Yu-Ting Yang, Cheng-zhi Xu:
Using an integrated feature set to generalize and justify the Chinese-to-English transferring rule of the 'ZHE' aspect. J. Zhejiang Univ. Sci. C 11(9): 663-676 (2010) - [j4]Paul Rayson, Scott Piao, Serge Sharoff, Stefan Evert, Begoña Villada Moirón:
Multiword expressions: hard going or plain sailing? Lang. Resour. Evaluation 44(1-2): 1-5 (2010) - [c22]Zhili Wu, Katja Markert, Serge Sharoff:
Fine-Grained Genre Classification Using Structural Learning Algorithms. ACL 2010: 749-759 - [c21]Serge Sharoff, Zhili Wu, Katja Markert:
The Web Library of Babel: evaluating genre collections. LREC 2010 - [c20]James Wilson, Anthony Hartley, Serge Sharoff, Paul Stephenson:
Advanced Corpus Solutions for Humanities Researchers. PACLIC 2010: 769-778
2000 – 2009
- 2009
- [j3]Marina Santini, Serge Sharoff:
Web Genre Benchmark Under Construction. J. Lang. Technol. Comput. Linguistics 24(1): 129-145 (2009) - [j2]Serge Sharoff, Bogdan Babych, Anthony Hartley:
'Irrefragable answers' using comparable corpora to retrieve translation equivalents. Lang. Resour. Evaluation 43(1): 15-25 (2009) - [c19]Bogdan Babych, Anthony Hartley, Serge Sharoff:
Evaluation-Guided Pre-Editing of Source Text: Improving MT-Tractability of Light Verb Constructions. EAMT 2009 - 2008
- [c18]Bogdan Babych, Serge Sharoff, Anthony Hartley:
Generalising Lexical Translation Strategies for MT Using Comparable Corpora. LREC 2008 - [c17]Marco Baroni, Francis Chantree, Adam Kilgarriff, Serge Sharoff:
Cleaneval: a Competition for Cleaning Web Pages. LREC 2008 - [c16]Svitlana Kurella, Serge Sharoff, Anthony Hartley:
Corpus-Based Tools for Computer-Assisted Acquisition of Reading Abilities in Cognate Languages. LREC 2008 - [c15]Serge Sharoff, Mikhail Kopotev, Tomaz Erjavec, Anna Feldman, Dagmar Divjak:
Designing and Evaluating a Russian Tagset. LREC 2008 - 2007
- [j1]J. Delin, Serge Sharoff, S. Lillford, Catherine Barnes:
Linguistic support for concept selection decisions. Artif. Intell. Eng. Des. Anal. Manuf. 21(2): 123-135 (2007) - [c14]Bogdan Babych, Anthony Hartley, Serge Sharoff, Olga Mudraya:
Assisting Translators in Indirect Lexical Transfer. ACL 2007 - [c13]Bogdan Babych, Anthony Hartley, Serge Sharoff:
Translating from under-resourced languages: comparing direct transfer against pivot translation. MTSummit 2007 - 2006
- [c12]Serge Sharoff, Bogdan Babych, Anthony Hartley:
Using Comparable Corpora to Solve Problems Difficult for Human Translators. ACL 2006 - [c11]Serge Sharoff, Bogdan Babych, Paul Rayson, Olga Mudraya, Scott Piao:
ASSIST: Automated Semantic Assistance for Translators. EACL 2006 - [c10]Serge Sharoff, Bogdan Babych, Anthony Hartley:
Using collocations from comparable corpora to find translation equivalents. LREC 2006: 465-470 - [c9]Serge Sharoff:
A Uniform Interface to Large-Scale Linguistic Resources. LREC 2006: 539-542 - [c8]Dragoç Ciobanu, Tony Hartley, Serge Sharoff:
Using Richly Annotated Trilingual Language Resources for Acquiring Reading Skills in a Foreign Language. LREC 2006: 543-548 - 2004
- [c7]Serge Sharoff:
Towards Basic Categories for Describing Properties of Texts in a Corpus. LREC 2004 - 2002
- [c6]Serge Sharoff:
Meaning as use: exploitation of aligned corpora for the contrastive study of lexical semantics. LREC 2002 - 2001
- [c5]Dafydd Gibbon, Thorsten Trippel, Serge Sharoff:
Concordancing for parallel spoken language corpora. INTERSPEECH 2001: 2063-2066 - 2000
- [c4]Geert-Jan M. Kruijff, Elke Teich, John A. Bateman, Ivana Kruijff-Korbayová, Hana Skoumalová, Serge Sharoff, Elena G. Sokolova, Tony Hartley, Kamenka Staykova, Jiri Hana:
Multilinguality in a Text Generation System For Three Slavic Languages. COLING 2000: 474-480 - [c3]John A. Bateman, Elke Teich, Geert-Jan M. Kruijff, Ivana Kruijff-Korbayová, Serge Sharoff, Hana Skoumalová:
Resources for Multilingual Text Generation in Three Slavic Languages. LREC 2000
1990 – 1999
- 1999
- [c2]Serge Sharoff, Vlad Zhigalov:
Register-domain Separation as a Methodology for Development of Natural Language Interfaces to Databases. INTERACT 1999: 79-85 - 1996
- [c1]Irina Kononenko, Serge Sharoff:
Understanding Short Texts with Integration of Knowledge Representation Methods. Ershov Memorial Conference 1996: 111-121
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-07 21:21 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint