
Our paper about the new UD Bohairic Coptic Treebank was accepted at SyntaxFest 2025 !
This page lists selected papers by lab members - see also a complete list of Amir's publications
Our paper about the new UD Bohairic Coptic Treebank was accepted at SyntaxFest 2025 !
Our EMNLP 2024 paper presents a valuable genre-diverse PDTB-style dataset for English shallow discourse parsing across modalities, text types, and domains using a cascade of conversion modules leveraging enhanced RST annotations, thereby also enabling theoretical studies of discourse relation variation across frameworks
In our Machine Learning for Ancient Languages (ML4AL) workshop paper , we present a bidirectional RNN model for character prediction of Coptic characters in manuscript lacunae and use it to rank the likelihood of various textual reconstructions. A live demo of our models is available here !
Our EACL 2024 paper promotes a strict definition of entity salience by presenting GUMsley, a 12-genre challenge dataset for entity salience evaluation and shows how salient entities added to summarization models are beneficial for deriving higher-quality summaries with fewer hallucinated entities
Check our AACL-IJCNLP 2023 paper about incorporating singletons and mention-based features to improve coreference generalization
If you have any questions or feedback please let us know! If you'd like to join us: We accept new PhD and Masters students every year, please contact Amir Zeldes for more information.