Wersja 20 z 2014-08-19 12:23:00

Usuń wiadomość
Locked History Actions

About

The CORE project

The work reported here was carried out within the Computer-based methods for coreference resolution in Polish texts (PL: Komputerowe metody identyfikacji nawiązań w tekstach polskich) project financed by the Polish National Science Centre (contract number 6505/B/T02/2011/40) and carried out between April 2011 and July 2014 at the Institute of Computer Science, Polish Academy of Sciences. It was targeted at the creation of innovative methods and tools for automated coreference resolution in Polish, with planned quality compared to state-of-the-art tools available for other languages.

Project results

Publications

  1. Ogrodniczuk (ed.) – Coreference: Annotation, Resolution and Evaluation in Polish (a monograph published by Walther De Gruyter)

    • BibTeX

      PDF
      DFDF

  2. Ogrodniczuk, Wójcicka, Głowińska, Kopeć – Nested Mention Detection for Polish Coreference Resolution (PolTAL 2014 conference paper)

    • BibTeX
    • PDF

    • DFDF
  3. Zawisławska, Ogrodniczuk – The same or just much the same? Problems with coreference from the reader’s perspective

(book chapter; see also BibTeX )

Plakaty

Wystąpienia

Instrukcje i raporty

Inne dokumenty wewnętrzne

Raporty techniczne ze stażów

Piotra Batki ze stażu lingwistycznego:

Bartłomieja Nitonia ze stażu informatycznego:

Zasoby i narzędzia

External contributions

Parts of the work described here were also contributed by other externally funded projects, carried out simultaneously with CORE:

  • works on the new version of the Polish grammar for Spejd by Alicja Wójcicka and Katarzyna Głowińska were co-funded by the Polish Ministry of Science and Higher Education as an Investment in CLARINPL Research Infrastructure and by the European Union from resources of the European Social Fund
  • works related to linguistic evaluation of usefulness of Uryupina’s coreference features for Polish by Piotr Batko and development of adaptation of BART (Beautiful Anaphora Resolution Toolkit) for Polish by Bartłomiej Nitoń were co-funded by the European Union from financial resources of the European Social Fund, project PO KL Information technologies: Research and their interdisciplinary applications

  • works related to coreference-based approach to summarization were carried out within PhD studies of Mateusz Kopeć at the Institute of Computer Science, Polish Academy of Sciences

  • help with adaptation of coreference tools to Multiservice, a Web service framework for Polish NLP tools, was offered by Michał Lenart taking part in CESAR project (Central and South-east European Resources, part of META-NET) financed from a European Competitiveness and Innovation framework Programme, Information and Communication Technologies Policy Support Programme (CIP ICT-PSP, grant agreement 271022)

  • projection-based experiments were made possible by the University Research Program for Google Translate

  • contacts established with the parallel French coreference annotation project ANCOR were also beneficial for some of our scientific results and helped relate the CORE project more deeply to the international coreference community.

Project team

The core CORE project team constituted of (almost alphabetically):

  • Maciej Ogrodniczuk — principal investigator
  • Barbara Dunin-Kęplicz — formalization of coreference rules
  • Maria Głąbska — coreference annotation
  • Katarzyna Głowińska — linguistic expertise related to anaphora, coreference and Polish syntax
  • Anna Grzeszak — coreference annotation
  • Mateusz Kopeć — technical leadership, implementation and IT design, development of the annotation environment and project tools
  • Emilia Kubicka — coreference annotation
  • Barbara Masny — coreference annotation
  • Paulina Rosalska — coreference annotation
  • Agata Savary — coreference annotation and annotation work expertise
  • Magdalena Zawisławska — linguistic and semantic expertise, annotation management, adjudication of the annotation of Polish Coreference Corpus
  • Sebastian Żurowski — coreference annotation

but there were numerous other people, mainly colleagues from the Linguistic Engineering Group at the Institute of Computer Science, Polish Academy of Sciences, who contributed to various stages of the project with their selfless help:

  • Piotr Batko — coreference annotation, verification of coreference features for Polish (linguistic part)
  • Łukasz Degórski — help related to processing NKJP data
  • Łukasz Debowski — statistical expertise
  • Michał Lenart — help related to processing NKJP data, hardware expertise, Multiservice integration assistance
  • Małgorzata Marciniak — HPSG anaphora expertise
  • Bartłomiej Nitoń — verification of coreference features for Polish (implementation part)
  • Adam Przepiórkowski — linguistic and natural language processing expertise, management of co-operation with the National Corpus of Polish
  • Filip Skwarski — translation and proofreading
  • Jakub Waszczuk — expertise related to annotation and named entity-related tools, versioning system management
  • Joanna Wierucka — translation and proofreading.