Aloes2010 Learner corpora workshop

Un article de Enseignants, l'encyclopéde libre.

ALOES 2010 pre-conference workshop


Sommaire

ABSTRACTS AND SPEAKERS

Abstracts : http://www-lshs.univ-paris13.fr/Enseignants/images/9/90/WORKSHOPMarch_25th.pdf


PROGRAMME LEARNER CORPORA : A workshop

Jointly organised by ALOES, CRIDAF and CLILLAC-ARP

Virtual attendance through the EVO system or locally at the Paris 13 videoconference room.

Institut Galilée,

salle de visioconférences

How to get there : number 2, building L2 on the map http://www.univ-paris13.fr/CRIDAF/Axc2.htm


The number of seats in the videoconference room is limited, so please contact us in advance by March 18

EmailFree.png




MORNING : the Charles V learner corpus


10 h Nicolas Ballier (Paris 7) introduction


10 h 15 investigating suprasegmental features


Evelyne Cauvin (Amiens) and Nicolas Ballier (Paris 7)

  • The multi-layered annotation guideline (v. 1)
  • investigating (prosodic) features in read speech


10 h 45 discussion case studies

Sample questions from MA students currently annotating some of the corpus files


11 h 15 investigating L2 segmental features

Adrien Méli (Paris 7) : “read my lips” Interlanguage and lip-rounding for French speakers


11 45 discussion:


12 15 LUNCH


AFTERNOON : multi-layered approaches to learner corpora


14 h DATA COLLECTION : SOUNDS AND TEXTS

Detmar Meurers, Niels Ott and Ramon Ziai (broadcasting from Universität Tübigen): Compiling a Task-Based Corpus for the Analysis of Learner Language in Context


14 30 discussion of alternative software for data-collection


15 h Dora Alexopoulou (Research Centre for English and Applied Linguistics, Cambridge)

Beyond the Cambridge learner Corpus: an overview of the English Profile Data Collection


15 30 discussion


16 h Thomas Gaillat (Rennes 1) investigating syntax in L2 French speakers : a preliminary approach


16h 30 discussion


17 h Nicolas Ballier (Paris 7) and Anna Diaz (Jaén, to be confirmed) error annotation and the Vilnius workshop


17 h 30 END

call for papers

There will be a pre-conference workshop on learner corpora on March 25th (2010). Studying non-native English with corpora raises interesting issues as to the transferability of the concept of interlanguage for the phonological competence of learners. We invite presentations of existing databases or projects under way addressing some of these issues :

  • - speech corpora or spoken corpora for learners?
  • - longitudinal studies and protocols
  • - databases and querying interface
  • - annotation layers and tools
  • - POS tagging and "error" tagging
  • - interlanguage studies, performances and phonological competence


The workshop will consist of longer talks and discussions (30 min. + 30 min.). The workshop will be videoconferenced through the (free) EVO system and talks from distant universities are welcome (Paris local time 10-18).


Anonymous abstracts for the pre-conference workshop should be sent by Feb. 1st to EmailFree.png (300 words maximum plus separate page giving personal details).


Technical details about the workshop:

  • Speakers may attend the workshop "virtually", broadcasting their talks from their home universities, using the free JAVA-based system EVO, to be downloaded from http://evo.caltech.edu
  • on-line tutorials for EVO:

http://evo.caltech.edu/evoGate/videosTutorials.jsp


  • Get in touch for tests or informal talk about the system.

EmailFree.png


  • EVO tests will take place in advance and a rehearsal is planned on March 23rd, 10 00 AM (Paris local time)
Outils personels