NTNU logo

TDT4275 - Natural Language Interfaces (Naturlig språklig grensesnitt)

Teknologi

IDI > Education > Emner > TDT4275 Natural Language Interfaces (Naturlig språklig grensesnitt)

Natural Language Interfaces (Naturlig språklig grensesnitt)

Natural Language Interfaces (Naturlig språklig grensesnitt) given by Björn Gambäck.

Course overview

The subject comprises: What are words and languages? Grammars and syntactic analysis of natural language. Semantics and logical form. Statistical methods for language understanding. Dialogue analysis and conversational agents. Machine translation. Speech-based interfaces.

The course will be run as lectures and programming exercises. The first lecture will be Thursday January 12, at 12:15-13:00 in room KJL3 (Kjelhuset rom 252). The first meeting will be a general introduction to the subject.

Formal course description

It's learning

Table of contents

  • Course book
  • Examination
  • Lecture schedule
  • Lectures


  • Course Book

  • Daniel Jurafsky and James H. Martin (J&M)
    SPEECH and LANGUAGE PROCESSING: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition
    2nd Edition (NB!), Prentice-Hall, 2008.
  • Reading Instructions:
    Ch. 1; Ch. 3; Ch. 4.1-4.3; Ch. 5.1-5-7; Ch. 7.1-7.3; Ch. 8.1-8.3.3; Ch. 9.1-9.2, 9.5-9.6; Ch. 12; Ch. 13; Ch. 14.11; Ch. 15.1-3; Ch. 16.2.2-16.4; Ch. 17.1-17.4.1; Ch. 18.1-18.4, 18.6; Ch. 20.1-20.2, 20.7; Ch. 21.1-21.7; Ch. 22.1-22.2, 22.4-22.5; Ch. 23.1, 23.3-23.7; Ch. 24.1-24.2, 24.4-24.5; Ch. 25.1-25.3, 25.9;


  • Examination

    1. Written account of the five lab assignments.
      Send in the solution to a specific lab exercise no later than one month after the day the assignment was handed out (NB!).

      The assignments will be available in English only, unless somebody specifically requests to have them in Bokmål and/or Nynorsk at least one week before the publication date.
      The answers may be given in either English, Bokmål or Nynorsk (other languages only if agreed on beforehand with the assistant).

    2. A written exam Saturday 9.6, 09:00-12:00.
    3. The course grade will be assigned as follows:
      exam: 55%, assignments: 45%.


    Course Schedule 2012

    Lecture hours

     time and place
    Lecture 1: Introduction and Overview Thursday 12.1, 12:15-13:00, KJL3
    Lecture 2: Text Summarisation [Marsi] Thursday 19.1, 08:15-10:00, F3
    Lecture 3: BusTUC (with demo) [Sætre] Tuesday 26.1, 08:15-10:00, F3
    Lecture 4: NLP and NLP Systems Tuesday 31.1, 15:15-17:00, KJL22
      
    Lecture 5: Language and Linguistics Thursday 2.2, 08:15-10:00, F3
    Lecture 6: Phonology and Morphology Tuesday 14.2, 15:15-17:00, KJL22
    Lecture 7: Words and Tagging Thursday 16.2, 08:15-10:00, F3
    Computer lab 1: Words & Tagging [Bungum] Thursday 16.2, 12:15-13:00, KJL3
    Lecture 8: Basic Grammars Tuesday 21.2, 15:15-17:00, KJL22
    Lecture 9: Parsing Thursday 23.2, 08:15-10:00, F3
    Computer lab 2: Grammars & Parsing [Bungum] Thursday 23.2, 12:15-13:00, KJL3
      
    Lecture 10: Semantics Tuesday 6.3, 15:15-17:00, KJL22
    Lecture 11: Pragmatics Thursday 8.3, 08:15-10:00, F3
    Computer lab 3: Semantics [Bungum] Thursday 8.3, 12:15-13:00, KJL3
    Lecture 12: Machine Translation Tuesday 13.3, 15:15-17:00, KJL22
    Lecture 13: Statistical Machine Translation [Bungum] Tuesday 20.3, 15:15-17:00, KJL22
    Computer lab 4: Machine Translation [Bungum] Thursday 22.3, 12:15-13:00, KJL3
    Lecture 14: Sentiment Analysis [Das] Tuesday 27.3, 15:15-17:00, KJL22
      
    Lecture 15: Information Retrieval and Extraction Tuesday 17.4, 15:15-17:00, KJL22
    Lecture 16: Speech Synthesis and Recogition Thursday 19.4, 08:15-10:00, F3
    Computer lab 5: Information Retrieval [Bungum] Thursday 19.4, 12:15-13:00, KJL3
    Lecture 17: Conversational Agents Tuesday 24.4, 15:15-17:00, KJL22
      
    Written Exam Saturday 9.6, 09:00-12:00, Spektrum


    Lectures

    Lecture 1: Introduction and Overview

    Introduction; Administrative details; Examination; Schedule for the course.

    Slides     To read    
    pdf
     
    J&M Ch. 1
    Gambäck: Human Language Technology: The Babel Fish



    Lecture 2: Text Summarisation

    Guest Lecturer: Erwin Marsi
    Text summarisation; extracts; abstracts; sentence compression.
    Text fusion; reformulation; generalisation, specification. Anaphora resolution.

    Slides      To read     Other relevant material
    pdf J&M Ch. 23.3-23.7



    Lecture 3: BusTUC (with demo)

    Guest Lecturer: Rune Sætre
    An example of a successful NLP-application: TUC (The Understanding Computer); Natural language question answering.

    Slides      To read     Other relevant material

     
    Amble. "BusTUC - a natural language bus route oracle"
     
    BusTUC demo (ENG)
    BussTUC demo (NOR)



    Lecture 4: NLP and NLP systems

    The components of an NLP system; what do we need to build language processing systems?

    Slides     To read    
    pdf
     
    J&M Ch. 1
    Gambäck. Human Language Technology: The Babel Fish



    Lecture 5: Language and Linguistics

    Language and languages; Relations between languages; Written and spoken languages; Human language and other types of communication; The structure of language.

     Slides     To read     Other relevant material
    pdf
     
     
     
     
     
    J&M Ch. 1
    J&M Ch. 16.2.2-16.3
     
     
     
     
    Ethnologue: Languages of the World
    Omniglot: Writing systems
    CIA Factbook: Languages and other facts about all countries
    Seyfarth et al.. 1980. "Monkey Responses to Three Different Alarm Calls", Science 211:801-803.
    Cheney & Seyfarth. 2005. "
    Constraints and preadaptations in the earliest stages of language evolution", Linguistic Review 22:135-159.



    Lecture 6: Phonology and Morphology

    Phonemes; Syllables; Writing systems; Transliteration Morphology; Two-level morphology; Stemming; Morphological parsing

    Slides      To read     Other relevant material
    pdf
     
    J&M Ch. 3
    J&M Ch. 7.1-7.3
    J&M links to NLP resources
     



    Lecture 7: Words, their neighbours, and POS Tagging

    Words; Lexemes; Tokenization; N-grams; Sparse data; Part-of-speech; Open and closed classes; Tagsets; Methods for part-of-speech tagging; Unknown words.

    Slides      To read     Other relevant material
    pdf
     
    J&M Ch. 4.1-4.3
    J&M Ch. 5.1-5-7
    J&M links to NLP resources
     



    Lecture 8: Basic Grammars

    Context-free grammars; Definite Clause Grammars; Dependency Grammar.

    Slides      To read     Other relevant material
    pdf
     
    J&M Ch. 12
     
    Blackburn, Bos & Striegnitz, 2006-2011. Learn Prolog Now!. Saarbrücken, Germany.
    Tapanainen & Järvinen, 1997. "A Non-Projective Dependency Parser", Proc. 5th Conf. on Applied NLP, Washington, DC.



    Lecture 9: Parsing

    Parsing with context-free grammars; top-down parsing; bottom-up parsing;
    Well-formed substring tables; head-first parsing; charts; LR parsing;
    Human language processing.

    Slides      To read     Other relevant material
    pdf
     
     
     
     
     
     
     
     
    J&M Ch. 13
    J&M Ch. 14.11
    J&M Ch. 16.4
    Kimball. "Seven Principles of Surface Structure Parsing in Natural Languages"
     
     
     
     
    Kay. 1989. "Head-Driven Parsing", Proc. 1st Int. Workshop on Parsing Technologies. Pittsburgh.
    Aho & Ullman. 1972. Ch. 4.2: Tabular Parsing Methods. The Theory of Parsing, Translation, and Computation. Prentice-Hall.
    Tomita. 1986. Ch. 2: Informal Description of the Algorithm. Efficient Parsing for Natural Language. Kluwer Academic Press.
     



    Lecture 10: Semantics

    Semantic models, representing meaning. Syntax-driven semantic analysis, quantifiers, compositionality.

    Slides      To read     Other relevant material
    pdf
     
     
    J&M Ch. 17.1-17.4.1
    J&M Ch. 18.1-18.4
    J&M Ch. 18.6
    Blackburn & Bos. 2005. Representation and Inference for Natural Language: A First Course in Computational Semantics. CSLI Publications, Stanford, California.



    Lecture 11: Discourse and Pragmatics

    Discourse processing, reference resolution.

    Slides      To read     Other relevant material
    pdf
     
    J&M Ch. 20.1-20.2
    J&M Ch. 21.1-21.7



    Lecture 12: Machine Translation

    Machine Translation theory; Machine Translation systems, MT users, applications

    Slides     To read     Other relevant material
    pdf
     
     
    J&M Ch. 25.1-25.2

     
    Systran commercial application
    Various translation tools
    Free Internet MT services



    Lecture 13: Statistical Machine Translation

    Guest Lecturer: Lars Bungum
    Statistical Machine Translation theory and systems; language modelling and decoding; MT system evaluation

    Slides     To read     Other relevant material
    J&M Ch. 25.3-25.9 Parallel texts - European Parliament



    Exam Information and Previous Exams

  • Written 3-hour Exam Saturday 9.6, 09:00-12:00: English
    The exam questions will be given in English only, unless somebody specifically requests to have them in Bokmål and/or Nynorsk at least one week before the exam.
    The answers may be given in either English, Bokmål or Nynorsk (other languages only if agreed on beforehand with the examiner).

  • Exam 2011 (23/5 2011): English (+frontpage)
  • Exam 2010 (26/5 2010): English
  • Exam 2009 (27/5 2009): English (NB: 4 hours)
  • Sample exam (9/3 2004): English

  • Other previous NLP exams at NTNU (NB: somewhat different courses)



  • For nærmere informasjon om emnet, kontakt faglærer Björn Gambäck.