[Winter 2011] NLTK syn & sem

Wed 28 December 2011 by Adrian Brasoveanu

A script introducing basic notions and tools for lexical semantics, syntax, and compositional semantics with NLTK: NLTK-syn-sem.py, Dowty-et-al.cfg, mygrammar.cfg.

[Fall 2010] Quantifier Scope, Intro to GLMs and GLMMs

Wed 28 December 2011 by Adrian Brasoveanu

Plan: discuss the analysis of a subset of the quantifier-scope-tagged corpus that Scott AnderBois, Robert Henderson and I have been working on over the last year; we will focus exclusively on sentences with two quantifiers.

Main goal: provide motivation for continuing the discussion of linear regression from last quarter …

[Spring 2010] Regression etc.

Tue 27 December 2011 by Adrian Brasoveanu

Corpus-based evaluation of Centering Theory: Poesio et al. (2004) (“Centering: A Parametric Theory and Its Instantiations”); discussion led by Bern.

Intro to anaphora resolution algorithms (Hobbs 1978, Lappin & Leass 1994, Centering Theory): anaphora resolution.ppt

Tutorial on regression (see the references in the R scripts for sources):

  1. Warm-up (skewness, histograms …

[Winter 2010] Tutorial on quantitative corpus linguistics and R

Tue 27 December 2011 by Adrian Brasoveanu

Why corpus linguistics? Here’s part of the conclusion of Abney (1996) (“Statistical methods and linguistics”); see also chapter 1 of Manning & Schuetze (1999) (“Foundations of Statistical Natural Language Processing”):

In closing, let me repeat the main line of argument as concisely as I can. Statistical methods—by which I …

List of UCSC corpora

Tue 27 December 2011 by Adrian Brasoveanu

List of corpora available at the UCSC Media Center: list-of-UCSC-corpora.xls.

From CLG to LaLoCo

Mon 26 December 2011 by Adrian Brasoveanu

The Corpus Linguistics Group (CLG) has evolved into the Language, Logic & Cognition (LaLoCo) Lab. A description of LaLoCo’s overarching research goal is available here.

The CLG materials (summaries, scripts etc.) have already been posted on the LaLoCo page:
