LaBB-CAT

(formerly known as ONZE Miner)

What does it do?
What is Transcriber?
What is Praat?
What do I need for LaBB-CAT to work for me?
How do I get LaBB-CAT?
How much does LaBB-CAT Cost?
Who wrote LaBB-CAT?
Why?

LaBB-CAT is a browser-based linguistics research tool that stores audio or video recordings, text transcripts, and other annotations.

Annotations of various types can be automatically generated or manually added.

The transcripts and annotations can be searched for particular text or regular expressions. The search results, or entire transcripts, can be viewed or saved in a variety of formats, and the related parts of the recordings can be played or opened in acoustic analysis software, all directly through the web-browser.

Storage of Media and Transcripts

LaBB-CAT is essentially a repository for time-aligned transcripts of audio/video recordings. Time-aligned transcripts can produced using Transcriber or Praat (which can be used to create a document lining up the transcript text with the corresponding location in the audio/video recording). The transcript is then uploaded to LaBB-CAT, which allows additional information about the speakers and the transcripts to be stored.

Transcriptions from Transcriber or Praat

Automatic Annotation

Combining the signal data, the raw orthographic transcriptions, and some third-party data and tools, the transcripts can be automatically annotated, for example:

CELEX

Annotations derived from CELEX With the help of data from CELEX, words can be automatically annotated with further data:

phonology
syllabification
morphology
part of speech
frequency

HTK

With the help of HTK, transcripts aligned at the utterance level can be force-aligned to the word and segment level.

Force alignment with HTK

Statistical Layers

Word frequency data for the LaBB-CAT database itself can be computed and annotated directly on each word.

Word frequency layer

Combining time alignment information and syllable count from CELEX, speech rate can be calculated over different domains.

Syllables per minute for lines and turns

Stanford Parser

With help from the Stanford Parser, editable syntactic trees can be generated for the transcripts.

Annotations that represent an editable tree

Manual Annotation

Annotations can be manually added, e.g.

Topic tagging
Text tags against individual words
Time points or intervals can be annotated using Praat

Point annotation using Praat

Search

Once transcripts and annotations are in place, searches can be performed across transcripts which meet certain criteria (e.g. based on age/gender of the speaker, corpus the transcript belongs to, etc.):

Filter speakers by their attributes

When the speakers have been selected, their utterances can be searched for text or regular expressions, across different layers:

Search for 'the' in an 'adult' topic, followed by a word starting with I or E or i or @ on the phonemes layer

This returns a list of all of the utterances from the selected transcripts which match the query.

Search results

If desired, this list can be exported, together with relevant speaker and annotation information, directly to a CSV file, ready for further analysis in Excel or R:

Search results

Or audio samples can be extracted for analysis:

Audio from results

Alternatively, clicking on an utterance returned by the search produces the full transcript for the speaker involved, positioned with the relevant utterance at the top of the screen. Any part of the transcript can be clicked on, and the corresponding part of the media played. Media can be stored on the webserver, on disk, or on CD (which is prompted for if required):

Interactive Transcript

Directly from the interactive transcript page, the media can be played, other annotation layers can be displayed, the audio for the line can be extracted, or opened in Praat with a corresponding TextGrid containing annotations. Annotations can be added, edited, or removed, and alignments adjusted.

What is Transcriber?

Transcriber is a free, 3rd-party software tool for creating and edited time-aligned transcripts - these are XML documents that line up transcript text with the corresponding part of the audio recording.

You can find out more about Transcriber here: http://trans.sourceforge.net

What is Praat?

Praat is a free, 3rd-party software tool for acoustic analysis. Currently, through a web browser, LaBB-CAT is capable of interacting only with Praat.

Also, Praat can be used instead of Transcriber for creating transcripts.

You can find out more about Praat here: http://www.praat.org/

What do I need for LaBB-CAT to work for me?

A server computer with

a web server that supports java server pages and java servlets (we have used LaBB-CAT with Microsoft IIS, and with Apache)
Java
MySQL, which is open-source software
Apache Tomcat, which is open-source software
LaBB-CAT

Because LaBB-CAT is written in Java, your server can run under Windows, Linux, or Macintosh OS X.

Any client computers that you wish to use to search the LaBB-CAT must have:

network access to the server
a web browser
Java

They can run under Windows, Linux, or Macintosh OS X

To interact with Praat, they must also have Praat installed.

You may also install Transcriber to create transcripts with, or you can do transcription with Praat.

Alternatively LaBB-CAT can be used as a 'standalone' system - i.e. the 'server computer' and the 'client computer' can be the same, in which case no network access is required.

How do I get LaBB-CAT?

LaBB-CAT can be downloaded from here: https://sourceforge.net/projects/labbcat/files/install/.

If you do download LaBB-CAT, please drop us a line, and let us know what you are using it for, and whether the installation goes smoothly.

You may also be interested in perusing this LaBB-CAT worksheet, which talks users through the various features. It presupposes access to a particular demo corpus which you won't have access to, but may still contain useful pointers.

There is also an article outlining the architecture and construction of LaBB-CAT.

How much does LaBB-CAT Cost?

LaBB-CAT is Open Source software, published with GNU's General Public License.

You can install and use it free of charge, and you can change and redistribute it freely, as long as your changes are also released as Open Source software. For more information, visit GNU: http://www.gnu.org/licenses

Who wrote LaBB-CAT?

The system was designed and written by Robert Fromont and Jen Hay for the ONZE (Origins of New Zealand English) Project, at the New Zealand Institute of Language, Brain and Behaviour.

Why?

The ONZE Project has a large (more than 1000 hours) corpus which contains recordings of people born in New Zealand from the 1850s to the 1980s. The ONZE Project was originally conceived as a project that traced the development of the New Zealand accent. Three different types of phonetic/phonological analyses have been carried out: auditory perceptual analyses, auditory quantitative analyses and acoustic analyses.

Since 2003 we have been working on techniques for interacting more directly with the sound-files, via the orthographic transcript. That is, we wanted to introduce timestamping information into the transcript, so that a researcher could directly access any particular utterance of interest. After an initial exploration of appropriate tools, we settled on Transcriber software. NXpeds Footpedals (http://www.nxpeds.com/) have been programmed for use with the software, so that transcribing into Transcriber proceeds very similarly to the traditional use of a transcription machine.

Because the format of the Transcriber files is XML, it can be relatively easily transformed into other formats. We therefore developed a simple java utility which would transform the Transcriber files into TextGrids for use with Praat acoustic analysis software.

In 2004 we began to develop software which allows these time-aligned transcripts to be stored on a central database, from where they can be easily viewed, filtered, and searched. The prototype of this software evolved into LaBB-CAT, and is proving highly successful, both for research, and for using the corpus in teaching. Part or all of the corresponding recordings can now be played-back or loaded into acoustic analysis software directly from the transcript.

This system has already radically increased the efficiency with which we can interact with the corpus. It raises exciting new possibilities for research projects that would have previously been extremely time-consuming.