Category Archives: fieldwork

New preprint (etc) archive for Australian languages!

I worry about data. It’s my job. I worry about how to analyze it, how to collect it, how to present it, and what happens to it. A particular worry for me at the moment is the very large amount of ‘grey’ publications for Australian language: that is, the language materials that are published locally, for example, by language centres or smaller publishers. There are also gems in working papers collections, some of which only exist in photocopies of photocopies at this stage. Some important work has come out of Hono(u)rs Theses, but that work isn’t often widely available, and unlike PhD theses, it tends not to make its way to university repositories. I have a large collection of such materials, both in print and in scanned format, and I presume that others do too, particularly the “older generation” of Australianists who did most of their work before putting stuff on the web was what one did as a matter of course.

Another area of accessibility in work on Australian languages is fire-walled publications (or subscription-only publications). There is an increasing attention to Open Access, but for various reasons, much work is either print-only or e-print but behind a firewall. But in many cases, authors are able to upload freely available preprints.

It’s important to make our work available to the many groups who are interested in language: to our linguist colleagues, to the wider scientific community, to the general public, and in particular to members of the Aboriginal community.

So, I’ve started a ‘community’ on Zenodo for Australian languages. Zenodo is an archive platform for sharing research. In a nutshell, you upload your paper, handout, or other item, give the site some information about the work’s metadata, and publish it. You choose a license to share your work under (it can be closed (archived), for example), upload the file(s), and presto!

Zenodo is somewhat similar to academia.edu and researchgate.net, in that it takes work and makes it available. However, there are also a couple of big differences. Both academia.edu and researchgate are for profit, while Zenodo is not for profit, and funded by CERN and programs in the EU’s Open Science Initiative. Zenodo uploads are publications, while the others’ aren’t. Zenodo assigns DOIs, allowing for referencing versions of publications (which makes it great for databases or dictionaries or other work that might have multiple editions or versions). It also lets you upload collections of files as a single item (which the others don’t), and it works well with code repositories like github, so you can publish the paper, supporting documentation, and code at the same time. If you have sound files, you can include them with the paper under the same DOI (which you can’t do on academia, for example)

Another issue is findability – in theory, everything on the web is ‘findable’ if you know what to search for. Search engines, however, optimize results, weighting results from different places differentially. I know from the experience of finding papers for ozpapers that it can be hard to find work on Australian languages, even when I have regular alerts set up. For example, not all university thesis repositories show up in google alerts (you have to know what to look for)

To contribute to Zenodo, go to https://zenodo.org/communities/australianlanguages/

You’ll see a list of current contributions and a button to upload.

If you have old handouts, or other useful information about Australian languages, that you would like to contribute but do not have the time/inclination to upload them, if you can get me the scans (or even paper copies), my students and I will upload them for you.

 

New bootcamp under way!

The 2017 grammar boot camp starts tomorrow. Three students (with bios below) will be working with me on materials for Noongar. We’re very lucky to be working with Denise Smith-Ali, Noongar linguist, and Sue Hanson from the Goldfields Language Centre. Our main focus for the month is to put together a phonological description of Noongar, with sound files to illustrate what we are describing. In some ways, this is pretty straightforward (in that it’s the sort of thing linguists do, the scope is known, etc) but in other ways, it’ll be a challenge! For example, we want to make something easy to access, and easy to edit and update. We’ll be posting more about this as we make decisions.

Akshay Aitha: Akshay is a rising senior at UC Berkeley working on a double major in Linguistics and Applied Mathematics (with a concentration in Logic). My main research interest at the moment is the functional structure of nominals, especially in my heritage language, Telugu. I also have a strong enthusiasm for linguistic fieldwork. Outside of my coursework, I’ve been involved as a research assistant on various phonetics and fieldwork projects under graduate students in the Berkeley Linguistics department, and I’m also involved in my department as an officer of our club for undergraduates, SLUgS.

Lydia Ding: Lydia is a recent graduate of Carleton College, where she majored in Linguistics and completed a senior thesis for distinction on wh-questions in Nukuoro [nkr] (Polynesian). Her primary interests lie in language documentation, syntax, morphology, and computational linguistics.

Sarah Mihuc: Sarah is a recent graduate of McGill University with a BA Honours in Linguistics & Computer Science. She works on anti-agreement and on word order in Kabyle Berber. She also has experience in experimental and computational linguistics, and fieldwork on two Mayan languages.

Grammar Boot Camp, 2017

I will again be holding a summer ‘grammar boot camp’ at Yale this summer. The dates will be from Wednesday, June 28 to July 26, 2017. (Note that these dates overlap with the LSA Institute at the University of Kentucky.) The idea is to have up to four advanced undergraduate students work intensively on existing high-quality archival field notes and recordings with the aim of producing a publishable sketch grammar. Students will receive a stipend and travel expenses to come to Yale.
This project is funded by the National Science Foundation’s Research Experiences for Undergraduates program; as such, applicants are limited to US citizens or permanent residents. Students who will have graduated in Spring 2017 are eligible to apply. The targeted cohort is undergraduates who will have just finished either their junior or senior year.
The materials to be worked on will be from an Australian Aboriginal language from Western Australia and will include both print materials and audio files. It is probable that the ‘print’ materials will already be digitized and in Toolbox.
Students will meet once a day as a group with me to discuss analyses and writing. They will spend the rest of the time working with the materials in the Linguistics department. They will receive regular detailed feedback on the analysis and writing. Familiarity with Australian languages is not required but I would expect that successful applicants would do some reading of grammars of related languages prior to the start of the boot camp.
Applications for the boot camp are now open. The deadline for applications is January 31, 2017, and applicants will be notified of the result in mid-February.
To apply, please send the following materials electronically:
. a letter of application, describing your experience in linguistics, including research experience, experience with language documentation and relevant software, your future plans, and why you’d like to join the boot camp.
. a writing sample, such as a linguistics term paper
. course transcript (this can be an unofficial transcript)
Please send materials as file attachments to bootcamp@pamanyungan.net, cc’ed to claire.bowern@yale.edu. Applications will be acknowledged within 2 days – if you don’t get an acknowledgment, please let me know.
Please also arrange for one or two letters of recommendation/support from faculty to be sent to the same email addresses, also by January 31.
Students will need to show some evidence of prior research experience (e.g. through an RA-ship or by having a senior thesis in progress) and some familiarity with language documentation procedures (e.g. through having taken a field methods class or equivalent, such as having attended CoLang or a LSA Institute class). Applicants will need to show attention to detail and ability to focus on a project for a sustained period. Students will need to be able to travel to New Haven for the entire period of the boot camp and should expect to work solely on this project during that time, including some evenings and weekends.
Please forward to anyone you think would be interested and feel free to contact me with any questions.

Conference talk on grammar boot camps

I run a grammar boot camp every year, where a small group of students write a grammar of a language in a month. Last year it was Ngalia, and this year (starting in a few weeks) it’ll be Cundalee Wangka and Kuwarra. I also ran a year-long grammar group to pilot the idea in 2013, using materials from Tjupan. All four languages are varieties of the Wati subgroup of Pama-Nyungan and all the books are based on fieldwork conducted by Sue Hanson.

At the recent Wanala Conference run by the Goldfields Language Centre, Anaí Navarro, Matthew Tyler and I did a video presentation about the boot camp, its aims, methods, and results. Here’s a link to the video: https://drive.google.com/a/yale.edu/file/d/0ByIoQcheKNw2RGx4amVjNjhLOUk/view. Warning that it’s 190mb and 22 minutes long.

Talk slides

This week I’ve been giving three talks in the Department of Linguistics at UC Berkeley. It’s been a very stimulating week, with lots of good feedback, brainstorming for new directions, and problem troubleshooting. I’ve also met with many of the graduate students (including two who were my students as undergraduates and who worked on the data that led to some of the results presented in the talks) to hear about their work.

I’m posting slides for two of the talks here. On Monday, I gave an overview of the Pama-Nyungan project and talked about how the tree was created, what it implies to take an ‘evolutionary’ view of language (in this framework), and some off-shoots of the project (MondayPhylogenetics powerpoint). On Wednesday, I talked about one further extension, using the tree to investigate the evolution of colo(u)r terminology. (WednesdayColor powerpoint.)

The other talk, on Tuesday, was on using my Bardi corpus to study life-span changes and variation. At the end of the talk when I was chatting, one of the sociolinguists expressed surprise that I hadn’t anonymized the identities of the Bardi speakers. I hadn’t thought about it. As a fieldworker working on Bardi, I did talk about this general point with the people I worked with, and all were keen to be acknowledged for their work on the language, and recognized among the key custodians of their culture. However, with the work on aging and variation, I am no longer talking directly about Bardi as a language, and more about the properties of the speech of individual speakers, and the more I thought about it, the less comfortable I am with putting that up online without talking to Bardi people about it first. It’s not just a matter of anonymizing the slides, because there are so few speakers, anyone who has any of my other Bardi work will be able to easily work out who I’m talking about. There will be a paper (soon I hope) on using forced alignment in field research, so some of the results will be in that paper, probably now along with a discussion of the ethics of same.

 

Second Annual Summer Grammar Bootcamp!

I will be holding a summer ‘grammar boot camp’ from July 5 to July 29, 2016. The idea is to have up to four advanced undergraduate students work intensively on existing high-quality archival field notes and recordings with the aim of producing a publishable sketch grammar. Students will receive a stipend and travel expenses to come to Yale. This follows from a very successful first bootcamp in 2015.

This project is funded by the National Science Foundation’s Research Experiences for Undergraduates program; as such, applicants are limited to US citizens or permanent residents. Students who have graduated in Spring 2016 will be eligible to apply. That is, the targeted cohort is undergraduates who will have just finished either their junior or senior year.

The materials to be worked on will be from an Australian Aboriginal language from Western Australia and will include both print materials and audio files. It is probable that the ‘print’ materials will already be digitized and in Toolbox.

Students will meet once a day as a group with me to discuss analyses and writing. They will spend the rest of the time working with the materials in the Linguistics department. They will receive regular detailed feedback on the analysis and writing. Familiarity with Australian languages is not required but I would expect that successful applicants would do some reading of grammars of related languages prior to the start of the boot camp.

Applications for the boot camp are now open. The deadline for applications is January 22, 2016, and applicants will be notified of the result in mid-February.

To apply, please send the following materials electronically:

. a letter of application, describing your experience in linguistics, including research experience, your future plans, and why you’d like to join the boot camp.
. a writing sample, such as a linguistics term paper
. course transcript (this can be an unofficial transcript)

Please send materials as file attachments to bootcamp@pamanyungan.net, cc’ed to claire.bowern@yale.edu. Applications will be acknowledged within 2 days – if you don’t get an acknowledgment, please let me know.

Please also arrange for one or two letters of recommendation/support from faculty to be sent to the same email addresses, also by January 22.

Students will need to show some evidence of prior research experience (e.g. through an RA-ship or by having a senior thesis in progress) and some familiarity with language documentation procedures (e.g. through having taken a field methods class or equivalent, such as having attended CoLang or a LSA Institute class). Applicants will need to show attention to detail and ability to focus on a project for a sustained period. Students will need to be able to travel to New Haven for the entire period of the boot camp and should expect to work solely on this project during that time, including some evenings and weekends.

Documenting Endangered Languages outreach videos

A new set of videos have been released which provide information on how to apply for a grant to do language documentation. The series is focused on the requirements for the National Science Foundation’s DEL program, but there is much information that would be useful to anyone applying for funding for their language projects. The videos are aimed at community members as much as (if not more than) academic linguists.

I have two of the video segments: components of an application, and 6 things that tank a grant proposal. The first segment is DEL-specific; we walk through the sections of an application. The second one, however, is very general, and applies to just about all grant applications.

In brief, the six things are

  1. A project outside the agency’s mandate (e.g. DEL funds linguistic work on endangered languages)
  2. Project doesn’t meet the agency requirements (e.g. they ask for X, Y, and Z in the application, but if that’s not provided, it’ll be rejected;
  3. Unrealistic aims, budget, time frame.
  4. Too vague
  5. Too specific, too narrow for the scope of the budget or time, ie not good value for money
  6. Inconsistency in the proposal.

You can watch the video here for further information.

Edited to add: Production of the videos was funded by NSF grant BCS#1500695, awarded to Racquel Sapien and Carlos Nash. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation.