Questions to Help Evaluate Linguistic Tools

BIFoCAL document

This document is intended to serve as a series of useful questions to think about when choosing a tool for producing linguistic data in digital from. The questions are oriented to a wide-range of needs that the general linguist might have.

Accompanying this document is a series of reviews of some popular tools. These may be useful just so that you can see how someone else has reviewed a particular tool, or to serve as a guide for evaluating a tool for which there is no review available. If you would like to do your own review of a tool which is not reviewed and have it posted here, please contact .

How does this tool store and export data?

Things to consider:

What types of data will you be working with?

Some common types of data linguists work with:

No one tool can work with all these kinds of data, and most tools are specialized only to work with a subset of them. It is good to make a list of the sorts of data you will be working with before choosing any tool--this will help you decide which tool or tools is the best match for your needs.

How will you be annotating and marking up your data?

Things to consider:

Parsing, interlinearizing, and aligining are all fairly specialized functions. Not many tools can do these things well.

Will you need to work with special characters?

If you will only need to use basic, unaccented characters, any tool should be fine. If you will need special characters (accented characters, non-Roman character sets, etc.), your options become more limited. It's best to use a tool which supports the Unicode character encoding standard.

How will you input your data?

Things to consider:

Different tools make doing the things listed above easier or more difficult. So, it is good to think about how you will be inputting and marking up your data before you choose your software tool.

What kinds of resources do you intend to produce with your data?

Some different kinds of resources often produced based on basic data:

Most tools will allow to fairly easily make hard copies or electronic versions of your documents (though this may require use of a second or third tool). Making online/searchable versions is not as common a type of functionality nor is making different versions of the same document. If these are priorities, one should make sure the tool one uses makes the creation of such resources as simple as possible.

What sorts of analysis will you do on your prepared data?

Common analysis requirements

Some of these types of analysis, like searching or selection of a subset of the data, are features of almost every tool. Others are less common and, if they are priorities, they should factor into consideration of which tool to use.

For more information please contact .