Skip to content

MatrixDoc_Number

VaryaGracheva edited this page Dec 20, 2012 · 36 revisions

Documentation for the Grammar Matrix Customization Number Library

Introduction

This document explains how to fill out the number page of the Grammar Matrix Customization questionnaire and presents background information on the number library of the Grammar Matrix Customization System (Bender et al., 2002; Bender and Flickinger, 2005; Bender et al., 2010). General instructions on using the questionnaire can be found here.

Options

On the number page in Matrix Customization you are asked to describe the number hierarchy in your language. The number library allows you to specify the range of values for the feature NUMBER that will be available elsewhere in the customization system and in the resulting grammar (see more on that in the Analyses section below). The complexity of number hierarchy is different for all languages. The number of values that you can enter for number names and their supertypes in questionnaire is unlimited. Please note that this part of the questionnaire is for entering number values, and NOT numerals, such as one, two, three, etc.. Even though there can be an interaction between grammatical number and numerals, numerals are a very different phenomenon, which is not yet supported by the Matrix Customization system.

Using dropdown menu, please add numbers and their supertypes as applicable to your language. Unlike gender, in terms of filling out the questionnaire for Grammar Matrix, you need to represent number hierarchy in grammar even if it is not expressed grammatically through agreement. The reason why you need to enter all numbers existing in your language even if they are not expressed through agreement is a very consistent semantic representation of number, which needs to be reflected in your grammar. An example of a language with the lack of agreement in number between nominals and other parts of speech is Japanese. Despite this lack of agreement, both singular and plural numbers should be present in customization system, otherwise the sentences with nominals differing in number would be treated as the same semantically by your grammar.

Some of the common possibilities of the number values on this page are: singular (represents one entity), dual (two entities), trial (three entities), quadral (four entities), paucal (a few entities), plural (multiple entities), general (unspecified number of entities), with some further subdivisions for paucal and plural numbers.

Below is an example of options chosen for Slovene language, which has three numbers and a facultative number system (see next section Analyses for more details):

  • Gender Supertype
    Singular number
    Dual number
    Plural number

For more information on how to use the questionnaire to define values for number feature in your language, please refer to the number page in Matrix Customization.

For more information on the implementation of number in Grammar Matrix please refer to the next section Analyses.

Analyses

Your starter grammar uses feature NUMBER, the values of which are based on the choices you make on the number customization page. Note that like GENDER and PERSON, NUMBER is a feature of the nominal INDEX. Making NUMBER a feature of semantic INDEX instead of syntactic HEAD allows for correct semantic representation of number in your grammar and avoiding spurious ambiguity during generation.

Instead of applying some form of a universal number hierarchy to all starter grammars, Matrix customization system requires user to define number values in their language, thus solving the problem with representing languages with complex number hierarchies, which cannot be accounted for by a universal number hierarchy. Examples of such number hierarchies that cannot be covered by a universal number hierarchy are found in languages with facultative number systems, in which number optionality is present.

American Sign Language is an example of a language with a number hierarchy that cannot be covered by a universal approach. While American Sign Language requires a hierarchy with plural and non-plural numbers (with the latter subdivided into singular and dual numbers), the universal number hierarchy is limited to singular vs. non-singular division. Therefore, due to existence of languages that cannot be covered by a universal number hierarchy, a customized Grammar Matrix approach is a better solution for representing number hierarchy across languages. For more information on using customized number hierarchy for your target language, please refer to Drellishak 2009, Section 5.3.4.

The choices you make in the questionnaire are recorded in the choices file, which is a Unicode text file. Below is an example of a gender section in the choices file of a language with singular, plural, and dual numbers:

section=number
  number1_name=singular
    number1_supertype1_name=number
  number2_name=dual
    number2_supertype1_name=number
  number3_name=plural
    number3_supertype1_name=number

This will be reflected in the your_language_name.tdl file:

png :+ [ PER person,
    NUM number, <<BR>>
    GEND gender,<<BR>>
    ANIMATION animation ].


;;; Number

number := *top*.<<BR>>
singular := number.<<BR>>
dual := number.<<BR>>
plural := number.<<BR>>

The values you assign to NUMBER feature will be available to use later in the customization system, especially the Morphology and Lexicon pages. The answers you provide on this page will determine the values available on the Lexicon page for the NUMBER feature (or the PERNUM feature, about which see the Person section of the questionnaire for more details or person documentation page). NUMBER will also appear as one of the features of the lexical rule types on Morphology page.

The choices you make on the number customization page, will also affect the lexical types (your_language_name.tdl file), lexical rules (lrules.tdl file), and inflectional rules (irules.tdl file).

The analyses implemented in the number library are described in detail in Drellishak 2009, Section 5.3.

NUMBER is a feature often involved in agreement phenomena. These are handled through the lexicon and morphology libraries. The analyses of agreement implemented in the Grammar Matrix are described in detail in Drellishak 2009, Section 5.2.

Upcoming Work

  • [ This documentation is under construction. When it is more complete, this section should describe any modifications to or enhancements of this library that are either in progress or planned. ]

References

Drellishak, Scott. 2009. Widespread but Not Universal: Improving the Typological Coverage of the Grammar Matrix. PhD thesis, University of Washington.

Clone this wiki locally