ParaLiv: Livonian Paradigms in Phonemic Notation
This is a human-readable rendition of a JSON file defining a frictionless package. It was generated automatically.
name: paralivlicenses:-
keywords: livonian, paradigms, lexicon, morphology, paralex, uralic -
homepagehttps://paraliv.finug.eu/ profiledata-packagesources- [1]
titleLivonian morphological databasepathhttps://livonian.tech/
- [2]
nameTartu Ülikooli liivi keele korpuspathhttp://dx.doi.org/10.23673/re-473
contributors- [1]
roles['author', 'dataCurator']titleJules Boutonpathhttps://orcid.org/0009-0006-7368-1064organizationUniversité Paris-Cité, LLF, CNRS
- [2]
rolecontributortitleTuuli Tuiskpathhttps://orcid.org/0000-0002-0566-782XorganizationUniversity of Tartu, Livonian Institute
- [3]
roledataCollectortitleValts Ernštreitspathhttps://orcid.org/0000-0002-5323-8536organizationLivonian Institute
version1.1.0citationJules Bouton, Tuuli Tuisk, and Valts Ernštreits. ParaLiv: Livonian Paradigms in Phonemic Notation. 2024. doi:10.5281/zenodo.11391420.idhttp://dx.doi.org/10.5281/zenodo.11391420languages_iso639['liv']paralex-version2.2.17
This package describes the following files:
sources
Sources| Bibliographical references.
- This file is located in
paraliv/sources.bib.
cells
Paradigm cells
- This file is located in paraliv/paraliv_cells.csv.
- The identifier column (or
primaryKey) is['cell_id']
Columns defined by cells-schema:
-
cell_id(string): Cell identifier. The set of feature values as would appear in a gloss, separated by dots, eg. prs.ind.1sg or f.pl- constraints: a
cell_idis obligatory; it must be unique; it must match the regular expression(iness|part|elat|nom|gen|dat|ins|ill|sg|pl)(\.(iness|part|elat|nom|gen|dat|ins|ill|sg|pl))*.
- constraints: a
-
POS(string): Part of Speech. The relevant part of speech for this item. This must refer to a PartOfSpeech entity from the lexinfo (https://lexinfo.net/) ontology.-
constraints: a
POSmust be one of the values:verb,numeral,conjunction,noun,adposition,determiner,article,adverb,pronoun,fusedPreposition,adjective,symbol,particle,conditionalParticle,demonstrativePronoun,interjection,semiColon,diminutiveNoun,possessivePronoun,prepositionalAdverb,compoundPreposition,interrogativeRelativePronoun,possessiveParticle,plainVerb,letter,interrogativeDeterminer,relativePronoun,postposition,fusedPronounAuxiliary,interrogativeOrdinalNumeral,indefiniteOrdinalNumeral,strongPersonalPronoun,possessiveRelativePronoun,ordinalAdjective,collectivePronoun,commonNoun,infinitiveParticle,comparativeParticle,partitiveArticle,invertedComma,lightVerb,emphaticPronoun,distinctiveParticle,genericNumeral,possessiveAdjective,reflexivePossessivePronoun,colon,coordinationParticle,presentParticipleAdjective,fusedPrepositionPronoun,cardinalNumeral,indefiniteDeterminer,numeralFraction,questionMark,generalAdverb,superlativeParticle,point,indefiniteMultiplicativeNumeral,comma,closeParenthesis,futureParticle,personalPronoun,reflexivePersonalPronoun,adverbialPronoun,reciprocalPronoun,openParenthesis,pastParticipleAdjective,negativePronoun,relativeDeterminer,existentialPronoun,pronominalAdverb,relativeParticle,exclamativeDeterminer,multiplicativeNumeral,reflexiveDeterminer,modal,unclassifiedParticle,properNoun,allusivePronoun,interrogativeCardinalNumeral,bullet,subordinatingConjunction,irreflexivePersonalPronoun,possessiveDeterminer,negativeParticle,indefinitePronoun,generalizationWord,coordinatingConjunction,deficientVerb,adjective-i,impersonalPronoun,indefiniteCardinalNumeral,adjective-na,qualifierAdjective,affirmativeParticle,mainVerb,fusedPrepositionDeterminer,indefiniteArticle,weakPersonalPronoun,suspensionPoints,interrogativeMultiplicativeNumeral,affixedPersonalPronoun,auxiliary,circumposition,copula,demonstrativeDeterminer,participleAdjective,exclamativePoint,interrogativePronoun,presentativePronoun,punctuation,definiteArticle,slash,exclamativePronoun,preposition,conditionalPronoun,relationNoun,interrogativeParticle. -
rdfProperty: https://www.paralex-standard.org/paralex_ontology.xml#POS
-
-
unimorph(string): Cell/feature value in unimorph format. The cell or feature value, written following the unimorph schema -
ud(string): Cell/feature value in the universal dependency format. The cell or feature value, written following the universal dependency format -
livonian_tech(string): Livonian.tech. The cells definition, written following the Livonian Institute format -
tartu(string): Murrete Korpused. The cells definition, written following the Murrete Korpused format -
frequency(number): Frequency. Frequency for this row.
features-values
Grammatical features values
- This file is located in paraliv/paraliv_features.csv.
- The identifier column (or
primaryKey) is['value_id']
Columns defined by features-values-schema:
-
value_id(string): Grammatical Feature value identifier. Identifier for the grammatical feature value (as found in the cell)- constraints: a
value_idis obligatory; it must be unique.
- constraints: a
-
label(string): label for this row. A human readable label for the row.rdfProperty: http://www.w3.org/2000/01/rdf-schema#label
-
POS(string): Part of Speech. The relevant part of speech for this item. This must refer to a PartOfSpeech entity from the lexinfo (https://lexinfo.net/) ontology.-
constraints: a
POSmust be one of the values:verb,numeral,conjunction,noun,adposition,determiner,article,adverb,pronoun,fusedPreposition,adjective,symbol,particle,conditionalParticle,demonstrativePronoun,interjection,semiColon,diminutiveNoun,possessivePronoun,prepositionalAdverb,compoundPreposition,interrogativeRelativePronoun,possessiveParticle,plainVerb,letter,interrogativeDeterminer,relativePronoun,postposition,fusedPronounAuxiliary,interrogativeOrdinalNumeral,indefiniteOrdinalNumeral,strongPersonalPronoun,possessiveRelativePronoun,ordinalAdjective,collectivePronoun,commonNoun,infinitiveParticle,comparativeParticle,partitiveArticle,invertedComma,lightVerb,emphaticPronoun,distinctiveParticle,genericNumeral,possessiveAdjective,reflexivePossessivePronoun,colon,coordinationParticle,presentParticipleAdjective,fusedPrepositionPronoun,cardinalNumeral,indefiniteDeterminer,numeralFraction,questionMark,generalAdverb,superlativeParticle,point,indefiniteMultiplicativeNumeral,comma,closeParenthesis,futureParticle,personalPronoun,reflexivePersonalPronoun,adverbialPronoun,reciprocalPronoun,openParenthesis,pastParticipleAdjective,negativePronoun,relativeDeterminer,existentialPronoun,pronominalAdverb,relativeParticle,exclamativeDeterminer,multiplicativeNumeral,reflexiveDeterminer,modal,unclassifiedParticle,properNoun,allusivePronoun,interrogativeCardinalNumeral,bullet,subordinatingConjunction,irreflexivePersonalPronoun,possessiveDeterminer,negativeParticle,indefinitePronoun,generalizationWord,coordinatingConjunction,deficientVerb,adjective-i,impersonalPronoun,indefiniteCardinalNumeral,adjective-na,qualifierAdjective,affirmativeParticle,mainVerb,fusedPrepositionDeterminer,indefiniteArticle,weakPersonalPronoun,suspensionPoints,interrogativeMultiplicativeNumeral,affixedPersonalPronoun,auxiliary,circumposition,copula,demonstrativeDeterminer,participleAdjective,exclamativePoint,interrogativePronoun,presentativePronoun,punctuation,definiteArticle,slash,exclamativePronoun,preposition,conditionalPronoun,relationNoun,interrogativeParticle. -
rdfProperty: https://www.paralex-standard.org/paralex_ontology.xml#POS
-
-
feature(string): feature. The name of the dimension of this feature, eg. case, tense, modality, voice, force, gender, evidentiality, person, number, polarity...-
constraints: a
featureis obligatory. -
rdfProperty: https://www.paralex-standard.org/paralex_ontology.xml#feature
-
-
unimorph(string): Cell/feature value in unimorph format. The cell or feature value, written following the unimorph schema -
ud(string): Cell/feature value in the universal dependency format. The cell or feature value, written following the universal dependency format -
livonian_tech(string): Livonian.tech. Custom set of identifiers used at the Livonian Institute -
tartu(string): Murrete Korpused. The feature-value, written following the Murrete Korpused format -
canonical_order(integer): Sorting order for visual presentation. The order in which items are canonically presented. Use integers to represent relative order, order is used per-item.
forms
Inflected forms
- This file is located in paraliv/paraliv_forms.csv.
-
The identifier column (or
primaryKey) is['form_id'] -
Formal relations (foreignKeys) with other tables:
- Each value in column
['cell']of forms must refer to['cell_id']in tablecells - Each value in column
['lexeme']of forms must refer to['lexeme_id']in tablelexemes
- Each value in column
Columns defined by forms-schema:
-
form_id(string): Form table row identifiers. These identifiers are specific to form, lexeme, cell triples.- constraints: a
form_idis obligatory; it must be unique.
- constraints: a
-
lexeme(string): Reference to a lexeme identifier. Lexeme identifiers must be unique to paradigms.-
constraints: a
lexemeis obligatory. -
rdfProperty: https://www.paralex-standard.org/paralex_ontology.xml#lexeme
-
-
cell(string): Reference to a cell identifier. The set of feature values as would appear in a gloss, separated by dots, eg. prs.ind.1sg or f.pl-
constraints: a
cellis obligatory. -
rdfProperty: https://www.paralex-standard.org/paralex_ontology.xml#cell
-
-
orth_form(string): Inflected form (orthographic). The form, given orthographically-
constraints: a
orth_formmust match the regular expression(ä|ǟ|a|ā|b|d|ḑ|e|ē|ȯ|ȱ|f|g|h|i|ī|j|k|l|ļ|m|n|ņ|o|ō|ǭ|p|r|ŗ|s|š|t|ț|u|ū|õ|ȭ|v|z|ž|')+. -
rdfProperty: https://www.paralex-standard.org/paralex_ontology.xml#orth_form missingValues:#DEF#
-
-
phon_form(string): Inflected form (phonemic or phonetic). The form, given in phonemic or phonetic notation, with sounds separated by spaces-
constraints: a
phon_formmust match the regular expression(uːˀoi|iːˀe|uːˀo|uːoi|uoˀi|ieˀu|uoiː|ieuː|uːˀi|ɔːˀi|æːˀ|ɑːˀ|eːˀ|ɤːˀ|iːˀ|oːˀ|ɔːˀ|uːˀ|ɯːˀ|dʲː|lʲː|nʲː|rʲː|tʲː|eːi|ɑːi|uːi|ɔːi|ɤːi|ɯːi|iˀu|eˀu|æˀu|ɯˀu|oˀu|iːu|iuː|euː|æuː|ɯuː|ouː|eˀi|ɑˀi|uˀi|oˀi|ɯˀi|ɤˀi|eiː|ɑiː|oiː|uiː|ɯiː|ɤiː|ieˀ|uoˀ|iːe|uːo|uoi|ieu|æˀ|ɑˀ|eˀ|ɤˀ|iˀ|oˀ|uˀ|ɯˀ|æː|ɑː|bː|dː|dʲ|eː|ɤː|fː|gː|hː|iː|jː|kː|lː|lʲ|mː|nː|nʲ|ŋː|oː|ɔː|pː|rː|rʲ|sː|ʃː|tː|tʲ|uː|ɯː|vː|zː|ʒː|iu|eu|æu|ɯu|ou|ei|ɑi|oi|ui|ɯi|ɤi|ie|uo|æ|ɑ|b|d|e|ɤ|f|g|h|i|j|k|l|m|n|ŋ|o|p|r|s|ʃ|t|u|ɯ|v|z|ʒ)( (uːˀoi|iːˀe|uːˀo|uːoi|uoˀi|ieˀu|uoiː|ieuː|uːˀi|ɔːˀi|æːˀ|ɑːˀ|eːˀ|ɤːˀ|iːˀ|oːˀ|ɔːˀ|uːˀ|ɯːˀ|dʲː|lʲː|nʲː|rʲː|tʲː|eːi|ɑːi|uːi|ɔːi|ɤːi|ɯːi|iˀu|eˀu|æˀu|ɯˀu|oˀu|iːu|iuː|euː|æuː|ɯuː|ouː|eˀi|ɑˀi|uˀi|oˀi|ɯˀi|ɤˀi|eiː|ɑiː|oiː|uiː|ɯiː|ɤiː|ieˀ|uoˀ|iːe|uːo|uoi|ieu|æˀ|ɑˀ|eˀ|ɤˀ|iˀ|oˀ|uˀ|ɯˀ|æː|ɑː|bː|dː|dʲ|eː|ɤː|fː|gː|hː|iː|jː|kː|lː|lʲ|mː|nː|nʲ|ŋː|oː|ɔː|pː|rː|rʲ|sː|ʃː|tː|tʲ|uː|ɯː|vː|zː|ʒː|iu|eu|æu|ɯu|ou|ei|ɑi|oi|ui|ɯi|ɤi|ie|uo|æ|ɑ|b|d|e|ɤ|f|g|h|i|j|k|l|m|n|ŋ|o|p|r|s|ʃ|t|u|ɯ|v|z|ʒ))*. -
rdfProperty: https://www.paralex-standard.org/paralex_ontology.xml#phon_form missingValues:#DEF#
-
-
analysed_orth_form(string): Inflected form with analysis, such as segmentation markers (orthographic). The form, given orthographically, with markers for analysis.rdfProperty: https://www.paralex-standard.org/paralex_ontology.xml#analysed_orth_formmissingValues:#DEF#
-
analysed_phon_form(string): Inflected form with analysis, such as segmentation markers (phonemic or phonetic). The form, given in phonemic or phonetic notation, with sounds separated by spaces, and analysis markers.rdfProperty: https://www.paralex-standard.org/paralex_ontology.xml#analysed_phon_formmissingValues:#DEF#
-
analysed_phon_form_full(string): Full analysed phonetic forms. Full analysed phonetic forms with legacy transcription (including half-long consonants)missingValues:#DEF#
-
defectiveness_tag(string): Tags for defectiveness status. Identifies sets of defective forms (eg. pluralia tantum).-
constraints: a
defectiveness_tagmust match the regular expression(pluralia_tantum|defective)(\|(pluralia_tantum|defective))*. -
rdfProperty: https://www.paralex-standard.org/paralex_ontology.xml#defectiveness_tag
-
-
overabundance_tag(string): Tags for overabundant forms. Identifies sets of overabundant forms. For example, overabundant forms across lexemes might belong to a series of regular and irregular forms, or a series of short and long forms, etc.-
constraints: a
overabundance_tagmust match the regular expression(illsg_without_z|elasg_without_õ|illsg_with_z|elasg_with_õ|strong_stem|weak_stem|nom_stem|gen_stem)(\|(illsg_without_z|elasg_without_õ|illsg_with_z|elasg_with_õ|strong_stem|weak_stem|nom_stem|gen_stem))*. -
rdfProperty: https://www.paralex-standard.org/paralex_ontology.xml#overabundance_tag
-
-
stem_syllables(integer): Stem syllables count. Count of syllables in the stem -
tone(string): Tone. Kind of tone of this wordform.- constraints: a
tonemust be one of the values:broken,plain.
- constraints: a
-
stem_grade(string): Stem grade. Stem grade according to Viitso's (2012) typology.- constraints: a
stem_grademust be one of the values:strong,weak.
- constraints: a
graphemes
Graphemes inventory
- This file is located in paraliv/paraliv_graphemes.csv.
- The identifier column (or
primaryKey) is['grapheme_id'] missingValues: ``
Columns defined by graphemes-schema:
-
grapheme_id(string): grapheme representation. These identifiers are specific to graphemes.-
constraints: a
grapheme_idis obligatory; it must be unique.
-
-
comment(string): Comment. Human-readable comment.rdfProperty: http://www.w3.org/2000/01/rdf-schema#comment
-
canonical_order(integer): Sorting order for visual presentation. The order in which items are canonically presented. Use integers to represent relative order, order is used per-item.
lexemes
Lexemes
- This file is located in paraliv/paraliv_lexemes.csv.
- The identifier column (or
primaryKey) is['lexeme_id']
Columns defined by lexemes-schema:
-
lexeme_id(string): Identifier for the lexeme. Lexeme identifiers. Often, they are identical to the label (lemma). However, they must be unique to paradigms, distinguishing homonyms with different inflection. For example, the animal mouse/mice and the computer peripheric mouse/mouses would both have the label 'mouse' but could be identified by the lexeme identifiers mouse_1 and mouse_2.- constraints: a
lexeme_idis obligatory; it must be unique.
- constraints: a
-
label(string): label for this row. A human readable label for the row.rdfProperty: http://www.w3.org/2000/01/rdf-schema#label
-
inflection_class(string): Inflection class identifier. This identifier groups together lexemes of the same inflection class. -
POS(string): Part of Speech. The relevant part of speech for this item. This must refer to a PartOfSpeech entity from the lexinfo (https://lexinfo.net/) ontology.-
constraints: a
POSmust be one of the values:verb,numeral,conjunction,noun,adposition,determiner,article,adverb,pronoun,fusedPreposition,adjective,symbol,particle,conditionalParticle,demonstrativePronoun,interjection,semiColon,diminutiveNoun,possessivePronoun,prepositionalAdverb,compoundPreposition,interrogativeRelativePronoun,possessiveParticle,plainVerb,letter,interrogativeDeterminer,relativePronoun,postposition,fusedPronounAuxiliary,interrogativeOrdinalNumeral,indefiniteOrdinalNumeral,strongPersonalPronoun,possessiveRelativePronoun,ordinalAdjective,collectivePronoun,commonNoun,infinitiveParticle,comparativeParticle,partitiveArticle,invertedComma,lightVerb,emphaticPronoun,distinctiveParticle,genericNumeral,possessiveAdjective,reflexivePossessivePronoun,colon,coordinationParticle,presentParticipleAdjective,fusedPrepositionPronoun,cardinalNumeral,indefiniteDeterminer,numeralFraction,questionMark,generalAdverb,superlativeParticle,point,indefiniteMultiplicativeNumeral,comma,closeParenthesis,futureParticle,personalPronoun,reflexivePersonalPronoun,adverbialPronoun,reciprocalPronoun,openParenthesis,pastParticipleAdjective,negativePronoun,relativeDeterminer,existentialPronoun,pronominalAdverb,relativeParticle,exclamativeDeterminer,multiplicativeNumeral,reflexiveDeterminer,modal,unclassifiedParticle,properNoun,allusivePronoun,interrogativeCardinalNumeral,bullet,subordinatingConjunction,irreflexivePersonalPronoun,possessiveDeterminer,negativeParticle,indefinitePronoun,generalizationWord,coordinatingConjunction,deficientVerb,adjective-i,impersonalPronoun,indefiniteCardinalNumeral,adjective-na,qualifierAdjective,affirmativeParticle,mainVerb,fusedPrepositionDeterminer,indefiniteArticle,weakPersonalPronoun,suspensionPoints,interrogativeMultiplicativeNumeral,affixedPersonalPronoun,auxiliary,circumposition,copula,demonstrativeDeterminer,participleAdjective,exclamativePoint,interrogativePronoun,presentativePronoun,punctuation,definiteArticle,slash,exclamativePronoun,preposition,conditionalPronoun,relationNoun,interrogativeParticle. -
rdfProperty: https://www.paralex-standard.org/paralex_ontology.xml#POS
-
-
frequency(number): Frequency. Frequency for this row.
sounds
Sound inventory with distinctive features
- This file is located in paraliv/paraliv_sounds.csv.
- The identifier column (or
primaryKey) is['sound_id'] missingValues: ``
Columns defined by sounds-schema:
-
sound_id(string): sound representation. These identifiers are specific to sounds.- constraints: a
sound_idis obligatory; it must be unique.
- constraints: a
-
CLTS_id(string): Identifier of this sound in CLTS. Reference to this sound in CLTS data.-
constraints: a
CLTS_idmust be unique. -
rdfProperty: https://www.paralex-standard.org/paralex_ontology.xml#CLTS_id
-
-
label(string): label for this row. A human readable label for the row.rdfProperty: http://www.w3.org/2000/01/rdf-schema#label
-
syllabic(any) -
stress(any) -
long(any) -
consonantal(any) -
sonorant(any) -
continuant(any) -
delayed release(any) -
approximant(any) -
trill(any) -
nasal(any) -
voice(any) -
spread gl(any) -
constr gl(any) -
LABIAL(any) -
round(any) -
labiodental(any) -
CORONAL(any) -
anterior(any) -
distributed(any) -
strident(any) -
lateral(any) -
DORSAL(any) -
high(any) -
low(any) -
pre-low(any) -
front(any) -
back(any) -
tense(any) -
diphthong(any) -
triphthong(any) -
diph_front(any) -
diph_high(any) -
diph_low(any) -
diph_long(any) -
C_high(any) -
C_front(any) -
palatalised(any) -
comment(string): Comment. Human-readable comment.rdfProperty: http://www.w3.org/2000/01/rdf-schema#comment
tags
Tags mark rows which have commonalities
- This file is located in paraliv/paraliv_tags.csv.
- The identifier column (or
primaryKey) is['tag_id']
Columns defined by tags-schema:
-
tag_id(string): Tag id. The label for a set of forms which have something in common.- constraints: a
tag_idis obligatory; it must be unique.
- constraints: a
-
tag_column_name(string): Name of the tag column in the forms table. The name of the column this tag is used in the forms table- constraints: a
tag_column_nameis obligatory; it must match the regular expression[^ ]+_tag.
- constraints: a
-
comment(string): Comment. Human-readable comment.rdfProperty: http://www.w3.org/2000/01/rdf-schema#comment
data_sheet
Data Sheet| Data Sheet
- This file is located in
docs/data_sheet.md.
readme
Read me| Basic documentation
- This file is located in
README.md.