feat: add unified CLI, vocabulary build and lexical filter
85
package/dist/CC-BY-SA-2.0.txt
vendored
Normal file
@@ -0,0 +1,85 @@
Creative Commons Attribution-ShareAlike 2.0
CREATIVE COMMONS CORPORATION IS NOT A LAW FIRM AND DOES NOT PROVIDE LEGAL SERVICES. DISTRIBUTION OF THIS LICENSE DOES NOT CREATE AN ATTORNEY-CLIENT RELATIONSHIP. CREATIVE COMMONS PROVIDES THIS INFORMATION ON AN "AS-IS" BASIS. CREATIVE COMMONS MAKES NO WARRANTIES REGARDING THE INFORMATION PROVIDED, AND DISCLAIMS LIABILITY FOR DAMAGES RESULTING FROM ITS USE.
License
THE WORK (AS DEFINED BELOW) IS PROVIDED UNDER THE TERMS OF THIS CREATIVE COMMONS PUBLIC LICENSE ("CCPL" OR "LICENSE"). THE WORK IS PROTECTED BY COPYRIGHT AND/OR OTHER APPLICABLE LAW. ANY USE OF THE WORK OTHER THAN AS AUTHORIZED UNDER THIS LICENSE OR COPYRIGHT LAW IS PROHIBITED.
BY EXERCISING ANY RIGHTS TO THE WORK PROVIDED HERE, YOU ACCEPT AND AGREE TO BE BOUND BY THE TERMS OF THIS LICENSE. THE LICENSOR GRANTS YOU THE RIGHTS CONTAINED HERE IN CONSIDERATION OF YOUR ACCEPTANCE OF SUCH TERMS AND CONDITIONS.
1. Definitions
a. "Collective Work" means a work, such as a periodical issue, anthology or encyclopedia, in which the Work in its entirety in unmodified form, along with a number of other contributions, constituting separate and independent works in themselves, are assembled into a collective whole. A work that constitutes a Collective Work will not be considered a Derivative Work (as defined below) for the purposes of this License.
b. "Derivative Work" means a work based upon the Work or upon the Work and other pre-existing works, such as a translation, musical arrangement, dramatization, fictionalization, motion picture version, sound recording, art reproduction, abridgment, condensation, or any other form in which the Work may be recast, transformed, or adapted, except that a work that constitutes a Collective Work will not be considered a Derivative Work for the purpose of this License. For the avoidance of doubt, where the Work is a musical composition or sound recording, the synchronization of the Work in timed-relation with a moving image ("synching") will be considered a Derivative Work for the purpose of this License.
c. "Licensor" means the individual or entity that offers the Work under the terms of this License.
d. "Original Author" means the individual or entity who created the Work.
e. "Work" means the copyrightable work of authorship offered under the terms of this License.
f. "You" means an individual or entity exercising rights under this License who has not previously violated the terms of this License with respect to the Work, or who has received express permission from the Licensor to exercise rights under this License despite a previous violation.
g. "License Elements" means the following high-level license attributes as selected by Licensor and indicated in the title of this License: Attribution, ShareAlike.
2. Fair Use Rights. Nothing in this license is intended to reduce, limit, or restrict any rights arising from fair use, first sale or other limitations on the exclusive rights of the copyright owner under copyright law or other applicable laws.
3. License Grant. Subject to the terms and conditions of this License, Licensor hereby grants You a worldwide, royalty-free, non-exclusive, perpetual (for the duration of the applicable copyright) license to exercise the rights in the Work as stated below:
a. to reproduce the Work, to incorporate the Work into one or more Collective Works, and to reproduce the Work as incorporated in the Collective Works;
b. to create and reproduce Derivative Works;
c. to distribute copies or phonorecords of, display publicly, perform publicly, and perform publicly by means of a digital audio transmission the Work including as incorporated in Collective Works;
d. to distribute copies or phonorecords of, display publicly, perform publicly, and perform publicly by means of a digital audio transmission Derivative Works.
e. For the avoidance of doubt, where the work is a musical composition:
i. Performance Royalties Under Blanket Licenses. Licensor waives the exclusive right to collect, whether individually or via a performance rights society (e.g. ASCAP, BMI, SESAC), royalties for the public performance or public digital performance (e.g. webcast) of the Work.
ii. Mechanical Rights and Statutory Royalties. Licensor waives the exclusive right to collect, whether individually or via a music rights society or designated agent (e.g. Harry Fox Agency), royalties for any phonorecord You create from the Work ("cover version") and distribute, subject to the compulsory license created by 17 USC Section 115 of the US Copyright Act (or the equivalent in other jurisdictions).
f. Webcasting Rights and Statutory Royalties. For the avoidance of doubt, where the Work is a sound recording, Licensor waives the exclusive right to collect, whether individually or via a performance-rights society (e.g. SoundExchange), royalties for the public digital performance (e.g. webcast) of the Work, subject to the compulsory license created by 17 USC Section 114 of the US Copyright Act (or the equivalent in other jurisdictions).
The above rights may be exercised in all media and formats whether now known or hereafter devised. The above rights include the right to make such modifications as are technically necessary to exercise the rights in other media and formats. All rights not expressly granted by Licensor are hereby reserved.
4. Restrictions. The license granted in Section 3 above is expressly made subject to and limited by the following restrictions:
a. You may distribute, publicly display, publicly perform, or publicly digitally perform the Work only under the terms of this License, and You must include a copy of, or the Uniform Resource Identifier for, this License with every copy or phonorecord of the Work You distribute, publicly display, publicly perform, or publicly digitally perform. You may not offer or impose any terms on the Work that alter or restrict the terms of this License or the recipients' exercise of the rights granted hereunder. You may not sublicense the Work. You must keep intact all notices that refer to this License and to the disclaimer of warranties. You may not distribute, publicly display, publicly perform, or publicly digitally perform the Work with any technological measures that control access or use of the Work in a manner inconsistent with the terms of this License Agreement. The above applies to the Work as incorporated in a Collective Work, but this does not require the Collective Work apart from the Work itself to be made subject to the terms of this License. If You create a Collective Work, upon notice from any Licensor You must, to the extent practicable, remove from the Collective Work any reference to such Licensor or the Original Author, as requested. If You create a Derivative Work, upon notice from any Licensor You must, to the extent practicable, remove from the Derivative Work any reference to such Licensor or the Original Author, as requested.
b. You may distribute, publicly display, publicly perform, or publicly digitally perform a Derivative Work only under the terms of this License, a later version of this License with the same License Elements as this License, or a Creative Commons iCommons license that contains the same License Elements as this License (e.g. Attribution-ShareAlike 2.0 Japan). You must include a copy of, or the Uniform Resource Identifier for, this License or other license specified in the previous sentence with every copy or phonorecord of each Derivative Work You distribute, publicly display, publicly perform, or publicly digitally perform. You may not offer or impose any terms on the Derivative Works that alter or restrict the terms of this License or the recipients' exercise of the rights granted hereunder, and You must keep intact all notices that refer to this License and to the disclaimer of warranties. You may not distribute, publicly display, publicly perform, or publicly digitally perform the Derivative Work with any technological measures that control access or use of the Work in a manner inconsistent with the terms of this License Agreement. The above applies to the Derivative Work as incorporated in a Collective Work, but this does not require the Collective Work apart from the Derivative Work itself to be made subject to the terms of this License.
c. If you distribute, publicly display, publicly perform, or publicly digitally perform the Work or any Derivative Works or Collective Works, You must keep intact all copyright notices for the Work and give the Original Author credit reasonable to the medium or means You are utilizing by conveying the name (or pseudonym if applicable) of the Original Author if supplied; the title of the Work if supplied; to the extent reasonably practicable, the Uniform Resource Identifier, if any, that Licensor specifies to be associated with the Work, unless such URI does not refer to the copyright notice or licensing information for the Work; and in the case of a Derivative Work, a credit identifying the use of the Work in the Derivative Work (e.g., "French translation of the Work by Original Author," or "Screenplay based on original Work by Original Author"). Such credit may be implemented in any reasonable manner; provided, however, that in the case of a Derivative Work or Collective Work, at a minimum such credit will appear where any other comparable authorship credit appears and in a manner at least as prominent as such other comparable authorship credit.
5. Representations, Warranties and Disclaimer
UNLESS OTHERWISE AGREED TO BY THE PARTIES IN WRITING, LICENSOR OFFERS THE WORK AS-IS AND MAKES NO REPRESENTATIONS OR WARRANTIES OF ANY KIND CONCERNING THE MATERIALS, EXPRESS, IMPLIED, STATUTORY OR OTHERWISE, INCLUDING, WITHOUT LIMITATION, WARRANTIES OF TITLE, MERCHANTIBILITY, FITNESS FOR A PARTICULAR PURPOSE, NONINFRINGEMENT, OR THE ABSENCE OF LATENT OR OTHER DEFECTS, ACCURACY, OR THE PRESENCE OF ABSENCE OF ERRORS, WHETHER OR NOT DISCOVERABLE. SOME JURISDICTIONS DO NOT ALLOW THE EXCLUSION OF IMPLIED WARRANTIES, SO SUCH EXCLUSION MAY NOT APPLY TO YOU.
6. Limitation on Liability. EXCEPT TO THE EXTENT REQUIRED BY APPLICABLE LAW, IN NO EVENT WILL LICENSOR BE LIABLE TO YOU ON ANY LEGAL THEORY FOR ANY SPECIAL, INCIDENTAL, CONSEQUENTIAL, PUNITIVE OR EXEMPLARY DAMAGES ARISING OUT OF THIS LICENSE OR THE USE OF THE WORK, EVEN IF LICENSOR HAS BEEN ADVISED OF THE POSSIBILITY OF SUCH DAMAGES.
7. Termination
a. This License and the rights granted hereunder will terminate automatically upon any breach by You of the terms of this License. Individuals or entities who have received Derivative Works or Collective Works from You under this License, however, will not have their licenses terminated provided such individuals or entities remain in full compliance with those licenses. Sections 1, 2, 5, 6, 7, and 8 will survive any termination of this License.
b. Subject to the above terms and conditions, the license granted here is perpetual (for the duration of the applicable copyright in the Work). Notwithstanding the above, Licensor reserves the right to release the Work under different license terms or to stop distributing the Work at any time; provided, however that any such election will not serve to withdraw this License (or any other license that has been, or is required to be, granted under the terms of this License), and this License will continue in full force and effect unless terminated as stated above.
8. Miscellaneous
a. Each time You distribute or publicly digitally perform the Work or a Collective Work, the Licensor offers to the recipient a license to the Work on the same terms and conditions as the license granted to You under this License.
b. Each time You distribute or publicly digitally perform a Derivative Work, Licensor offers to the recipient a license to the original Work on the same terms and conditions as the license granted to You under this License.
c. If any provision of this License is invalid or unenforceable under applicable law, it shall not affect the validity or enforceability of the remainder of the terms of this License, and without further action by the parties to this agreement, such provision shall be reformed to the minimum extent necessary to make such provision valid and enforceable.
d. No term or provision of this License shall be deemed waived and no breach consented to unless such waiver or consent shall be in writing and signed by the party to be charged with such waiver or consent.
e. This License constitutes the entire agreement between the parties with respect to the Work licensed here. There are no understandings, agreements or representations with respect to the Work not specified here. Licensor shall not be bound by any additional provisions that may appear in any communication from You. This License may not be modified without the mutual written agreement of the Licensor and You.
Creative Commons is not a party to this License, and makes no warranty whatsoever in connection with the Work. Creative Commons will not be liable to You or any party on any legal theory for any damages whatsoever, including without limitation any general, special, incidental or consequential damages arising in connection to this license. Notwithstanding the foregoing two (2) sentences, if Creative Commons has expressly identified itself as the Licensor hereunder, it shall have all rights and obligations of Licensor.
Except for the limited purpose of indicating to the public that the Work is licensed under the CCPL, neither party will use the trademark "Creative Commons" or any related trademark or logo of Creative Commons without the prior written consent of Creative Commons. Any permitted use will be in compliance with Creative Commons' then-current trademark usage guidelines, as may be published on its website or otherwise made available upon request from time to time.
Creative Commons may be contacted at http://creativecommons.org/.
14
package/dist/index.d.ts
vendored
Normal file
@@ -0,0 +1,14 @@
/**
 * @license
 * Copyright 2021 Ludan Stoecklé
 * SPDX-License-Identifier: Apache-2.0
 */
export type Genders = 'M' | 'F';
export interface WordInfo {
  G: Genders | null;
  S?: string | null;
  P: string | null;
}
export interface WordsInfo {
  [key: string]: WordInfo;
}
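A minimal sketch of how these declarations might be consumed. The diff does not document the fields, so this assumes that `G` is the grammatical gender, `S` the singular form and `P` the plural form, keyed by lemma. The `pluralOf` helper and the sample entries are hypothetical and are not part of the package.

```typescript
// Types repeated from index.d.ts so the sketch is self-contained.
type Genders = 'M' | 'F';
interface WordInfo {
  G: Genders | null;
  S?: string | null;
  P: string | null;
}
interface WordsInfo {
  [key: string]: WordInfo;
}

// Hypothetical sample data; the real entries live in dist/words.json.
const sample: WordsInfo = {
  cameriere: { G: 'M', S: 'cameriere', P: 'camerieri' },
  cameriera: { G: 'F', S: 'cameriera', P: 'cameriere' },
};

// Assumption: P holds the plural form, or null when none is recorded.
// Returns null as well when the lemma is not in the table.
function pluralOf(words: WordsInfo, lemma: string): string | null {
  if (!Object.prototype.hasOwnProperty.call(words, lemma)) return null;
  return words[lemma].P;
}
```

The own-property check keeps lookups such as `toString` from resolving to inherited members of the plain object.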
8
package/dist/index.js
vendored
Normal file
@@ -0,0 +1,8 @@
"use strict";
/**
 * @license
 * Copyright 2021 Ludan Stoecklé
 * SPDX-License-Identifier: Apache-2.0
 */
Object.defineProperty(exports, "__esModule", { value: true });
//# sourceMappingURL=index.js.map
1
package/dist/index.js.map
vendored
Normal file
@@ -0,0 +1 @@
{"version":3,"file":"index.js","sourceRoot":"","sources":["../src/index.ts"],"names":[],"mappings":";AAAA;;;;GAIG"}
415
package/dist/readme-morph-it.txt
vendored
Normal file
@@ -0,0 +1,415 @@
===================================================================
Morph-it!

A free morphological lexicon for the Italian Language
===================================================================

version 0.4.8
February 23 2009

*******************************************************************
THIS README IS NOT REALLY UP TO DATE
A NEW VERSION OF THIS
README FILE WILL BE
RELEASED (HOPEFULLY) SOON
(BUT I WOULDN'T COUNT ON THAT...)
*******************************************************************

Copyright (c) 2004-2009
Marco Baroni (marco.baroni@unitn.it)
Eros Zanchetta (eros@sslmit.unibo.it)

http://sslmit.unibo.it/morphit


Morph-it! is a free (as in free speech and in free beer) morphological
resource for the Italian language.

Morph-it! is a lexicon of inflected forms with their lemma and
morphological features. For example:

    gattini        gattino     NOUN-M:p
    andarono       andare      VER:ind+past+3+p
    fastidiosetto  fastidioso  ADJ:dim+m+s

As of version 0.4.7 the list contains 504,906 entries and 34,968
lemmas.

Morph-it! can be used as a data source for an Italian lemmatizer /
morphological analyzer / morphological generator.

As example applications, on the Morph-it! site you can download the
lexicon compiled for the SFST [1] and Finite State Utilities [2]
packages.

The data for Morph-it! were prepared by Marco Baroni and Eros
Zanchetta using a mixture of corpus-based methods,
regular-expression-based rules and manual checking. We are currently
writing a paper that describes the procedure we used to build the
resource.

Morph-it! is still under development and there may still be gaps,
unlikely forms, etc. We will be very grateful if you let us know
about missing forms, problems, and ideas/resources that can help
us expand or clean the list (sslmitdevonline@sslmit.unibo.it).

Notice in particular that, since we extracted data from an Italian
newspaper corpus (the la Repubblica corpus, also accessible from our
site), we have many gaps in basic, every-day vocabulary.

Also, the current version does not distinguish between coordinative
and subordinative conjunctions. We plan to do this in the near
future. More generally, we are not fully satisfied with our current
features for function words, and we plan to revise them.

A more ambitious plan we would like to pursue is the identification
of derivational structure and derivationally related lemmas. Then, we
will add full semantic representations. Then, we will take over the
world and reign supreme for the next 100 years.

The remainder of this document contains a commented list of the
morphological features used in the lexicon, licensing information and
acknowledgments.


FEATURES
========

We distinguish between derivational features, which pertain to the
lemma, and inflectional features, which pertain to the wordform.

Derivational and inflectional features are separated by a colon.

The derivational features are in upper case and they are
dash-delimited. The inflectional features are in lower case and they
are plus-sign-delimited.

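The tag syntax described above can be sketched as a small parser. This is only an illustration of the colon / dash / plus-sign convention; the `ParsedTag` type and the `parseTag` function are hypothetical names and are not part of Morph-it! or of this package.

```typescript
// Parsed form of a Morph-it! feature tag such as "NOUN-M:p".
interface ParsedTag {
  derivational: string[]; // upper case, dash-delimited, e.g. ["NOUN", "M"]
  inflectional: string[]; // lower case, plus-sign-delimited, e.g. ["p"]
}

function parseTag(tag: string): ParsedTag {
  // The colon separates derivational features (left) from inflectional ones (right).
  const colon = tag.indexOf(':');
  const left = colon === -1 ? tag : tag.slice(0, colon);
  const right = colon === -1 ? '' : tag.slice(colon + 1);
  return {
    derivational: left.split('-'),
    // Tags such as "DET-NUM-CARD" carry no inflectional part at all.
    inflectional: right === '' ? [] : right.split('+'),
  };
}
```

For instance, `parseTag('NOUN-M:p')` yields the derivational features `NOUN` and `M` and the single inflectional feature `p`.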
For example, we represent gender as a derivational feature of nouns
(we take "cameriere" and "cameriera" to belong to different lemmas),
whereas we treat number as an inflectional feature of nouns. Thus,
gender and number are represented as in the following examples:

    cameriere  cameriera  NOUN-F:p
    cameriera  cameriera  NOUN-F:s
    camerieri  cameriere  NOUN-M:p
    cameriere  cameriere  NOUN-M:s

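Because gender belongs to the lemma and number to the wordform, noun entries can be grouped by lemma into one record holding the gender plus a singular and a plural form. The sketch below assumes whitespace-separated "form lemma tag" lines, as displayed in the examples above; `NounForms` and `collectNouns` are hypothetical names, and keying by lemma alone assumes a lemma never appears with both genders.

```typescript
type Gender = 'M' | 'F';

interface NounForms {
  gender: Gender;
  singular: string | null;
  plural: string | null;
}

// Groups plain noun entries ("form lemma NOUN-<gender>:<number>") by lemma.
function collectNouns(lines: string[]): Map<string, NounForms> {
  const nouns = new Map<string, NounForms>();
  for (const line of lines) {
    const [form, lemma, tag] = line.trim().split(/\s+/);
    if (!form || !lemma || !tag) continue; // malformed line
    const m = /^NOUN-([MF]):([sp])$/.exec(tag);
    if (!m) continue; // not a plain noun entry
    const entry: NounForms =
      nouns.get(lemma) ?? { gender: m[1] as Gender, singular: null, plural: null };
    if (m[2] === 's') entry.singular = form;
    else entry.plural = form;
    nouns.set(lemma, entry);
  }
  return nouns;
}

const sampleNouns = collectNouns([
  'cameriere cameriera NOUN-F:p',
  'cameriera cameriera NOUN-F:s',
  'camerieri cameriere NOUN-M:p',
  'cameriere cameriere NOUN-M:s',
]);
```

With the four example lines, the wordform "cameriere" ends up as the plural of the feminine lemma "cameriera" and as the singular of the masculine lemma "cameriere".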
For adjectives, gender is considered an inflectional feature. Thus,
gender is represented differently in adjectives and nouns:

    azzurre  azzurra  NOUN-F:p
    azzurra  azzurra  NOUN-F:s
    azzurri  azzurro  NOUN-M:p
    azzurro  azzurro  NOUN-M:s

    azzurra  azzurro  ADJ:pos+f+s
    azzurri  azzurro  ADJ:pos+m+p
    azzurro  azzurro  ADJ:pos+m+s
    azzurre  azzurro  ADJ:pos+f+p

Changes that are purely orthographical/phonological but do not affect
morphology/syntax/meaning are not reflected in the features. For
example, the following variants of "cento" share the same lemma and
the same features:

    cent'  cento  DET-NUM-CARD
    cento  cento  DET-NUM-CARD

We now present the full list of features we used, organized by major
syntactic categories.

ABL

Abbreviated locutions, such as "a.C.", "ecc." and "i.e."

ADJ

Adjectives, with the following inflectional features:

pos/comp/sup

That is: positive, comparative, superlative. Although these are not
true inflectional features, given their high productivity we decided
to represent them as properties of inflected forms.

f/m

That is: feminine, masculine.

s/p

That is: singular, plural.

ADV

Adverbs.

ART

Articles, with gender as a derivational feature (F/M) and number as an
inflectional feature (s/p).

ARTPRE

Preposition+article compounds ("col", "della", "nei"...), with gender
as a derivational feature (F/M) and number as an inflectional feature
(s/p).

ASP

Aspectuals ("stare" in "stare per"). Same inflectional features as VER
(see below).

AUX

Auxiliaries ("essere", "avere", "venire"). Same inflectional features
as VER (see below).

CAU

Causatives ("fare" in "far sapere"). Same inflectional features as VER
(see below).

CE

Clitic "ce" as in "ce l'ho fatta".

CI

Clitic "ci" as in "ci prova".

CON

Conjunctions.

DET-DEMO

Demonstrative determiners (such as "questa" in "questa sera"), with
inflectional gender (f/m) and number (s/p) features.

DET-INDEF

Indefinite determiners (such as "molti" in "molti amici") with
inflectional gender (f/m) and number (s/p) features.

DET-NUM-CARD

Cardinal number determiners (e.g., "cinque" in "cinque
amici"). Pure-digit numbers are not included (i.e., the list includes
"100mila" but not "100000" nor "100,000", "100.000", etc.)

DET-POSS

Possessive determiners (e.g., "mio", "suo"), with inflectional gender
(f/m) and number (s/p) features.

DET-WH

Wh determiners (e.g., "quale" in "quale amico"), with inflectional
gender (f/m) and number (s/p) features.

INT

Interjections.

MOD

Modal verbs (e.g. "dover" in "dover ricostruire"). Same inflectional
features as VER (see below).

NE

Clitic "ne" (as in: "ne hanno molte").

NOUN

Nouns, with gender as a derivational feature (F/M) and number as an
inflectional feature (s/p).

PON

Non-sentential punctuation marks (e.g. , " $).

PRE

Prepositions.

PRO-DEMO

Demonstrative pronouns (e.g. "questa" in "voglio questa"), with both
gender and number as derivational features (F/M, S/P).

PRO-INDEF

Indefinite pronouns (e.g., "molti" in "vengono molti"), with both
gender and number as derivational features (F/M, S/P).

PRO-NUM

Numeral pronouns (e.g., "cinque" in "cinque sono
sopravvissuti"). Pure-digit numbers are not included (e.g., the list
includes "100mila" but not 100000 nor 100,000, 100.000, etc.)

PRO-PERS

Personal pronouns, such as "lui" and "loro". Clitic possessive
pronouns (such as pronominal "lo" and "si") are marked by the
derivational feature CLI. Person, gender and number are also encoded
as derivational features (1/2/3, F/M, S/P).

PRO-POSS

Possessive pronouns (such as "loro" in "non era uno dei loro"), with
gender and number encoded as derivational features (F/M, S/P).

PRO-WH

Wh-pronouns, such as "quale" in "quale e' venuto?"

SENT

End of sentence marker (! . ... : ?).

SI

Clitic "si" as in "di cui si discute".

TALE

"Tale" in constructions such as "una fortuna tale che...", "la tal
cosa", "tali amici", etc. Gender (f/m) and number (s/p) as
inflectional features.

VER

Verbs, with the following inflectional features:

cond/ger/impr/ind/inf/part/sub

Conditional, gerundive, imperative, indicative, infinitive,
participle, subjunctive.

pre/past/impf/fut

Present, past, imperfective, future.

1/2/3

Person.

s/p

Number.

f/m

Gender (only relevant for participles).

cela/cele/celi/celo/cene/ci/gli/gliela/gliele/glieli/glielo/gliene/la/
le/li/lo/mela/mele/meli/melo/mene/mi/ne/sela/sele/seli/selo/sene/si/
tela/tele/teli/telo/tene/ti/vela/vele/veli/velo/vene/vi

Clitics attached to the verb.

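As an illustration, the verb feature inventory listed above can be used to sort the plus-delimited part of a VER tag into named slots. This is a sketch, not the lexicon's own tooling: `VerbFeatures` and `decodeVerb` are hypothetical names, the value sets are copied from this README as written, and any value not listed here is collected under `unknown` rather than guessed at.

```typescript
interface VerbFeatures {
  mood: string | null;
  tense: string | null;
  person: string | null;
  number: string | null;
  gender: string | null;
  clitics: string[];
  unknown: string[]; // values not listed in the README
}

const MOODS = new Set(['cond', 'ger', 'impr', 'ind', 'inf', 'part', 'sub']);
const TENSES = new Set(['pre', 'past', 'impf', 'fut']);
const CLITICS = new Set(
  ('cela/cele/celi/celo/cene/ci/gli/gliela/gliele/glieli/glielo/gliene/la/' +
    'le/li/lo/mela/mele/meli/melo/mene/mi/ne/sela/sele/seli/selo/sene/si/' +
    'tela/tele/teli/telo/tene/ti/vela/vele/veli/velo/vene/vi').split('/')
);

// Takes the inflectional part of a VER tag, e.g. "ind+past+3+p".
function decodeVerb(inflectional: string): VerbFeatures {
  const out: VerbFeatures = {
    mood: null, tense: null, person: null, number: null, gender: null,
    clitics: [], unknown: [],
  };
  for (const f of inflectional.split('+')) {
    if (f === '') continue;
    if (MOODS.has(f)) out.mood = f;
    else if (TENSES.has(f)) out.tense = f;
    else if (f === '1' || f === '2' || f === '3') out.person = f;
    else if (f === 's' || f === 'p') out.number = f;
    else if (f === 'f' || f === 'm') out.gender = f;
    else if (CLITICS.has(f)) out.clitics.push(f);
    else out.unknown.push(f);
  }
  return out;
}
```

Keeping an explicit `unknown` bucket makes it easy to spot where the data uses a value this (admittedly out-of-date) README does not list.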
WH

Wh elements ("come", "qualora", "quando"...)

WH-CHE

"Che" as a wh element (e.g., "l'uomo che hai visto", "hai detto che").


LICENSING INFORMATION
=====================

This program is dual-licensed free software; you can redistribute it
and/or modify it under the terms of the Creative Commons
Attribution ShareAlike 2.0 License and the GNU Lesser General Public
License.

***********************************************
* Creative Commons Attribution ShareAlike 2.0 *
***********************************************

Morph-it! is licensed under the Creative Commons Attribution
ShareAlike 2.0 License.

You are free:

- to copy, distribute and display the resource;
- to make derivative works;
- to make commercial use of the resource;

under the following conditions:

- you must give the original authors credit;
- if you alter, transform, or build upon this work, you may distribute
  the resulting work only under a license identical to this one;
- for any reuse or distribution, you must make clear to others the
  license terms of this work;
- any of these conditions can be waived if you get permission from the
  copyright holders.

Your fair use and other rights are in no way affected by the above.

You can find a link to the full license from the Morph-it! website.

Copyright (C) 2004-2007 Marco Baroni and Eros Zanchetta.

*************************************
* GNU Lesser General Public License *
*************************************

Morph-it! A free morphological lexicon for the Italian Language
Copyright (C) 2004-2007 Marco Baroni and Eros Zanchetta

This program is free software; you can redistribute it and/or modify
it under the terms of the GNU General Public License as published by
the Free Software Foundation; either version 2 of the License, or
(at your option) any later version.

This program is distributed in the hope that it will be useful,
but WITHOUT ANY WARRANTY; without even the implied warranty of
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
GNU General Public License for more details.

You should have received a copy of the GNU General Public License along
with this program; if not, write to the Free Software Foundation, Inc.,
51 Franklin Street, Fifth Floor, Boston, MA 02110-1301 USA.

ACKNOWLEDGMENTS
===============

The main data source for the Morph-it! lexicon was the "la Repubblica"
corpus. Thus, we would like to thank the colleagues who developed this
resource with us: Lorenzo Piccioni, Guy Aston, Silvia Bernardini,
Federica Comastri, Alessandra Volpi, Marco Mazzoleni.

We would like to thank the developers of the tools we used to tag,
lemmatize and index the Repubblica corpus: the (Italian) TreeTagger
(Helmut Schmid, Achim Stein), the ACOPOST taggers (Ingo Schroeder) and
the IMS Corpus WorkBench (Oli Christ, Arne Fitschen and Stefan Evert).

Thanks to Helmut Schmid also for converting the Morph-it! lexicon into
an SFST transducer.

We would like to thank Aldo Calpini, who developed the Perl module
Lingua::IT::Conjugate.

We are also very grateful to Jan Daciuk for creating his finite-state
utilities and for helping us learn to use them.

Finally, a big thanks to the members of the FoLUG, SannioLUG and
Scuola (software libero nella scuola) mailing lists, for advice about
licensing and dissemination.

...and kudos to Lorenzo for creating and maintaining the SSLMITDev
site!


FOOTNOTES
=========

[1] http://www.ims.uni-stuttgart.de/projekte/gramotron/SOFTWARE/SFST.html
[2] http://juggernaut.eti.pg.gda.pl/~jandac/fsa.html
1
package/dist/words.json
vendored
Normal file
File diff suppressed because one or more lines are too long