StyloMetrix Tutorial#

This notebook presents full instructions for using StyloMetrix with examples.

J.Busse 2025-11-09: Dies ist eine modifizierte Fassung von

Unser Korpus ist auf DE. Das Original-Tutorial verwendet EN als Sprache. Selbstverständlich haben wir auch den englischen lg-Korpus insgtalliert:

python -m spacy download en_core_web_lg --direct

Problem: Wenn wir StyloMetrix in der hier vorliegenden Installation mit en starten:

stylo = sm.StyloMetrix('en') # define langauge, one of ('de','en', 'pl', 'ru', 'ukr')

Erhalten wir die Fehlermeldung

[E050] Can’t find model ‘en_core_web_trf’. It doesn’t seem to be a Python package or a valid path to a data directory.

Ursache: Wir haben den lg installiert, aber nicht trf. Lösung: trf installieren? Problem: Die Installation ist riesig, nicht gewollt für das dsci-lab.

ABER wir müssen dieses Problem ja auch gar nicht lösen, denn uns interessiert ja vorallem DE als Sprache. Entsprechend haben wir dieses Tutorial auf DE Beispiele umgestellt.

1. Quick start#

StyloMetrix is a tool for stylometric analysis of texts. It is based on Spacy and supports four languages. In order for the tool to work properly, a corresponding language model is required. Below is the list of supported languages and their corresponding models:

The model must be downloaded and installed in the environment where SM will be used. StyloMetrix is installed using pip install stylo_metrix.

The following shows how to quickly calculate metrics for several texts. Please remember everything presented in this tutorial can be applied to all supported languages.

# import library
import stylo_metrix as sm
---------------------------------------------------------------------------
ModuleNotFoundError                       Traceback (most recent call last)
Cell In[1], line 2
      1 # import library
----> 2 import stylo_metrix as sm

ModuleNotFoundError: No module named 'stylo_metrix'
# example texts

texts = [ 
    # jbusse
    """Im Rahmen von Forschungs und Lehre müssen Leistungen individuell zuschreibbar sein. 
Aber auch moderne generative KI kann inzwischen Texte generieren. 
Problem: Bei einem einem “gut formulierten” Text ist es sehr schwer zu unterscheiden,
*   ob der Text ein von der KI generierter Text ist, den ein Mensch in Auftrag gegeben hat, oder
*   ob der Text ein vom Menschen selbst formulierter Rohtext ist, der von der KI lektoriert wurde.""",
    """Unser Prüfungsrecht verlangt, dass eine Leistung individuell zugeschrieben werden kann. 
Die KI-Leitlinie Bayern verlangt, dass KI-generierte Inhalte eindeutig gekennzeichnet werden. 
Es stellt sich die Frage, wie man in Studienarbeiten und Bachelorabeiten, 
aber auch Präsentationen etc. die Beiträge von KI auch typografisch kenntlich machen kann. """,
    # ChatGPT
    """In der Forschung und Lehre muss man wissen, wer etwas geschrieben hat. 
Aber heutzutage kann auch KI Texte erstellen.
Das Problem: Wenn ein Text gut geschrieben ist, kann man kaum erkennen, ob
* der Text von einer KI stammt, die jemand beauftragt hat, oder
* der Text zuerst von einem Menschen geschrieben und dann von der KI verbessert wurde.""",
    """Das Prüfungsrecht verlangt, dass man genau erkennen kann, wer eine Leistung erbracht hat.
Die KI-Leitlinie Bayern fordert, dass KI-generierte Inhalte klar gekennzeichnet werden.
Die Frage ist nun, wie man in Studienarbeiten, Bachelorarbeiten oder auch in Präsentationen zeigen kann, 
welche Teile von der KI stammen – auch durch die Gestaltung des Textes."""
    ]
# count metrics
stylo = sm.StyloMetrix('de') # define langauge, one of ('de','en', 'pl', 'ru', 'ukr')
metrics = stylo.transform(texts)
metrics
100%|████████████████████████████████████████████████████████████████████████████████████████████| 4/4 [00:00<00:00, 35.68it/s]
text G_N G_ADJ G_ADV G_V G_VMOD G_NUM G_PART G_ADP G_CONJ ... L_STOP L_TCCT1 L_TCCT5 DESC_PRON_VOC GR_UPPER GR_EMOT GR_LENNY GR_MENTION GR_HASH GR_LINK
0 Im Rahmen von Forschungs und Lehre müssen Leis... 0.200000 0.058824 0.094118 0.141176 0.023529 0.0 0.011765 0.082353 0.058824 ... 0.435294 0.070588 0.129412 0.0 0.035294 0.0 0.0 0.0 0.0 0.0
1 Unser Prüfungsrecht verlangt, dass eine Leistu... 0.203704 0.018519 0.129630 0.185185 0.037037 0.0 0.000000 0.037037 0.092593 ... 0.407407 0.055556 0.111111 0.0 0.018519 0.0 0.0 0.0 0.0 0.0
2 In der Forschung und Lehre muss man wissen, we... 0.166667 0.000000 0.083333 0.222222 0.041667 0.0 0.000000 0.055556 0.083333 ... 0.500000 0.083333 0.138889 0.0 0.041667 0.0 0.0 0.0 0.0 0.0
3 Das Prüfungsrecht verlangt, dass man genau erk... 0.206349 0.015873 0.079365 0.190476 0.031746 0.0 0.000000 0.063492 0.063492 ... 0.428571 0.095238 0.142857 0.0 0.015873 0.0 0.0 0.0 0.0 0.0

4 rows × 169 columns

You can count metrics for just as a string as well.

# You can provide string or list of strings to transform method
metrics_for_one = stylo.transform(texts[0])
metrics_for_one
100%|████████████████████████████████████████████████████████████████████████████████████████████| 1/1 [00:00<00:00, 30.32it/s]
text G_N G_ADJ G_ADV G_V G_VMOD G_NUM G_PART G_ADP G_CONJ ... L_STOP L_TCCT1 L_TCCT5 DESC_PRON_VOC GR_UPPER GR_EMOT GR_LENNY GR_MENTION GR_HASH GR_LINK
0 Im Rahmen von Forschungs und Lehre müssen Leis... 0.2 0.058824 0.094118 0.141176 0.023529 0.0 0.011765 0.082353 0.058824 ... 0.435294 0.070588 0.129412 0.0 0.035294 0.0 0.0 0.0 0.0 0.0

1 rows × 169 columns

2. Create StyloMetrix instance#

This chapter describes in detail the parameters of the sm.StyloMetrix class.

  • The basis for building an SM object is to specify the language in which the processed texts are written. This is done by entering a parameter lang of type string, can be one of:

    • ['english', 'angielski', 'en', 'eng', ] for english,

    • ['polish', 'polski', 'pl', 'pol'] for polish,

    • ['russian', 'rosyjski', 'ru'] for russian,

    • ['ukrainian', 'ukraiński', 'ukr'] for ukrainian.

  • Quite an important parameter is debug, which takes boolean values. When set to True, the result of the transform operation will be two DataFrame objects - the first is the results of the metrics calculation, the second contains information about which tokens were taken into account during the metrics count.

stylo = sm.StyloMetrix('de', debug=True) 
metrics, debug = stylo.transform(texts)
debug
100%|████████████████████████████████████████████████████████████████████████████████████████████| 4/4 [00:00<00:00, 19.69it/s]
text G_N G_ADJ G_ADV G_V G_VMOD G_NUM G_PART G_ADP G_CONJ ... L_STOP L_TCCT1 L_TCCT5 DESC_PRON_VOC GR_UPPER GR_EMOT GR_LENNY GR_MENTION GR_HASH GR_LINK
0 Im Rahmen von Forschungs und Lehre müssen Leis... [Rahmen, Forschungs, Lehre, Leistungen, KI, Te... [moderne, generative, formulierten, generierte... [individuell, zuschreibbar, auch, inzwischen, ... [müssen, sein, kann, generieren, ist, untersch... [müssen, kann] [] [zu] [Im, von, Bei, von, in, vom, von] [und, Aber, ob, oder, ob] ... [Im, von, und, müssen, sein, Aber, auch, kann,... [der] [der, Text] [] [KI, KI, KI] [] [] [] [] []
1 Unser Prüfungsrecht verlangt, dass eine Leistu... [Prüfungsrecht, Leistung, KI-Leitlinie, Bayern... [KI-generierte] [individuell, eindeutig, auch, etc., auch, typ... [verlangt, zugeschrieben, werden, kann, verlan... [kann, kann] [] [] [in, von] [dass, dass, wie, und, aber] ... [Unser, dass, eine, werden, kann, Die, dass, w... [\n] [\n, der] [] [KI] [] [] [] [] []
2 In der Forschung und Lehre muss man wissen, we... [Forschung, Lehre, KI, Texte, Problem, Text, T... [] [heutzutage, auch, gut, kaum, zuerst, dann] [muss, wissen, geschrieben, hat, kann, erstell... [muss, kann, kann] [] [] [In, von, von, von] [und, Aber, Wenn, ob, oder, und] ... [In, der, und, muss, man, wer, etwas, hat, Abe... [der] [der, \n] [] [KI, KI, KI] [] [] [] [] []
3 Das Prüfungsrecht verlangt, dass man genau erk... [Prüfungsrecht, Leistung, KI-Leitlinie, Bayern... [KI-generierte] [genau, klar, nun, auch, auch] [verlangt, erkennen, kann, erbracht, hat, ford... [kann, kann] [] [] [in, in, von, durch] [dass, dass, wie, oder] ... [Das, dass, man, kann, wer, eine, hat, Die, da... [der] [der, \n] [] [KI] [] [] [] [] []

4 rows × 169 columns

  • If we want our results to save automatically, set the save_path parameter. It takes values of type string, which denotes the path to an existing directory where DataFrames are to be saved in csv form.

path = '.'
stylo = sm.StyloMetrix('de', debug=True, save_path=path)

stylo.transform(texts[:2])
100%|████████████████████████████████████████████████████████████████████████████████████████████| 2/2 [00:00<00:00, 31.60it/s]
File saved in location: ./sm_output4.csv
File saved in location: ./sm_debug4.csv

(                                                text       G_N     G_ADJ  \
 0  Im Rahmen von Forschungs und Lehre müssen Leis...  0.200000  0.058824   
 1  Unser Prüfungsrecht verlangt, dass eine Leistu...  0.203704  0.018519   
 
       G_ADV       G_V    G_VMOD  G_NUM    G_PART     G_ADP    G_CONJ  ...  \
 0  0.094118  0.141176  0.023529    0.0  0.011765  0.082353  0.058824  ...   
 1  0.129630  0.185185  0.037037    0.0  0.000000  0.037037  0.092593  ...   
 
      L_STOP   L_TCCT1   L_TCCT5  DESC_PRON_VOC  GR_UPPER  GR_EMOT  GR_LENNY  \
 0  0.435294  0.070588  0.129412            0.0  0.035294      0.0       0.0   
 1  0.407407  0.055556  0.111111            0.0  0.018519      0.0       0.0   
 
    GR_MENTION  GR_HASH  GR_LINK  
 0         0.0      0.0      0.0  
 1         0.0      0.0      0.0  
 
 [2 rows x 169 columns],
                                                 text  \
 0  Im Rahmen von Forschungs und Lehre müssen Leis...   
 1  Unser Prüfungsrecht verlangt, dass eine Leistu...   
 
                                                  G_N  \
 0  [Rahmen, Forschungs, Lehre, Leistungen, KI, Te...   
 1  [Prüfungsrecht, Leistung, KI-Leitlinie, Bayern...   
 
                                                G_ADJ  \
 0  [moderne, generative, formulierten, generierte...   
 1                                    [KI-generierte]   
 
                                                G_ADV  \
 0  [individuell, zuschreibbar, auch, inzwischen, ...   
 1  [individuell, eindeutig, auch, etc., auch, typ...   
 
                                                  G_V          G_VMOD G_NUM  \
 0  [müssen, sein, kann, generieren, ist, untersch...  [müssen, kann]    []   
 1  [verlangt, zugeschrieben, werden, kann, verlan...    [kann, kann]    []   
 
   G_PART                              G_ADP                        G_CONJ  \
 0   [zu]  [Im, von, Bei, von, in, vom, von]     [und, Aber, ob, oder, ob]   
 1     []                          [in, von]  [dass, dass, wie, und, aber]   
 
    ...                                             L_STOP L_TCCT1  \
 0  ...  [Im, von, und, müssen, sein, Aber, auch, kann,...   [der]   
 1  ...  [Unser, dass, eine, werden, kann, Die, dass, w...    [\n]   
 
        L_TCCT5 DESC_PRON_VOC      GR_UPPER GR_EMOT GR_LENNY GR_MENTION  \
 0  [der, Text]            []  [KI, KI, KI]      []       []         []   
 1    [\n, der]            []          [KI]      []       []         []   
 
   GR_HASH GR_LINK  
 0      []      []  
 1      []      []  
 
 [2 rows x 169 columns])
  • If you are analyzing numerous lengthy documents and are concerned about lengthy processing times or potential interruptions during the counting process, you have the option to define intermittent saving points. This allows you to generate temporary files as the counting progresses. If the analysis completes without issues, these temporary files are automatically deleted. However, in the event of a disruption, you’ll have temporary files containing results for at least a portion of your documents. Specify save_step parameter (integer) to save results after each save_step processed documents.

path = '.'
stylo = sm.StyloMetrix('de', debug=True, save_path=path, save_step=2)

stylo.transform(texts)
100%|████████████████████████████████████████████████████████████████████████████████████████████| 4/4 [00:00<00:00, 28.16it/s]
File saved in location: ./sm_output5_temp.csv
File saved in location: ./sm_debug5_temp.csv
File saved in location: ./sm_output5_temp.csv
File saved in location: ./sm_debug5_temp.csv
File saved in location: ./sm_output5.csv
File saved in location: ./sm_debug5.csv

(                                                text       G_N     G_ADJ  \
 0  Im Rahmen von Forschungs und Lehre müssen Leis...  0.200000  0.058824   
 1  Unser Prüfungsrecht verlangt, dass eine Leistu...  0.203704  0.018519   
 2  In der Forschung und Lehre muss man wissen, we...  0.166667  0.000000   
 3  Das Prüfungsrecht verlangt, dass man genau erk...  0.206349  0.015873   
 
       G_ADV       G_V    G_VMOD  G_NUM    G_PART     G_ADP    G_CONJ  ...  \
 0  0.094118  0.141176  0.023529    0.0  0.011765  0.082353  0.058824  ...   
 1  0.129630  0.185185  0.037037    0.0  0.000000  0.037037  0.092593  ...   
 2  0.083333  0.222222  0.041667    0.0  0.000000  0.055556  0.083333  ...   
 3  0.079365  0.190476  0.031746    0.0  0.000000  0.063492  0.063492  ...   
 
      L_STOP   L_TCCT1   L_TCCT5  DESC_PRON_VOC  GR_UPPER  GR_EMOT  GR_LENNY  \
 0  0.435294  0.070588  0.129412            0.0  0.035294      0.0       0.0   
 1  0.407407  0.055556  0.111111            0.0  0.018519      0.0       0.0   
 2  0.500000  0.083333  0.138889            0.0  0.041667      0.0       0.0   
 3  0.428571  0.095238  0.142857            0.0  0.015873      0.0       0.0   
 
    GR_MENTION  GR_HASH  GR_LINK  
 0         0.0      0.0      0.0  
 1         0.0      0.0      0.0  
 2         0.0      0.0      0.0  
 3         0.0      0.0      0.0  
 
 [4 rows x 169 columns],
                                                 text  \
 0  Im Rahmen von Forschungs und Lehre müssen Leis...   
 1  Unser Prüfungsrecht verlangt, dass eine Leistu...   
 2  In der Forschung und Lehre muss man wissen, we...   
 3  Das Prüfungsrecht verlangt, dass man genau erk...   
 
                                                  G_N  \
 0  [Rahmen, Forschungs, Lehre, Leistungen, KI, Te...   
 1  [Prüfungsrecht, Leistung, KI-Leitlinie, Bayern...   
 2  [Forschung, Lehre, KI, Texte, Problem, Text, T...   
 3  [Prüfungsrecht, Leistung, KI-Leitlinie, Bayern...   
 
                                                G_ADJ  \
 0  [moderne, generative, formulierten, generierte...   
 1                                    [KI-generierte]   
 2                                                 []   
 3                                    [KI-generierte]   
 
                                                G_ADV  \
 0  [individuell, zuschreibbar, auch, inzwischen, ...   
 1  [individuell, eindeutig, auch, etc., auch, typ...   
 2        [heutzutage, auch, gut, kaum, zuerst, dann]   
 3                     [genau, klar, nun, auch, auch]   
 
                                                  G_V              G_VMOD  \
 0  [müssen, sein, kann, generieren, ist, untersch...      [müssen, kann]   
 1  [verlangt, zugeschrieben, werden, kann, verlan...        [kann, kann]   
 2  [muss, wissen, geschrieben, hat, kann, erstell...  [muss, kann, kann]   
 3  [verlangt, erkennen, kann, erbracht, hat, ford...        [kann, kann]   
 
   G_NUM G_PART                              G_ADP  \
 0    []   [zu]  [Im, von, Bei, von, in, vom, von]   
 1    []     []                          [in, von]   
 2    []     []                [In, von, von, von]   
 3    []     []               [in, in, von, durch]   
 
                              G_CONJ  ...  \
 0         [und, Aber, ob, oder, ob]  ...   
 1      [dass, dass, wie, und, aber]  ...   
 2  [und, Aber, Wenn, ob, oder, und]  ...   
 3           [dass, dass, wie, oder]  ...   
 
                                               L_STOP L_TCCT1      L_TCCT5  \
 0  [Im, von, und, müssen, sein, Aber, auch, kann,...   [der]  [der, Text]   
 1  [Unser, dass, eine, werden, kann, Die, dass, w...    [\n]    [\n, der]   
 2  [In, der, und, muss, man, wer, etwas, hat, Abe...   [der]    [der, \n]   
 3  [Das, dass, man, kann, wer, eine, hat, Die, da...   [der]    [der, \n]   
 
   DESC_PRON_VOC      GR_UPPER GR_EMOT GR_LENNY GR_MENTION GR_HASH GR_LINK  
 0            []  [KI, KI, KI]      []       []         []      []      []  
 1            []          [KI]      []       []         []      []      []  
 2            []  [KI, KI, KI]      []       []         []      []      []  
 3            []          [KI]      []       []         []      []      []  
 
 [4 rows x 169 columns])
  • Moreover, it is possible to set the parameter nlp which denotes a custom Spacy model.

  • By default, all available metrics in a given language are counted. We can modify them with parameters metrics or exceptions. We can choose ourselves the set of metrics we want to calculate and assign it to metrics parameter. As well as we can select all metrics except a given set, in which case such a set is assigned to exceptions parameter. The next section will show how we can select metrics.

3. Selecting metrics#

In order to select metrics, we first need to see what we have to choose from. Please, keep in mind that metrics might differ between languages. We can find out available metrics in the following way:

metrics = sm.get_all_metrics('de')
print(metrics)
0  |  GrammaticalForms  |  G_N  |  Nouns  |  Substantive
1  |  GrammaticalForms  |  G_ADJ  |  Adjectives  |  Adjektive
2  |  GrammaticalForms  |  G_ADV  |  Adverbs  |  Adverbien
3  |  GrammaticalForms  |  G_V  |  Verbs  |  Verben
4  |  GrammaticalForms  |  G_VMOD  |  Modal verbs  |  Modalverben
5  |  GrammaticalForms  |  G_NUM  |  Numerals  |  Numerale
6  |  GrammaticalForms  |  G_PART  |  Particles  |  Partikeln
7  |  GrammaticalForms  |  G_ADP  |  Adpositions  |  Präpositionen
8  |  GrammaticalForms  |  G_CONJ  |  Conjunctions  |  Konjunktionen
9  |  GrammaticalForms  |  G_CCONJ  |  Coordinating conjunctions  |  Koordinierende Konjunktionen
10  |  GrammaticalForms  |  G_SCONJ  |  Conjunctions  |  Subordinierende Konjunktionen
11  |  GrammaticalForms  |  G_PRO  |  Pronouns  |  Pronomen
12  |  GrammaticalForms  |  G_PRO_PRS  |  Personal pronouns  |  Personalpronomen
13  |  GrammaticalForms  |  G_PRO_DEM  |  Demonstrative pronouns  |  Demonstrativpronomen
14  |  GrammaticalForms  |  G_PRO_IND  |  Indefinite pronouns  |  Indefinitpronomen
15  |  GrammaticalForms  |  G_PRO_PIS  |  Substitutive indefinite pronouns  |  Ersetzende Indefinitpronomen
16  |  GrammaticalForms  |  G_PRO_PIAT  |  Attributive indefinite pronouns  |  Attributive Indefinitpronomen
17  |  GrammaticalForms  |  G_PRO_POS  |  Possessive pronouns  |  Possessivpronomen
18  |  GrammaticalForms  |  G_PRO_INT  |  Interrogative pronouns  |  Interrogativpronomen
19  |  GrammaticalForms  |  G_PRO_REL  |  Relative pronouns  |  Relativpronomen
20  |  GrammaticalForms  |  G_PRO_REFL  |  Reflexive pronouns  |  Reflexivpronomen
21  |  GrammaticalForms  |  G_PRO_REZ  |  Reciprocal pronouns  |  Reziprokpronomen
22  |  GrammaticalForms  |  G_PRO_UNPERS  |  Impersonal pronouns  |  Unpersönliche Pronomen
23  |  GrammaticalForms  |  G_PRO_ADV  |  Pronominal adverbs  |  Pronominaladverbien
24  |  GrammaticalForms  |  G_ART  |  Determiners  |  Artikel
25  |  GrammaticalForms  |  G_ART_DEF  |  Definite articles  |  Bestimmte Artikel
26  |  GrammaticalForms  |  G_ART_IND  |  Indefinite articles  |  Unbestimmte Artikel
27  |  GrammaticalForms  |  G_OTHER  |  Other parts of speech  |  Andere Wortarten
28  |  Inflection  |  IN_N_SG  |  Singular nouns  |  Substantive im Singular
29  |  Inflection  |  IN_N_PL  |  Plural nouns  |  Substantive im Plural
30  |  Inflection  |  IN_N_MS  |  Singular masculine nouns  |  Maskuline Substantive im Singular
31  |  Inflection  |  IN_N_FS  |  Singular feminine nouns  |  Feminine Substantive im Singular
32  |  Inflection  |  IN_N_NS  |  Singular neutral nouns  |  Neutrale Substantive im Singular
33  |  Inflection  |  IN_N_MP  |  Plural masculine nouns  |  Maskuline Substantive im Plural
34  |  Inflection  |  IN_N_FP  |  Feminine masculine nouns  |  Feminine Substantive im Plural
35  |  Inflection  |  IN_N_NP  |  Neutral masculine nouns  |  Neutrale Substantive im Plural
36  |  Inflection  |  IN_N_1NOM  |  Nouns in the nominative case  |  Nomen im Nominativ
37  |  Inflection  |  IN_N_2GEN  |  Nouns in the genitive case  |  Nomen im Genitiv
38  |  Inflection  |  IN_N_3DAT  |  Nouns in the dative case  |  Nomen im Dativ
39  |  Inflection  |  IN_N_4ACC  |  Nouns in the accusative case  |  Nomen im Accusativ
40  |  Inflection  |  IN_ADJ_POS  |  Adjectives in positive degree  |  Adjektive im Positiv
41  |  Inflection  |  IN_ADJ_CMP  |  Adjectives in comparative degree  |  Adjektive im Komparativ
42  |  Inflection  |  IN_ADJ_SUP  |  Adjectives in superlative degree  |  Adjektive im Superlativ
43  |  Inflection  |  IN_ADV_POS  |  Adverbs in positive degree  |  Adverben im Positiv
44  |  Inflection  |  IN_ADV_CMP  |  Adverbs in comparative degree  |  Adverben im Komparativ
45  |  Inflection  |  IN_ADV_SUP  |  Adverbs in superlative degree  |  Adverben im Superlativ
46  |  Inflection  |  IN_PRO_SG  |  Singular pronouns  |  Pronomen im Singular
47  |  Inflection  |  IN_PRO_PL  |  Plural pronouns  |  Pronomen im Plural
48  |  Inflection  |  IN_PRO_1NOM  |  Pronouns in the nominative case  |  Pronomen im Nominativ
49  |  Inflection  |  IN_PRO_2GEN  |  Pronouns in the genitive case  |  Pronomen im Genitiv
50  |  Inflection  |  IN_PRO_3DAT  |  Pronouns in the dative case  |  Pronomen im Dativ
51  |  Inflection  |  IN_PRO_4ACC  |  Pronouns in the accusative case  |  Pronomen im Accusativ
52  |  Inflection  |  IN_PRO_1SG  |  Personal pronouns in 1st person singular  |  Personalpronomen in der ersten Person Singular
53  |  Inflection  |  IN_PRO_2SG  |  Personal pronouns in 2nd person singular  |  Personalpronomen in der zweiten Person Singular
54  |  Inflection  |  IN_PRO_3SG  |  Personal pronouns in 3rd person singular  |  Personalpronomen in der dritten Person Singular
55  |  Inflection  |  IN_PRO_1PL  |  Personal pronouns in 1st person plural  |  Personalpronomen in der ersten Person Plural
56  |  Inflection  |  IN_PRO_2PL  |  Personal pronouns in 2nd person plural  |  Personalpronomen in der zweiten Person Plural
57  |  Inflection  |  IN_PRO_3PL  |  Personal pronouns in 3rd person plural  |  Personalpronomen in der dritten Person Plural
58  |  Inflection  |  IN_POSS_1SG  |  Possessive pronouns in 1st person singular  |  Possesivpronomen in der ersten Person Singular
59  |  Inflection  |  IN_POSS_2SG  |  Possessive pronouns in 2nd person singular  |  Possesivpronomen in der zweiten Person Singular
60  |  Inflection  |  IN_POSS_3SG  |  Possessive pronouns in 3rd person singular  |  Possesivpronomen in der dritten Person Singular
61  |  Inflection  |  IN_POSS_1PL  |  Possessive pronouns in 1st person plural  |  Possesivpronomen in der ersten Person Plural
62  |  Inflection  |  IN_POSS_2PL  |  Possessive pronouns in 2nd person plural  |  Possesivpronomen in der zweiten Person Plural
63  |  Inflection  |  IN_POSS_3PL  |  Possessive pronouns in 3rd person plural  |  Possesivpronomen in der dritten Person Plural
64  |  Inflection  |  IN_ART_SG  |  Singular determiners  |  Artikel im Singular
65  |  Inflection  |  IN_ART_PL  |  Plural determiners  |  Artikel im Plural
66  |  Inflection  |  IN_ART_DEF_SG  |  Singular definite articles  |  Bestimmte Artikel im Singular
67  |  Inflection  |  IN_ART_DEF_PL  |  Plural definite articles  |  Bestimmte Artikel im Plural
68  |  Inflection  |  IN_ART_IND_SG  |  Singular indefinite articles  |  Unbestimmte Artikel im Singular
69  |  Inflection  |  IN_ART_M  |  Masculine determiners  |  Maskuline Artikel
70  |  Inflection  |  IN_ART_F  |  Feminine determiners  |  Feminine Artikel
71  |  Inflection  |  IN_ART_N  |  Neutral determiners  |  Neutrale Artikel
72  |  Inflection  |  IN_ART_1NOM  |  Articles in the nominative case  |  Artikel im Nominativ
73  |  Inflection  |  IN_ART_2GEN  |  Articles in the genitive case  |  Artikel im Genitiv
74  |  Inflection  |  IN_ART_3DAT  |  Articles in the dative case  |  Artikel im Dativ
75  |  Inflection  |  IN_ART_4ACC  |  Articles in the accusative case  |  Artikel im Accusativ
76  |  Inflection  |  IN_ART_DEF_1NOM  |  Definite articles in the nominative case  |  Bestimmte Artikel im Nominativ
77  |  Inflection  |  IN_ART_DEF_2GEN  |  Definite articles in the genitive case  |  Bestimmte Artikel im Genitiv
78  |  Inflection  |  IN_ART_DEF_3DAT  |  Definite articles in the dative case  |  Bestimmte Artikel im Dativ
79  |  Inflection  |  IN_ART_DEF_4ACC  |  Definite articles in the accusative case  |  Bestimmte Artikel im Accusativ
80  |  Inflection  |  IN_ART_IND_1NOM  |  Indefinite articles in the nominative case  |  Unbestimmte Artikel im Nominativ
81  |  Inflection  |  IN_ART_IND_2GEN  |  Indefinite articles in the genitive case  |  Unbestimmte Artikel im Genitiv
82  |  Inflection  |  IN_ART_IND_3DAT  |  Indefinite articles in the nominative case  |  Unbestimmte Artikel im Nominativ
83  |  Inflection  |  IN_ART_IND_4ACC  |  Indefinite articles in the accusative case  |  Unbestimmte Artikel im Accusativ
84  |  Inflection  |  IN_V_FIN  |  Finite verbs  |  Finite Verben
85  |  Inflection  |  IN_V_AUX_FIN  |  Finite auxiliary verbs  |  Finite auxiliare Verben
86  |  Inflection  |  IN_V_AUX2_FIN  |  Finite auxiliary verbs  |  Finite Hilfsverben
87  |  Inflection  |  IN_V_AUX_MOD_FIN  |  Finite modal verbs  |  Finite Modalverben
88  |  Inflection  |  IN_V_INF  |  Infinitive verbs  |  Infinitive Verbformen
89  |  Inflection  |  IN_V_VVIZU  |  Infinitive verbs with 'zu'  |  Infinitiv mit 'zu'
90  |  Inflection  |  IN_V_PRES  |  Verbs in present tense  |  Verben im Präsens
91  |  Inflection  |  IN_V_1SG  |  First person singular verbs  |  Verben in der ersten Person Singular
92  |  Inflection  |  IN_V_2SG  |  Second person singular verbs  |  Verben in der zweiten Person Singular
93  |  Inflection  |  IN_V_3SG  |  Third person singular verbs  |  Verben in der dritten Person Singular
94  |  Inflection  |  IN_V_1PL  |  First person plural verbs  |  Verben in der ersten Person Plural
95  |  Inflection  |  IN_V_2PL  |  Second person plural verbs  |  Verben in der zweiten Person Plural
96  |  Inflection  |  IN_V_3PL  |  Third person plural verbs  |  Verben in der dritten Person Plural
97  |  Inflection  |  IN_V_PP  |  Participle Perfect Verbs  |  Partizip Perfekt Vollverben
98  |  Inflection  |  IN_V_PAST_IMP  |  Imperfect verbs  |  Imperfekt Verben
99  |  Inflection  |  IN_V_PAST_IMP2  |  Imperfect Verbs excluding Modals and Auxiliaries  |  Imperfekt Verben ohne Modale und Hilfsverben
100  |  Inflection  |  IN_V_PAST_IMP_AUX  |  Imperfect Modals and Auxiliaries  |  Imperfekt Modale und Hilfsverben
101  |  Inflection  |  IN_V_PAST_IMP_MOD  |  Participle Perfect Verbs  |  Imperfekt Modalverben
102  |  Inflection  |  IN_V_PERFEKT  |  Present perfect tense verb forms  |  Verben im Perfekt
103  |  Inflection  |  IN_V_PLUSQUAM  |  Past perfect tense verb forms  |  Verben im Plusquamperfekt
104  |  Inflection  |  IN_V_PAST  |  Verbs in past tense  |  Vergangenheitszeitformen
105  |  Inflection  |  IN_V_SUB  |  Verb forms in the subjunctive  |  Verbformen im Konjunktiv
106  |  Inflection  |  IN_V_PRES_SUB  |  Verb forms in the present subjunctive  |  Verbformen im Konjunktiv Präsens
107  |  Inflection  |  IN_V_PAST_SUB  |  Verb forms in the past subjunctive  |  Verbformen im Konjunktiv Imperfekt
108  |  Inflection  |  IN_V_PERF_SUB  |  Verb forms in the perfective subjunctive  |  Verbformen im Konjunktiv Perfekt
109  |  Inflection  |  IN_V_PAST_SUB_PLUSQ  |  Verb forms in the past subjunctive (Plusquamperfekt)  |  Verbformen im Konjunktiv Plusquamperfekt
110  |  Inflection  |  IN_V_KOND1  |  Verb forms in the conditional clause  |  Verbformen im Konditional I
111  |  Inflection  |  IN_V_KOND2  |  Verb forms in the conditional clause  |  Verbformen im Konditional II
112  |  Inflection  |  IN_V_FUT1  |  Verbs in future simple tense  |  Verbformen im Futur I
113  |  Inflection  |  IN_V_FUT2  |  Verbs in future perfect tense  |  Verbformen im Futur II
114  |  Inflection  |  IN_V_FUT  |  Verbs in future tense  |  Verbformen im Futur
115  |  Inflection  |  IN_V_PASS  |  Procesual passive verb forms  |  Verbformen im Vorgangspassiv
116  |  Inflection  |  IN_V_PASS_MOD  |  Passive voice with modal verbs  |  Passiv mit Modalverben
117  |  Syntactic  |  SY_ADJD  |  Adjectives  |  Adjektive und Adverbien im Prädikativ
118  |  Syntactic  |  SY_PTKA  |  Particles with adjective or adverb  |  Partikeln mit Adjektiv oder Adverb
119  |  Syntactic  |  SY_APPR  |  Adpositions, left of the noun  |  Präpositionen, links des Nomens
120  |  Syntactic  |  SY_APPRART  |  Adpositions with fused articles  |  Präpositionen mit verschmolzenen Artikeln
121  |  Syntactic  |  SY_APPO  |  Postpositions  |  Postpositionen
122  |  Syntactic  |  SY_DO  |  Dative object  |  Dativobjekt
123  |  Syntactic  |  SY_OA  |  Accusative object  |  Akkusativobjekt
124  |  Syntactic  |  SY_S_DE  |  Words in declarative sentences  |  Wörter in Aussagesätzen
125  |  Syntactic  |  SY_S_EX  |  Words in exclamatory sentences  |  Wörter in Ausrufesätzen
126  |  Syntactic  |  SY_S_INT  |  Words in interrogative sentences  |  Wörter in Fragesätzen
127  |  Syntactic  |  SY_S_NEG  |  Words in negative sentences  |  Wörter in negativen Sätzen
128  |  Syntactic  |  SY_S_INF  |  Words in infinitive-only sentences without finite verbs  |  Wörter in Infinitivsätzen ohne finiten Verben
129  |  Syntactic  |  SY_S_MAN  |  Words in sentences with the pronoun 'man'  |  Wörter in Man-Aussagen
130  |  Syntactic  |  SY_S_SUB  |  Words in subordinate sentences (excluding 'zu')  |  Wörter in subordinierten Sätzen (ohne 'zu')
131  |  Syntactic  |  SY_S_SUB_ZU  |  Words in subordinate sentences with 'zu'  |  Wörter in subordinierten Sätzen mit 'zu'
132  |  Syntactic  |  SY_S_KOKOM  |  Words in sentences with a comparative conjunction  |  Wörter in Sätzen mit einer komparativen Konjunktion
133  |  Syntactic  |  SY_S_COND1  |  Words in factual conditional sentences  |  Wörter in realen Konditionalsätzen
134  |  Syntactic  |  SY_S_COND2  |  Words in potential conditional sentences  |  Wörter in potentialen Konditionalsätzen
135  |  Syntactic  |  SY_S_COND3  |  Words in unreal conditional sentences  |  Wörter in irrealen Konditionalsätzen
136  |  Syntactic  |  SY_QUOT  |  Words in quotation marks  |  Wörter in Anführungszeichen
137  |  Punctuation  |  PUNCT_TOTAL  |  Total punctuation  |  Interpunktion
138  |  Punctuation  |  PUNCT_BI_NOUN  |  Punctuation following a noun  |  Interpunktion nach einem Nomen
139  |  Punctuation  |  PUNCT_BI_VERB  |  Punctuation following a verb  |  Interpunktion nach einem Verb
140  |  Lexical  |  L_NAME  |  Proper names  |  Eigennamen
141  |  Lexical  |  L_NAME_M  |  Masculine proper nouns  |  Männliche Eigennamen
142  |  Lexical  |  L_NAME_F  |  Feminine proper nouns  |  Weibliche Eigennamen
143  |  Lexical  |  L_GEOG  |  Place and geographical names  |  Nazwy miejsc i nazwy geograficzne
144  |  Lexical  |  L_ORG  |  Organization names  |  Organizationsnamen
145  |  Lexical  |  L_SYL_G1  |  One-syllable words  |  Ein-Silben-Wörter
146  |  Lexical  |  L_SYL_G2  |  Two-syllables words  |  Zwei-Silben-Wörter
147  |  Lexical  |  L_SYL_G3  |  Three-syllables words  |  Drei-Silben-Wörter
148  |  Lexical  |  L_SYL_G4  |  Four-syllables words  |  Vier-Silben-Wörter
149  |  Lexical  |  L_SYL_G5  |  Words formed of 5 or more syllables  |  Wörter mit 5 oder mehr Silben
150  |  Lexical  |  L_TTR_IA  |  Type-token ratio for non-lemmatized tokens  |  Type-token ratio für nicht-lemmatisierte Tokens
151  |  Lexical  |  L_TTR_LA  |  Type-token ratio for lemmatized tokens  |  Type-token ratio für lemmatisierte Tokens
152  |  Lexical  |  L_CONT_A  |  Incidence of content words  |  Inhaltswörter
153  |  Lexical  |  L_CONT_T  |  Content words types  |  Typen von Inhaltswörtern
154  |  Lexical  |  L_CONT_L  |  Content words lemma types  |  Lemmatypen von Inhaltswörtern
155  |  Lexical  |  L_FUNC_A  |  Incidence of function words  |  Funktionswörter
156  |  Lexical  |  L_FUNC_T  |  Function words types  |  Typen von Funktionswörtern
157  |  Lexical  |  L_FUNC_L  |  Function words lemma types  |  Lemmatypen von Funktionswörtern
158  |  Lexical  |  L_STOP  |  Incidence of stop words  |  Stoppwörter
159  |  Lexical  |  L_TCCT1  |  Tokens covering 1% of most common types  |  Tokens, die 1 % der häufigsten Typen abdecken
160  |  Lexical  |  L_TCCT5  |  Tokens covering 5% of most common types  |  Tokens, die 5% der häufigsten Typen abdecken
161  |  Descriptive  |  DESC_PRON_VOC  |  Direct address phrases with a pronoun  |  Direkte Anredephrasen mit einem Pronomen
162  |  Graphical  |  GR_UPPER  |  Capital letters  |  Großbuchstaben
163  |  Graphical  |  GR_EMOT  |  Emoticons  |  Emoticons
164  |  Graphical  |  GR_LENNY  |  Lenny faces  |  Lenny faces
165  |  Graphical  |  GR_MENTION  |  Direct mentions with @  |  Direkte Erwähnungen mit @
166  |  Graphical  |  GR_HASH  |  Hashtags  |  Hashtags
167  |  Graphical  |  GR_LINK  |  Hyperlinks  |  Hyperlinks

Above are the following (from left):

  • order number - metrics is a MetricGroup object, from which we can select individual metrics, or snippets, e.g. metrics[0], or metrics[10:20].

  • category - each metric is assigned to a subject category.

  • metrics code - this is a unique string for each metric displayed in the DataFrame.

  • name - the extended name of the metric.

Metrics can also be accessed in other ways, eg. we can choose all metrics from given category:

# check available categories
categories = sm.get_all_categories('de')
print(categories)
[GrammaticalForms, Inflection, Syntactic, Punctuation, Lexical, Descriptive, Graphical]
# choose category
category = categories[2]

# preview what metrics are available within this category
# this is the same DataFrame object as before and you can perform the same operations on it
category_metrics = category.get_metrics()
print(category_metrics)
0  |  Syntactic  |  SY_ADJD  |  Adjectives  |  Adjektive und Adverbien im Prädikativ
1  |  Syntactic  |  SY_PTKA  |  Particles with adjective or adverb  |  Partikeln mit Adjektiv oder Adverb
2  |  Syntactic  |  SY_APPR  |  Adpositions, left of the noun  |  Präpositionen, links des Nomens
3  |  Syntactic  |  SY_APPRART  |  Adpositions with fused articles  |  Präpositionen mit verschmolzenen Artikeln
4  |  Syntactic  |  SY_APPO  |  Postpositions  |  Postpositionen
5  |  Syntactic  |  SY_DO  |  Dative object  |  Dativobjekt
6  |  Syntactic  |  SY_OA  |  Accusative object  |  Akkusativobjekt
7  |  Syntactic  |  SY_S_DE  |  Words in declarative sentences  |  Wörter in Aussagesätzen
8  |  Syntactic  |  SY_S_EX  |  Words in exclamatory sentences  |  Wörter in Ausrufesätzen
9  |  Syntactic  |  SY_S_INT  |  Words in interrogative sentences  |  Wörter in Fragesätzen
10  |  Syntactic  |  SY_S_NEG  |  Words in negative sentences  |  Wörter in negativen Sätzen
11  |  Syntactic  |  SY_S_INF  |  Words in infinitive-only sentences without finite verbs  |  Wörter in Infinitivsätzen ohne finiten Verben
12  |  Syntactic  |  SY_S_MAN  |  Words in sentences with the pronoun 'man'  |  Wörter in Man-Aussagen
13  |  Syntactic  |  SY_S_SUB  |  Words in subordinate sentences (excluding 'zu')  |  Wörter in subordinierten Sätzen (ohne 'zu')
14  |  Syntactic  |  SY_S_SUB_ZU  |  Words in subordinate sentences with 'zu'  |  Wörter in subordinierten Sätzen mit 'zu'
15  |  Syntactic  |  SY_S_KOKOM  |  Words in sentences with a comparative conjunction  |  Wörter in Sätzen mit einer komparativen Konjunktion
16  |  Syntactic  |  SY_S_COND1  |  Words in factual conditional sentences  |  Wörter in realen Konditionalsätzen
17  |  Syntactic  |  SY_S_COND2  |  Words in potential conditional sentences  |  Wörter in potentialen Konditionalsätzen
18  |  Syntactic  |  SY_S_COND3  |  Words in unreal conditional sentences  |  Wörter in irrealen Konditionalsätzen
19  |  Syntactic  |  SY_QUOT  |  Words in quotation marks  |  Wörter in Anführungszeichen
# example subset of metrics for analysis
metrics_to_analyse = metrics[60:100]
print(metrics_to_analyse)
0  |  Inflection  |  IN_POSS_3SG  |  Possessive pronouns in 3rd person singular  |  Possesivpronomen in der dritten Person Singular
1  |  Inflection  |  IN_POSS_1PL  |  Possessive pronouns in 1st person plural  |  Possesivpronomen in der ersten Person Plural
2  |  Inflection  |  IN_POSS_2PL  |  Possessive pronouns in 2nd person plural  |  Possesivpronomen in der zweiten Person Plural
3  |  Inflection  |  IN_POSS_3PL  |  Possessive pronouns in 3rd person plural  |  Possesivpronomen in der dritten Person Plural
4  |  Inflection  |  IN_ART_SG  |  Singular determiners  |  Artikel im Singular
5  |  Inflection  |  IN_ART_PL  |  Plural determiners  |  Artikel im Plural
6  |  Inflection  |  IN_ART_DEF_SG  |  Singular definite articles  |  Bestimmte Artikel im Singular
7  |  Inflection  |  IN_ART_DEF_PL  |  Plural definite articles  |  Bestimmte Artikel im Plural
8  |  Inflection  |  IN_ART_IND_SG  |  Singular indefinite articles  |  Unbestimmte Artikel im Singular
9  |  Inflection  |  IN_ART_M  |  Masculine determiners  |  Maskuline Artikel
10  |  Inflection  |  IN_ART_F  |  Feminine determiners  |  Feminine Artikel
11  |  Inflection  |  IN_ART_N  |  Neutral determiners  |  Neutrale Artikel
12  |  Inflection  |  IN_ART_1NOM  |  Articles in the nominative case  |  Artikel im Nominativ
13  |  Inflection  |  IN_ART_2GEN  |  Articles in the genitive case  |  Artikel im Genitiv
14  |  Inflection  |  IN_ART_3DAT  |  Articles in the dative case  |  Artikel im Dativ
15  |  Inflection  |  IN_ART_4ACC  |  Articles in the accusative case  |  Artikel im Accusativ
16  |  Inflection  |  IN_ART_DEF_1NOM  |  Definite articles in the nominative case  |  Bestimmte Artikel im Nominativ
17  |  Inflection  |  IN_ART_DEF_2GEN  |  Definite articles in the genitive case  |  Bestimmte Artikel im Genitiv
18  |  Inflection  |  IN_ART_DEF_3DAT  |  Definite articles in the dative case  |  Bestimmte Artikel im Dativ
19  |  Inflection  |  IN_ART_DEF_4ACC  |  Definite articles in the accusative case  |  Bestimmte Artikel im Accusativ
20  |  Inflection  |  IN_ART_IND_1NOM  |  Indefinite articles in the nominative case  |  Unbestimmte Artikel im Nominativ
21  |  Inflection  |  IN_ART_IND_2GEN  |  Indefinite articles in the genitive case  |  Unbestimmte Artikel im Genitiv
22  |  Inflection  |  IN_ART_IND_3DAT  |  Indefinite articles in the nominative case  |  Unbestimmte Artikel im Nominativ
23  |  Inflection  |  IN_ART_IND_4ACC  |  Indefinite articles in the accusative case  |  Unbestimmte Artikel im Accusativ
24  |  Inflection  |  IN_V_FIN  |  Finite verbs  |  Finite Verben
25  |  Inflection  |  IN_V_AUX_FIN  |  Finite auxiliary verbs  |  Finite auxiliare Verben
26  |  Inflection  |  IN_V_AUX2_FIN  |  Finite auxiliary verbs  |  Finite Hilfsverben
27  |  Inflection  |  IN_V_AUX_MOD_FIN  |  Finite modal verbs  |  Finite Modalverben
28  |  Inflection  |  IN_V_INF  |  Infinitive verbs  |  Infinitive Verbformen
29  |  Inflection  |  IN_V_VVIZU  |  Infinitive verbs with 'zu'  |  Infinitiv mit 'zu'
30  |  Inflection  |  IN_V_PRES  |  Verbs in present tense  |  Verben im Präsens
31  |  Inflection  |  IN_V_1SG  |  First person singular verbs  |  Verben in der ersten Person Singular
32  |  Inflection  |  IN_V_2SG  |  Second person singular verbs  |  Verben in der zweiten Person Singular
33  |  Inflection  |  IN_V_3SG  |  Third person singular verbs  |  Verben in der dritten Person Singular
34  |  Inflection  |  IN_V_1PL  |  First person plural verbs  |  Verben in der ersten Person Plural
35  |  Inflection  |  IN_V_2PL  |  Second person plural verbs  |  Verben in der zweiten Person Plural
36  |  Inflection  |  IN_V_3PL  |  Third person plural verbs  |  Verben in der dritten Person Plural
37  |  Inflection  |  IN_V_PP  |  Participle Perfect Verbs  |  Partizip Perfekt Vollverben
38  |  Inflection  |  IN_V_PAST_IMP  |  Imperfect verbs  |  Imperfekt Verben
39  |  Inflection  |  IN_V_PAST_IMP2  |  Imperfect Verbs excluding Modals and Auxiliaries  |  Imperfekt Verben ohne Modale und Hilfsverben

Metrics / categories to use can be chosen directly as MetricGroup or defined as list of strings containing name of Category to include / exclude whole category of names of metrics to include/exclude them.

# choose metrics_to_analyse excluding syntatic using MetricGroup
stylo = sm.StyloMetrix('de', metrics=metrics_to_analyse, exceptions=category_metrics)
metrics = stylo.transform(texts)
metrics
100%|████████████████████████████████████████████████████████████████████████████████████████████| 4/4 [00:00<00:00, 73.24it/s]
text IN_POSS_3SG IN_POSS_1PL IN_POSS_2PL IN_POSS_3PL IN_ART_SG IN_ART_PL IN_ART_DEF_SG IN_ART_DEF_PL IN_ART_IND_SG ... IN_V_PRES IN_V_1SG IN_V_2SG IN_V_3SG IN_V_1PL IN_V_2PL IN_V_3PL IN_V_PP IN_V_PAST_IMP IN_V_PAST_IMP2
0 Im Rahmen von Forschungs und Lehre müssen Leis... 0.0 0.000000 0.0 0.0 0.105882 0.000000 0.047059 0.000000 0.058824 ... 0.070588 0.0 0.0 0.070588 0.0 0.0 0.011765 0.023529 0.0 0.0
1 Unser Prüfungsrecht verlangt, dass eine Leistu... 0.0 0.018519 0.0 0.0 0.055556 0.018519 0.037037 0.018519 0.018519 ... 0.055556 0.0 0.0 0.055556 0.0 0.0 0.000000 0.074074 0.0 0.0
2 In der Forschung und Lehre muss man wissen, we... 0.0 0.000000 0.0 0.0 0.111111 0.000000 0.069444 0.000000 0.041667 ... 0.097222 0.0 0.0 0.111111 0.0 0.0 0.000000 0.069444 0.0 0.0
3 Das Prüfungsrecht verlangt, dass man genau erk... 0.0 0.000000 0.0 0.0 0.111111 0.000000 0.095238 0.000000 0.015873 ... 0.095238 0.0 0.0 0.079365 0.0 0.0 0.015873 0.047619 0.0 0.0

4 rows × 41 columns

# choose metrics_to_analyse excluding syntatic using list of strings
# remember to provide names as list even if you are defining only one name to include/exclude
stylo = sm.StyloMetrix('de', metrics=["Syntactic", "VerbTenses"], 
                       exceptions=['VT_SHOULD_PROGRESSIVE'])
metrics = stylo.transform(texts)
metrics
100%|████████████████████████████████████████████████████████████████████████████████████████████| 4/4 [00:00<00:00, 76.76it/s]
text SY_ADJD SY_PTKA SY_APPR SY_APPRART SY_APPO SY_DO SY_OA SY_S_DE SY_S_EX ... SY_S_NEG SY_S_INF SY_S_MAN SY_S_SUB SY_S_SUB_ZU SY_S_KOKOM SY_S_COND1 SY_S_COND2 SY_S_COND3 SY_QUOT
0 Im Rahmen von Forschungs und Lehre müssen Leis... 0.047059 0.0 0.058824 0.023529 0.0 0.0 0.023529 0.976471 0.0 ... 0.0 0.000000 0.000000 0.694118 0.0 0.0 0.0 0.0 0.0 0.0
1 Unser Prüfungsrecht verlangt, dass eine Leistu... 0.074074 0.0 0.037037 0.000000 0.0 0.0 0.018519 1.000000 0.0 ... 0.0 0.240741 0.518519 0.481481 0.0 0.0 0.0 0.0 0.0 0.0
2 In der Forschung und Lehre muss man wissen, we... 0.013889 0.0 0.055556 0.000000 0.0 0.0 0.041667 1.000000 0.0 ... 0.0 0.000000 0.875000 0.666667 0.0 0.0 0.0 0.0 0.0 0.0
3 Das Prüfungsrecht verlangt, dass man genau erk... 0.031746 0.0 0.063492 0.000000 0.0 0.0 0.015873 1.000000 0.0 ... 0.0 0.000000 0.793651 0.476190 0.0 0.0 0.0 0.0 0.0 0.0

4 rows × 21 columns

  • You can calculate the value for a single metric (provided as a list)

  • or you can provide list of any available metrics (also provided as string), as well as groups of metrics

stylo = sm.StyloMetrix('de', metrics=[metrics_to_analyse[1]])
metrics = stylo.transform(texts)
metrics
100%|████████████████████████████████████████████████████████████████████████████████████████████| 4/4 [00:00<00:00, 36.90it/s]
text IN_POSS_1PL
0 Im Rahmen von Forschungs und Lehre müssen Leis... 0.000000
1 Unser Prüfungsrecht verlangt, dass eine Leistu... 0.018519
2 In der Forschung und Lehre muss man wissen, we... 0.000000
3 Das Prüfungsrecht verlangt, dass man genau erk... 0.000000
stylo = sm.StyloMetrix('de', metrics=['SY_SIMILE'])
metrics = stylo.transform(texts)
metrics
100%|████████████████████████████████████████████████████████████████████████████████████████████| 4/4 [00:00<00:00, 90.50it/s]
text
0 Im Rahmen von Forschungs und Lehre müssen Leis...
1 Unser Prüfungsrecht verlangt, dass eine Leistu...
2 In der Forschung und Lehre muss man wissen, we...
3 Das Prüfungsrecht verlangt, dass man genau erk...

Groups of metrics can be added (concatenated) as well as subtracted (remove some groups)

metrics = sm.get_all_metrics('de')
group1 = metrics[20:30]
group2 = metrics[50:70]
group3 = metrics[25:55]
final_group = group1 + group2 - group3
print(final_group)
0  |  GrammaticalForms  |  G_PRO_REFL  |  Reflexive pronouns  |  Reflexivpronomen
1  |  GrammaticalForms  |  G_PRO_REZ  |  Reciprocal pronouns  |  Reziprokpronomen
2  |  GrammaticalForms  |  G_PRO_UNPERS  |  Impersonal pronouns  |  Unpersönliche Pronomen
3  |  GrammaticalForms  |  G_PRO_ADV  |  Pronominal adverbs  |  Pronominaladverbien
4  |  GrammaticalForms  |  G_ART  |  Determiners  |  Artikel
5  |  Inflection  |  IN_PRO_1PL  |  Personal pronouns in 1st person plural  |  Personalpronomen in der ersten Person Plural
6  |  Inflection  |  IN_PRO_2PL  |  Personal pronouns in 2nd person plural  |  Personalpronomen in der zweiten Person Plural
7  |  Inflection  |  IN_PRO_3PL  |  Personal pronouns in 3rd person plural  |  Personalpronomen in der dritten Person Plural
8  |  Inflection  |  IN_POSS_1SG  |  Possessive pronouns in 1st person singular  |  Possesivpronomen in der ersten Person Singular
9  |  Inflection  |  IN_POSS_2SG  |  Possessive pronouns in 2nd person singular  |  Possesivpronomen in der zweiten Person Singular
10  |  Inflection  |  IN_POSS_3SG  |  Possessive pronouns in 3rd person singular  |  Possesivpronomen in der dritten Person Singular
11  |  Inflection  |  IN_POSS_1PL  |  Possessive pronouns in 1st person plural  |  Possesivpronomen in der ersten Person Plural
12  |  Inflection  |  IN_POSS_2PL  |  Possessive pronouns in 2nd person plural  |  Possesivpronomen in der zweiten Person Plural
13  |  Inflection  |  IN_POSS_3PL  |  Possessive pronouns in 3rd person plural  |  Possesivpronomen in der dritten Person Plural
14  |  Inflection  |  IN_ART_SG  |  Singular determiners  |  Artikel im Singular
15  |  Inflection  |  IN_ART_PL  |  Plural determiners  |  Artikel im Plural
16  |  Inflection  |  IN_ART_DEF_SG  |  Singular definite articles  |  Bestimmte Artikel im Singular
17  |  Inflection  |  IN_ART_DEF_PL  |  Plural definite articles  |  Bestimmte Artikel im Plural
18  |  Inflection  |  IN_ART_IND_SG  |  Singular indefinite articles  |  Unbestimmte Artikel im Singular
19  |  Inflection  |  IN_ART_M  |  Masculine determiners  |  Maskuline Artikel

4. Creating new metrics#

you can create new metric using the inheritance mechanism of the Metric class. Note that the category, and name_en and name_local fields are required for proper operation. The category object must then be retrieved from the list available via sm.get_all_categories() or a new one must be created. The counting method itself is implemented in the count(doc) method.

categories = sm.get_all_categories('de')
category = categories[5]

class SAMPL2(sm.Metric):
    category = category
    name_en = "abc"
    name_local = "abc"
    
    def count(doc):
        result = 0.1
        debug = [doc[2], doc[3], doc[4]]
        return result, debug
# create a new category - indicate the language to which the category belongs.
# (same as with get_all_metrics(), etc.).

class C1(sm.Category):
    lang = 'de'        # define language
    name_en = "C1"     # name in enslish
    name_local = "C1"  # local name

    
class SAMPL3(sm.Metric):
    category = C1
    name_en = "abc"
    name_local = "abc"
    
    def count(doc):
        result = 0.99
        debug = [doc[9], doc[0], doc[1]]
        return result, debug

The created metrics are automatically saved after calling the code in which they are defined. So after calling the above cells we are already able to use them. This can be seen by looking at all available metrics.

print(sm.get_all_metrics('de'))
0  |  GrammaticalForms  |  G_N  |  Nouns  |  Substantive
1  |  GrammaticalForms  |  G_ADJ  |  Adjectives  |  Adjektive
2  |  GrammaticalForms  |  G_ADV  |  Adverbs  |  Adverbien
3  |  GrammaticalForms  |  G_V  |  Verbs  |  Verben
4  |  GrammaticalForms  |  G_VMOD  |  Modal verbs  |  Modalverben
5  |  GrammaticalForms  |  G_NUM  |  Numerals  |  Numerale
6  |  GrammaticalForms  |  G_PART  |  Particles  |  Partikeln
7  |  GrammaticalForms  |  G_ADP  |  Adpositions  |  Präpositionen
8  |  GrammaticalForms  |  G_CONJ  |  Conjunctions  |  Konjunktionen
9  |  GrammaticalForms  |  G_CCONJ  |  Coordinating conjunctions  |  Koordinierende Konjunktionen
10  |  GrammaticalForms  |  G_SCONJ  |  Conjunctions  |  Subordinierende Konjunktionen
11  |  GrammaticalForms  |  G_PRO  |  Pronouns  |  Pronomen
12  |  GrammaticalForms  |  G_PRO_PRS  |  Personal pronouns  |  Personalpronomen
13  |  GrammaticalForms  |  G_PRO_DEM  |  Demonstrative pronouns  |  Demonstrativpronomen
14  |  GrammaticalForms  |  G_PRO_IND  |  Indefinite pronouns  |  Indefinitpronomen
15  |  GrammaticalForms  |  G_PRO_PIS  |  Substitutive indefinite pronouns  |  Ersetzende Indefinitpronomen
16  |  GrammaticalForms  |  G_PRO_PIAT  |  Attributive indefinite pronouns  |  Attributive Indefinitpronomen
17  |  GrammaticalForms  |  G_PRO_POS  |  Possessive pronouns  |  Possessivpronomen
18  |  GrammaticalForms  |  G_PRO_INT  |  Interrogative pronouns  |  Interrogativpronomen
19  |  GrammaticalForms  |  G_PRO_REL  |  Relative pronouns  |  Relativpronomen
20  |  GrammaticalForms  |  G_PRO_REFL  |  Reflexive pronouns  |  Reflexivpronomen
21  |  GrammaticalForms  |  G_PRO_REZ  |  Reciprocal pronouns  |  Reziprokpronomen
22  |  GrammaticalForms  |  G_PRO_UNPERS  |  Impersonal pronouns  |  Unpersönliche Pronomen
23  |  GrammaticalForms  |  G_PRO_ADV  |  Pronominal adverbs  |  Pronominaladverbien
24  |  GrammaticalForms  |  G_ART  |  Determiners  |  Artikel
25  |  GrammaticalForms  |  G_ART_DEF  |  Definite articles  |  Bestimmte Artikel
26  |  GrammaticalForms  |  G_ART_IND  |  Indefinite articles  |  Unbestimmte Artikel
27  |  GrammaticalForms  |  G_OTHER  |  Other parts of speech  |  Andere Wortarten
28  |  Inflection  |  IN_N_SG  |  Singular nouns  |  Substantive im Singular
29  |  Inflection  |  IN_N_PL  |  Plural nouns  |  Substantive im Plural
30  |  Inflection  |  IN_N_MS  |  Singular masculine nouns  |  Maskuline Substantive im Singular
31  |  Inflection  |  IN_N_FS  |  Singular feminine nouns  |  Feminine Substantive im Singular
32  |  Inflection  |  IN_N_NS  |  Singular neutral nouns  |  Neutrale Substantive im Singular
33  |  Inflection  |  IN_N_MP  |  Plural masculine nouns  |  Maskuline Substantive im Plural
34  |  Inflection  |  IN_N_FP  |  Feminine masculine nouns  |  Feminine Substantive im Plural
35  |  Inflection  |  IN_N_NP  |  Neutral masculine nouns  |  Neutrale Substantive im Plural
36  |  Inflection  |  IN_N_1NOM  |  Nouns in the nominative case  |  Nomen im Nominativ
37  |  Inflection  |  IN_N_2GEN  |  Nouns in the genitive case  |  Nomen im Genitiv
38  |  Inflection  |  IN_N_3DAT  |  Nouns in the dative case  |  Nomen im Dativ
39  |  Inflection  |  IN_N_4ACC  |  Nouns in the accusative case  |  Nomen im Accusativ
40  |  Inflection  |  IN_ADJ_POS  |  Adjectives in positive degree  |  Adjektive im Positiv
41  |  Inflection  |  IN_ADJ_CMP  |  Adjectives in comparative degree  |  Adjektive im Komparativ
42  |  Inflection  |  IN_ADJ_SUP  |  Adjectives in superlative degree  |  Adjektive im Superlativ
43  |  Inflection  |  IN_ADV_POS  |  Adverbs in positive degree  |  Adverben im Positiv
44  |  Inflection  |  IN_ADV_CMP  |  Adverbs in comparative degree  |  Adverben im Komparativ
45  |  Inflection  |  IN_ADV_SUP  |  Adverbs in superlative degree  |  Adverben im Superlativ
46  |  Inflection  |  IN_PRO_SG  |  Singular pronouns  |  Pronomen im Singular
47  |  Inflection  |  IN_PRO_PL  |  Plural pronouns  |  Pronomen im Plural
48  |  Inflection  |  IN_PRO_1NOM  |  Pronouns in the nominative case  |  Pronomen im Nominativ
49  |  Inflection  |  IN_PRO_2GEN  |  Pronouns in the genitive case  |  Pronomen im Genitiv
50  |  Inflection  |  IN_PRO_3DAT  |  Pronouns in the dative case  |  Pronomen im Dativ
51  |  Inflection  |  IN_PRO_4ACC  |  Pronouns in the accusative case  |  Pronomen im Accusativ
52  |  Inflection  |  IN_PRO_1SG  |  Personal pronouns in 1st person singular  |  Personalpronomen in der ersten Person Singular
53  |  Inflection  |  IN_PRO_2SG  |  Personal pronouns in 2nd person singular  |  Personalpronomen in der zweiten Person Singular
54  |  Inflection  |  IN_PRO_3SG  |  Personal pronouns in 3rd person singular  |  Personalpronomen in der dritten Person Singular
55  |  Inflection  |  IN_PRO_1PL  |  Personal pronouns in 1st person plural  |  Personalpronomen in der ersten Person Plural
56  |  Inflection  |  IN_PRO_2PL  |  Personal pronouns in 2nd person plural  |  Personalpronomen in der zweiten Person Plural
57  |  Inflection  |  IN_PRO_3PL  |  Personal pronouns in 3rd person plural  |  Personalpronomen in der dritten Person Plural
58  |  Inflection  |  IN_POSS_1SG  |  Possessive pronouns in 1st person singular  |  Possesivpronomen in der ersten Person Singular
59  |  Inflection  |  IN_POSS_2SG  |  Possessive pronouns in 2nd person singular  |  Possesivpronomen in der zweiten Person Singular
60  |  Inflection  |  IN_POSS_3SG  |  Possessive pronouns in 3rd person singular  |  Possesivpronomen in der dritten Person Singular
61  |  Inflection  |  IN_POSS_1PL  |  Possessive pronouns in 1st person plural  |  Possesivpronomen in der ersten Person Plural
62  |  Inflection  |  IN_POSS_2PL  |  Possessive pronouns in 2nd person plural  |  Possesivpronomen in der zweiten Person Plural
63  |  Inflection  |  IN_POSS_3PL  |  Possessive pronouns in 3rd person plural  |  Possesivpronomen in der dritten Person Plural
64  |  Inflection  |  IN_ART_SG  |  Singular determiners  |  Artikel im Singular
65  |  Inflection  |  IN_ART_PL  |  Plural determiners  |  Artikel im Plural
66  |  Inflection  |  IN_ART_DEF_SG  |  Singular definite articles  |  Bestimmte Artikel im Singular
67  |  Inflection  |  IN_ART_DEF_PL  |  Plural definite articles  |  Bestimmte Artikel im Plural
68  |  Inflection  |  IN_ART_IND_SG  |  Singular indefinite articles  |  Unbestimmte Artikel im Singular
69  |  Inflection  |  IN_ART_M  |  Masculine determiners  |  Maskuline Artikel
70  |  Inflection  |  IN_ART_F  |  Feminine determiners  |  Feminine Artikel
71  |  Inflection  |  IN_ART_N  |  Neutral determiners  |  Neutrale Artikel
72  |  Inflection  |  IN_ART_1NOM  |  Articles in the nominative case  |  Artikel im Nominativ
73  |  Inflection  |  IN_ART_2GEN  |  Articles in the genitive case  |  Artikel im Genitiv
74  |  Inflection  |  IN_ART_3DAT  |  Articles in the dative case  |  Artikel im Dativ
75  |  Inflection  |  IN_ART_4ACC  |  Articles in the accusative case  |  Artikel im Accusativ
76  |  Inflection  |  IN_ART_DEF_1NOM  |  Definite articles in the nominative case  |  Bestimmte Artikel im Nominativ
77  |  Inflection  |  IN_ART_DEF_2GEN  |  Definite articles in the genitive case  |  Bestimmte Artikel im Genitiv
78  |  Inflection  |  IN_ART_DEF_3DAT  |  Definite articles in the dative case  |  Bestimmte Artikel im Dativ
79  |  Inflection  |  IN_ART_DEF_4ACC  |  Definite articles in the accusative case  |  Bestimmte Artikel im Accusativ
80  |  Inflection  |  IN_ART_IND_1NOM  |  Indefinite articles in the nominative case  |  Unbestimmte Artikel im Nominativ
81  |  Inflection  |  IN_ART_IND_2GEN  |  Indefinite articles in the genitive case  |  Unbestimmte Artikel im Genitiv
82  |  Inflection  |  IN_ART_IND_3DAT  |  Indefinite articles in the nominative case  |  Unbestimmte Artikel im Nominativ
83  |  Inflection  |  IN_ART_IND_4ACC  |  Indefinite articles in the accusative case  |  Unbestimmte Artikel im Accusativ
84  |  Inflection  |  IN_V_FIN  |  Finite verbs  |  Finite Verben
85  |  Inflection  |  IN_V_AUX_FIN  |  Finite auxiliary verbs  |  Finite auxiliare Verben
86  |  Inflection  |  IN_V_AUX2_FIN  |  Finite auxiliary verbs  |  Finite Hilfsverben
87  |  Inflection  |  IN_V_AUX_MOD_FIN  |  Finite modal verbs  |  Finite Modalverben
88  |  Inflection  |  IN_V_INF  |  Infinitive verbs  |  Infinitive Verbformen
89  |  Inflection  |  IN_V_VVIZU  |  Infinitive verbs with 'zu'  |  Infinitiv mit 'zu'
90  |  Inflection  |  IN_V_PRES  |  Verbs in present tense  |  Verben im Präsens
91  |  Inflection  |  IN_V_1SG  |  First person singular verbs  |  Verben in der ersten Person Singular
92  |  Inflection  |  IN_V_2SG  |  Second person singular verbs  |  Verben in der zweiten Person Singular
93  |  Inflection  |  IN_V_3SG  |  Third person singular verbs  |  Verben in der dritten Person Singular
94  |  Inflection  |  IN_V_1PL  |  First person plural verbs  |  Verben in der ersten Person Plural
95  |  Inflection  |  IN_V_2PL  |  Second person plural verbs  |  Verben in der zweiten Person Plural
96  |  Inflection  |  IN_V_3PL  |  Third person plural verbs  |  Verben in der dritten Person Plural
97  |  Inflection  |  IN_V_PP  |  Participle Perfect Verbs  |  Partizip Perfekt Vollverben
98  |  Inflection  |  IN_V_PAST_IMP  |  Imperfect verbs  |  Imperfekt Verben
99  |  Inflection  |  IN_V_PAST_IMP2  |  Imperfect Verbs excluding Modals and Auxiliaries  |  Imperfekt Verben ohne Modale und Hilfsverben
100  |  Inflection  |  IN_V_PAST_IMP_AUX  |  Imperfect Modals and Auxiliaries  |  Imperfekt Modale und Hilfsverben
101  |  Inflection  |  IN_V_PAST_IMP_MOD  |  Participle Perfect Verbs  |  Imperfekt Modalverben
102  |  Inflection  |  IN_V_PERFEKT  |  Present perfect tense verb forms  |  Verben im Perfekt
103  |  Inflection  |  IN_V_PLUSQUAM  |  Past perfect tense verb forms  |  Verben im Plusquamperfekt
104  |  Inflection  |  IN_V_PAST  |  Verbs in past tense  |  Vergangenheitszeitformen
105  |  Inflection  |  IN_V_SUB  |  Verb forms in the subjunctive  |  Verbformen im Konjunktiv
106  |  Inflection  |  IN_V_PRES_SUB  |  Verb forms in the present subjunctive  |  Verbformen im Konjunktiv Präsens
107  |  Inflection  |  IN_V_PAST_SUB  |  Verb forms in the past subjunctive  |  Verbformen im Konjunktiv Imperfekt
108  |  Inflection  |  IN_V_PERF_SUB  |  Verb forms in the perfective subjunctive  |  Verbformen im Konjunktiv Perfekt
109  |  Inflection  |  IN_V_PAST_SUB_PLUSQ  |  Verb forms in the past subjunctive (Plusquamperfekt)  |  Verbformen im Konjunktiv Plusquamperfekt
110  |  Inflection  |  IN_V_KOND1  |  Verb forms in the conditional clause  |  Verbformen im Konditional I
111  |  Inflection  |  IN_V_KOND2  |  Verb forms in the conditional clause  |  Verbformen im Konditional II
112  |  Inflection  |  IN_V_FUT1  |  Verbs in future simple tense  |  Verbformen im Futur I
113  |  Inflection  |  IN_V_FUT2  |  Verbs in future perfect tense  |  Verbformen im Futur II
114  |  Inflection  |  IN_V_FUT  |  Verbs in future tense  |  Verbformen im Futur
115  |  Inflection  |  IN_V_PASS  |  Procesual passive verb forms  |  Verbformen im Vorgangspassiv
116  |  Inflection  |  IN_V_PASS_MOD  |  Passive voice with modal verbs  |  Passiv mit Modalverben
117  |  Syntactic  |  SY_ADJD  |  Adjectives  |  Adjektive und Adverbien im Prädikativ
118  |  Syntactic  |  SY_PTKA  |  Particles with adjective or adverb  |  Partikeln mit Adjektiv oder Adverb
119  |  Syntactic  |  SY_APPR  |  Adpositions, left of the noun  |  Präpositionen, links des Nomens
120  |  Syntactic  |  SY_APPRART  |  Adpositions with fused articles  |  Präpositionen mit verschmolzenen Artikeln
121  |  Syntactic  |  SY_APPO  |  Postpositions  |  Postpositionen
122  |  Syntactic  |  SY_DO  |  Dative object  |  Dativobjekt
123  |  Syntactic  |  SY_OA  |  Accusative object  |  Akkusativobjekt
124  |  Syntactic  |  SY_S_DE  |  Words in declarative sentences  |  Wörter in Aussagesätzen
125  |  Syntactic  |  SY_S_EX  |  Words in exclamatory sentences  |  Wörter in Ausrufesätzen
126  |  Syntactic  |  SY_S_INT  |  Words in interrogative sentences  |  Wörter in Fragesätzen
127  |  Syntactic  |  SY_S_NEG  |  Words in negative sentences  |  Wörter in negativen Sätzen
128  |  Syntactic  |  SY_S_INF  |  Words in infinitive-only sentences without finite verbs  |  Wörter in Infinitivsätzen ohne finiten Verben
129  |  Syntactic  |  SY_S_MAN  |  Words in sentences with the pronoun 'man'  |  Wörter in Man-Aussagen
130  |  Syntactic  |  SY_S_SUB  |  Words in subordinate sentences (excluding 'zu')  |  Wörter in subordinierten Sätzen (ohne 'zu')
131  |  Syntactic  |  SY_S_SUB_ZU  |  Words in subordinate sentences with 'zu'  |  Wörter in subordinierten Sätzen mit 'zu'
132  |  Syntactic  |  SY_S_KOKOM  |  Words in sentences with a comparative conjunction  |  Wörter in Sätzen mit einer komparativen Konjunktion
133  |  Syntactic  |  SY_S_COND1  |  Words in factual conditional sentences  |  Wörter in realen Konditionalsätzen
134  |  Syntactic  |  SY_S_COND2  |  Words in potential conditional sentences  |  Wörter in potentialen Konditionalsätzen
135  |  Syntactic  |  SY_S_COND3  |  Words in unreal conditional sentences  |  Wörter in irrealen Konditionalsätzen
136  |  Syntactic  |  SY_QUOT  |  Words in quotation marks  |  Wörter in Anführungszeichen
137  |  Punctuation  |  PUNCT_TOTAL  |  Total punctuation  |  Interpunktion
138  |  Punctuation  |  PUNCT_BI_NOUN  |  Punctuation following a noun  |  Interpunktion nach einem Nomen
139  |  Punctuation  |  PUNCT_BI_VERB  |  Punctuation following a verb  |  Interpunktion nach einem Verb
140  |  Lexical  |  L_NAME  |  Proper names  |  Eigennamen
141  |  Lexical  |  L_NAME_M  |  Masculine proper nouns  |  Männliche Eigennamen
142  |  Lexical  |  L_NAME_F  |  Feminine proper nouns  |  Weibliche Eigennamen
143  |  Lexical  |  L_GEOG  |  Place and geographical names  |  Nazwy miejsc i nazwy geograficzne
144  |  Lexical  |  L_ORG  |  Organization names  |  Organizationsnamen
145  |  Lexical  |  L_SYL_G1  |  One-syllable words  |  Ein-Silben-Wörter
146  |  Lexical  |  L_SYL_G2  |  Two-syllables words  |  Zwei-Silben-Wörter
147  |  Lexical  |  L_SYL_G3  |  Three-syllables words  |  Drei-Silben-Wörter
148  |  Lexical  |  L_SYL_G4  |  Four-syllables words  |  Vier-Silben-Wörter
149  |  Lexical  |  L_SYL_G5  |  Words formed of 5 or more syllables  |  Wörter mit 5 oder mehr Silben
150  |  Lexical  |  L_TTR_IA  |  Type-token ratio for non-lemmatized tokens  |  Type-token ratio für nicht-lemmatisierte Tokens
151  |  Lexical  |  L_TTR_LA  |  Type-token ratio for lemmatized tokens  |  Type-token ratio für lemmatisierte Tokens
152  |  Lexical  |  L_CONT_A  |  Incidence of content words  |  Inhaltswörter
153  |  Lexical  |  L_CONT_T  |  Content words types  |  Typen von Inhaltswörtern
154  |  Lexical  |  L_CONT_L  |  Content words lemma types  |  Lemmatypen von Inhaltswörtern
155  |  Lexical  |  L_FUNC_A  |  Incidence of function words  |  Funktionswörter
156  |  Lexical  |  L_FUNC_T  |  Function words types  |  Typen von Funktionswörtern
157  |  Lexical  |  L_FUNC_L  |  Function words lemma types  |  Lemmatypen von Funktionswörtern
158  |  Lexical  |  L_STOP  |  Incidence of stop words  |  Stoppwörter
159  |  Lexical  |  L_TCCT1  |  Tokens covering 1% of most common types  |  Tokens, die 1 % der häufigsten Typen abdecken
160  |  Lexical  |  L_TCCT5  |  Tokens covering 5% of most common types  |  Tokens, die 5% der häufigsten Typen abdecken
161  |  Descriptive  |  DESC_PRON_VOC  |  Direct address phrases with a pronoun  |  Direkte Anredephrasen mit einem Pronomen
162  |  Graphical  |  GR_UPPER  |  Capital letters  |  Großbuchstaben
163  |  Graphical  |  GR_EMOT  |  Emoticons  |  Emoticons
164  |  Graphical  |  GR_LENNY  |  Lenny faces  |  Lenny faces
165  |  Graphical  |  GR_MENTION  |  Direct mentions with @  |  Direkte Erwähnungen mit @
166  |  Graphical  |  GR_HASH  |  Hashtags  |  Hashtags
167  |  Graphical  |  GR_LINK  |  Hyperlinks  |  Hyperlinks
168  |  Descriptive  |  SAMPL2  |  abc  |  abc
169  |  C1  |  SAMPL3  |  abc  |  abc