Corpus info
General stats
| Statistic | Value |
|---|---|
| Tokens | 967592 |
| Words | 866485 |
| Types | 23151 |
| Lemmas | 15531 |
| Hapax legomena | 10826 |
| Dis legomena | 3480 |
| POS tags | 496 |

Documents
| Statistic | Value |
|---|---|
| Number of documents | 724 |
| Average (tokens per document) | 1336 |
| Median (tokens per document) | 986 |
| Longest document (tokens) | 15663 |
| Shortest document (tokens) | 108 |
| Oldest document (year) | 1600 |
| Most recent document (year) | 1896 |

Group by part of speech
| Main POS tag | N | % |
|---|---|---|
| common noun | 213693 | 22.09 |
| preposition | 156238 | 16.15 |
| determiner | 106127 | 10.97 |
| punctuation | 101107 | 10.45 |
| verb | 91589 | 9.47 |
| numeral | 72124 | 7.45 |
| conjunction | 70434 | 7.28 |
| adjective | 45429 | 4.70 |
| pronoun | 38074 | 3.93 |
| proper noun | 34089 | 3.52 |
| adverb | 26916 | 2.78 |
| untagged | 11403 | 1.18 |
| foreign word | 272 | 0.03 |
| interjection | 97 | 0.01 |
| Total | 967592 | 100.00 |

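The percentage column can be reproduced from the raw counts. The following is a minimal sketch, assuming each share is the count divided by the token total and rounded to two decimals; the names `POS_COUNTS` and `share` are illustrative and not part of the corpus tooling. It also checks an observation that the figures support but the page does not state: the Words figure equals Tokens minus the punctuation count.

```python
# Counts copied from the "General stats" and "Group by part of speech" tables.
TOKENS = 967592
WORDS = 866485

POS_COUNTS = {
    "common noun": 213693,
    "preposition": 156238,
    "determiner": 106127,
    "punctuation": 101107,
    "verb": 91589,
    "numeral": 72124,
    "conjunction": 70434,
    "adjective": 45429,
    "pronoun": 38074,
    "proper noun": 34089,
    "adverb": 26916,
    "untagged": 11403,
    "foreign word": 272,
    "interjection": 97,
}


def share(count, total=TOKENS):
    """Return count as a percentage of total, rounded to two decimals."""
    return round(100 * count / total, 2)


# Percentage share per main POS tag (assumed rounding: two decimals).
shares = {tag: share(n) for tag, n in POS_COUNTS.items()}

# Tokens that are not punctuation; this matches the reported Words figure.
words_check = TOKENS - POS_COUNTS["punctuation"]

for tag, pct in shares.items():
    print(f"{tag}: {pct:.2f}")
print("tokens minus punctuation:", words_check)
```

The same `share` helper applies unchanged to the project, text type, century, province and institution tables, since they all use the token total as the denominator.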
Group by project
| Project | N | % |
|---|---|---|
| VIVE | 439152 | 45.39 |
| HISPATESD | 192917 | 19.94 |
| ALEAO18 | 145405 | 15.03 |
| ALEA18 | 110727 | 11.44 |
| ALEAO19 | 79391 | 8.21 |
| Total | 967592 | 100.00 |

Group by text type
| Text type | N | % |
|---|---|---|
| inventory of goods | 698910 | 72.23 |
| witness statement | 192917 | 19.94 |
| medical certificate | 71354 | 7.37 |
| other | 4411 | 0.46 |
| Total | 967592 | 100.00 |

Group by century
| Century | N | % |
|---|---|---|
| XVIII | 578224 | 59.76 |
| XVII | 270685 | 27.98 |
| XIX | 118683 | 12.27 |
| Total | 967592 | 100.00 |

Group by province
| Province | N | % |
|---|---|---|
| Granada | 167512 | 17.31 |
| Jaén | 113009 | 11.68 |
| Almería | 112077 | 11.58 |
| Badajoz | 110973 | 11.47 |
| Madrid | 75183 | 7.77 |
| Cádiz | 72339 | 7.48 |
| Málaga | 69754 | 7.21 |
| Burgos | 68793 | 7.11 |
| Cáceres | 61248 | 6.33 |
| Sevilla | 33045 | 3.42 |
| Huelva | 27320 | 2.82 |
| Murcia | 14863 | 1.54 |
| Valladolid | 12732 | 1.32 |
| La Rioja | 5349 | 0.55 |
| Cantabria | 4944 | 0.51 |
| Toledo | 3982 | 0.41 |
| Palencia | 3785 | 0.39 |
| Zamora | 2150 | 0.22 |
| Navarra | 2011 | 0.21 |
| Álava | 1680 | 0.17 |
| Soria | 1444 | 0.15 |
| León | 1380 | 0.14 |
| Córdoba | 883 | 0.09 |
| Gipuzkoa | 605 | 0.06 |
| Teruel | 293 | 0.03 |
| Salamanca | 238 | 0.02 |
| Total | 967592 | 100.00 |

Group by institution
| Institution | N | % |
|---|---|---|
| Archivo de la Real Chancillería de Granada | 237436 | 24.54 |
| Archivo Histórico Provincial de Badajoz | 107981 | 11.16 |
| Archivo Histórico Provincial de Jaén | 105466 | 10.90 |
| Archivo Histórico de Protocolos de Madrid | 73290 | 7.57 |
| Archivo Histórico Provincial de Burgos | 66656 | 6.89 |
| Archivo Histórico Provincial de Almería | 63774 | 6.59 |
| Archivo Histórico Provincial de Cáceres | 59902 | 6.19 |
| Archivo Histórico Provincial de Cádiz | 51101 | 5.28 |
| Archivo Histórico de Protocolos de Granada | 49570 | 5.12 |
| Archivo de la Real Chancillería de Valladolid | 44590 | 4.61 |
| Archivo Histórico Provincial de Huelva | 26573 | 2.75 |
| Archivo Histórico Provincial de Sevilla | 24702 | 2.55 |
| Archivo Histórico Municipal de Lorca | 14863 | 1.54 |
| Archivo Municipal de Puerto Real | 14253 | 1.47 |
| Archivo Histórico Provincial de Málaga | 13521 | 1.40 |
| Archivo Histórico Municipal de Baeza | 6443 | 0.67 |
| Archivo Municipal de Vera | 5807 | 0.60 |
| Archivo Histórico Provincial de Córdoba | 883 | 0.09 |
| Archivo Histórico Municipal de Loja | 781 | 0.08 |
| Total | 967592 | 100.00 |

Group by century and province (absolute frequencies)
| Province | XV | XVI | XVII | XVIII | XIX | Total (province) | Total (area) |
|---|---|---|---|---|---|---|---|
| Almería | | | 26626 | 47667 | 37784 | 112077 | 392598 |
| Granada | | | 51452 | 113431 | 2629 | 167512 | |
| Jaén | | | | 109275 | 3734 | 113009 | |
| Málaga | | | 26420 | 33732 | 9602 | 69754 | 70637 |
| Córdoba | | | | | 883 | 883 | |
| Cádiz | | | | 70676 | 1663 | 72339 | 132704 |
| Sevilla | | | 168 | 32877 | | 33045 | |
| Huelva | | | | 27320 | | 27320 | |
| Madrid | | | | 36074 | 39109 | 75183 | 143976 |
| Burgos | | | | 67453 | 1340 | 68793 | |
| others | | | 166019 | 39719 | 21939 | 227677 | 227677 |
| Total (century) | 0 | 0 | 270685 | 578224 | 118683 | 967592 | 967592 |

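The Total (area) column holds one value per group of consecutive provinces rather than one per row. The grouping is not labelled on this page, so the sketch below infers it from the sums: each area total equals the sum of the province totals from the row that carries it down to the row before the next area total. The `groups` list is therefore an assumption, and the variable names are illustrative.

```python
# Province totals copied from the "Total (province)" column.
province_totals = {
    "Almería": 112077,
    "Granada": 167512,
    "Jaén": 113009,
    "Málaga": 69754,
    "Córdoba": 883,
    "Cádiz": 72339,
    "Sevilla": 33045,
    "Huelva": 27320,
    "Madrid": 75183,
    "Burgos": 68793,
    "others": 227677,
}

# Assumed grouping, inferred from which row carries each area total.
groups = [
    ("Almería", "Granada", "Jaén"),
    ("Málaga", "Córdoba"),
    ("Cádiz", "Sevilla", "Huelva"),
    ("Madrid", "Burgos"),
    ("others",),
]

# One area total per group: the sum of its provinces' totals.
area_totals = [sum(province_totals[p] for p in group) for group in groups]

for group, total in zip(groups, area_totals):
    print(" + ".join(group), "=", total)
```

Under this grouping the five area totals match the values in the table and add up to the corpus total of 967592 tokens.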
Group by century and province (relative frequencies)
| Province | XV | XVI | XVII | XVIII | XIX | Total (province) | Total (area) |
|---|---|---|---|---|---|---|---|
| Almería | 0.00 | 0.00 | 2.75 | 4.93 | 3.90 | 11.58 | 40.57 |
| Granada | 0.00 | 0.00 | 5.32 | 11.72 | 0.27 | 17.31 | |
| Jaén | 0.00 | 0.00 | 0.00 | 11.29 | 0.39 | 11.68 | |
| Málaga | 0.00 | 0.00 | 2.73 | 3.49 | 0.99 | 7.21 | 7.30 |
| Córdoba | 0.00 | 0.00 | 0.00 | 0.00 | 0.09 | 0.09 | |
| Cádiz | 0.00 | 0.00 | 0.00 | 7.30 | 0.17 | 7.48 | 13.71 |
| Sevilla | 0.00 | 0.00 | 0.02 | 3.40 | 0.00 | 3.42 | |
| Huelva | 0.00 | 0.00 | 0.00 | 2.82 | 0.00 | 2.82 | |
| Madrid | 0.00 | 0.00 | 0.00 | 3.73 | 4.04 | 7.77 | 14.88 |
| Burgos | 0.00 | 0.00 | 0.00 | 6.97 | 0.14 | 7.11 | |
| others | 0.00 | 0.00 | 17.16 | 4.10 | 2.27 | 23.53 | 23.53 |
| Total (century) | 0.00 | 0.00 | 27.98 | 59.76 | 12.27 | 100.00 | 100.00 |

Measures of lexical diversity
| Measure | Description | Formula | Result |
|---|---|---|---|
| TTR | type-token ratio | | 0.027 |
| RTTR | Guiraud's root type-token ratio | | 24.871 |
| CTTR | Carroll's corrected type-token ratio | | 17.586 |
| C | Herdan's C index | | 0.735 |
| S | Somers' S index | | 0.882 |
| M | Maas' index | | 0.036 |
| H | Honoré's index | | 2568.155 |
| K | Yule's K index | | 172.530 |
| D | Simpson's D index | | 0.017 |
| HTR | Hapax-token ratio | | 0.468 |
| DTR | Dis-token ratio | | 0.150 |
| VGR | Vocabulary growth rate | | 0.012 |

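The Formula column is empty in this copy of the page. As a hedged reconstruction, the sketch below applies the usual textbook definitions of nine of the measures to the counts in General stats. It assumes that N is the Words figure (tokens without punctuation), that V is the number of types, and that logarithms are natural; under those assumptions the results agree with the table to three decimals. Two points are worth noting: the reported HTR and DTR values match hapax and dis legomena divided by types, not by tokens, and M, K and D are left out because K and D need the full frequency spectrum and the Maas variant used here is not clear from the page.

```python
import math

# Counts copied from "General stats"; the choice of N is an assumption.
N = 866485   # words (tokens minus punctuation)
V = 23151    # types
V1 = 10826   # hapax legomena (types occurring once)
V2 = 3480    # dis legomena (types occurring twice)

measures = {
    "TTR": V / N,                                        # type-token ratio
    "RTTR": V / math.sqrt(N),                            # Guiraud
    "CTTR": V / math.sqrt(2 * N),                        # Carroll
    "C": math.log(V) / math.log(N),                      # Herdan
    "S": math.log(math.log(V)) / math.log(math.log(N)),  # Somers
    "H": 100 * math.log(N) / (1 - V1 / V),               # Honoré
    "HTR": V1 / V,                                       # hapax over types
    "DTR": V2 / V,                                       # dis over types
    "VGR": V1 / N,                                       # hapax over words
}

for name, value in measures.items():
    print(f"{name}: {value:.3f}")
```

Using the Tokens figure (967592) for N instead gives a TTR of about 0.024, which does not match the reported 0.027; that is why the sketch takes Words as the denominator.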