Corpus info
General stats
Tokens | 967988
|
---|
Words | 866855
|
---|
Types | 23183 |
---|
Lemmas | 15566 |
---|
Hapax legomenon | 10846 |
---|
Dis legomenon | 3484 |
---|
POS tags | 498 |
---|
Documents
Number of documents | 726
|
---|
Average (tokens per document) | 1333 |
---|
Median (tokens per document) | 985 |
---|
Longest document (tokens) | 15663 |
---|
Shortest document (tokens) | 108 |
---|
Oldest document (year) | 1600 |
---|
Most recent document (year) | 1896 |
---|
Group by part of speech
Main POS tag | N | % |
---|
common noun | 213767 | 22.08 |
---|
preposition | 156273 | 16.14 |
---|
determiner | 106169 | 10.97 |
---|
punctuation | 101133 | 10.45 |
---|
verb | 91669 | 9.47 |
---|
numeral | 72130 | 7.45 |
---|
conjunction | 70479 | 7.28 |
---|
adjective | 45441 | 4.69 |
---|
pronoun | 38117 | 3.94 |
---|
proper noun | 34096 | 3.52 |
---|
adverb | 26938 | 2.78 |
---|
untagged | 11406 | 1.18 |
---|
foreign word | 272 | 0.03 |
---|
interjection | 98 | 0.01 |
---|
Total | 967988 | 100.00 |
---|
Group by project
Project | N | % |
---|
CORDEREGRA | 365578 | 37.77 |
---|
HISPATESD | 247593 | 25.58 |
---|
ALEA18 | 176369 | 18.22 |
---|
CORTENEX | 167883 | 17.34 |
---|
ALEA19 | 8019 | 0.83 |
---|
VIVE | 2438 | 0.25 |
---|
_ | 108 | 0.01 |
---|
Total | 967988 | 100.00 |
---|
Group by text type
Text type | N | % |
---|
inventory of goods | 698910 | 72.20 |
---|
witness statement | 192917 | 19.93 |
---|
medical certificate | 71302 | 7.37 |
---|
other | 4640 | 0.48 |
---|
OTH | 219 | 0.02 |
---|
Total | 967988 | 100.00 |
---|
Group by century
Century | N | % |
---|
XVIII | 578204 | 59.73 |
---|
XVII | 270685 | 27.96 |
---|
XIX | 119099 | 12.30 |
---|
Total | 967988 | 100.00 |
---|
Group by province
Province | N | % |
---|
Granada | 167492 | 17.30 |
---|
Jaén | 113009 | 11.67 |
---|
Almería | 112057 | 11.58 |
---|
Badajoz | 110973 | 11.46 |
---|
Madrid | 75183 | 7.77 |
---|
Cádiz | 72787 | 7.52 |
---|
Málaga | 69754 | 7.21 |
---|
Burgos | 68793 | 7.11 |
---|
Cáceres | 61248 | 6.33 |
---|
Sevilla | 33045 | 3.41 |
---|
Huelva | 27320 | 2.82 |
---|
Murcia | 14863 | 1.54 |
---|
Valladolid | 12720 | 1.31 |
---|
La Rioja | 5349 | 0.55 |
---|
Cantabria | 4944 | 0.51 |
---|
Toledo | 3982 | 0.41 |
---|
Palencia | 3785 | 0.39 |
---|
Zamora | 2150 | 0.22 |
---|
Navarra | 2011 | 0.21 |
---|
Álava | 1680 | 0.17 |
---|
Soria | 1444 | 0.15 |
---|
León | 1380 | 0.14 |
---|
Córdoba | 883 | 0.09 |
---|
Gipuzkoa | 605 | 0.06 |
---|
Teruel | 293 | 0.03 |
---|
Salamanca | 238 | 0.02 |
---|
Total | 967988 | 100.00 |
---|
Group by institution
Institution | N | % |
---|
Archivo de la Real Chancillería de Granada | 237396 | 24.52 |
---|
Archivo Histórico Provincial de Badajoz | 107981 | 11.16 |
---|
Archivo Histórico Provincial de Jaén | 105466 | 10.90 |
---|
Archivo Histórico de Protocolos de Madrid | 73290 | 7.57 |
---|
Archivo Histórico Provincial de Burgos | 66656 | 6.89 |
---|
Archivo Histórico Provincial de Almería | 63774 | 6.59 |
---|
Archivo Histórico Provincial de Cáceres | 59902 | 6.19 |
---|
Archivo Histórico Provincial de Cádiz | 51101 | 5.28 |
---|
Archivo Histórico de Protocolos de Granada | 49570 | 5.12 |
---|
Archivo de la Real Chancillería de Valladolid | 44578 | 4.61 |
---|
Archivo Histórico Provincial de Huelva | 26573 | 2.75 |
---|
Archivo Histórico Provincial de Sevilla | 24702 | 2.55 |
---|
Archivo Histórico Municipal de Lorca | 14863 | 1.54 |
---|
Archivo Municipal de Puerto Real | 14253 | 1.47 |
---|
Archivo Histórico Provincial de Málaga | 13521 | 1.40 |
---|
Archivo Histórico Municipal de Baeza | 6443 | 0.67 |
---|
Archivo Municipal de Vera | 5807 | 0.60 |
---|
Archivo Histórico Provincial de Córdoba | 883 | 0.09 |
---|
Archivo Histórico Municipal de Loja | 781 | 0.08 |
---|
AHPC | 448 | 0.05 |
---|
Total | 967988 | 100.00 |
---|
Group by century and province (absolute frequencies)
| XV | XVI | XVII | XVIII | XIX | Total (province) | Total (area) |
---|
Almería | | | 26626 | 47667 | 37764 | 112057 | 392558 |
---|
Granada | | | 51452 | 113411 | 2629 | 167492 |
---|
Jaén | | | | 109275 | 3734 | 113009 |
---|
Málaga | | | 26420 | 33732 | 9602 | 69754 | 70637 |
---|
Córdoba | | | | | 883 | 883 |
---|
Cádiz | | | | 70676 | 2111 | 72787 | 133152 |
---|
Sevilla | | | 168 | 32877 | | 33045 |
---|
Huelva | | | | 27320 | | 27320 |
---|
Madrid | | | | 36074 | 39109 | 75183 | 143976 |
---|
Burgos | | | | 67453 | 1340 | 68793 |
---|
others | | | 166019 | 39719 | 21927 | 227665 | 227665 |
---|
Total (century) | 0 | 0 | 270685 | 578204 | 119099 | 967988 | 967988 |
---|
Group by century and province (relative frequencies)
| XV | XVI | XVII | XVIII | XIX | Total (province) | Total (area) |
---|
Almería | 0.00 | 0.00 | 2.75 | 4.92 | 3.90 | 11.58 | 40.55 |
---|
Granada | 0.00 | 0.00 | 5.32 | 11.72 | 0.27 | 17.30 |
---|
Jaén | 0.00 | 0.00 | 0.00 | 11.29 | 0.39 | 11.67 |
---|
Málaga | 0.00 | 0.00 | 2.73 | 3.48 | 0.99 | 7.21 | 7.30 |
---|
Córdoba | 0.00 | 0.00 | 0.00 | 0.00 | 0.09 | 0.09 |
---|
Cádiz | 0.00 | 0.00 | 0.00 | 7.30 | 0.22 | 7.52 | 13.76 |
---|
Sevilla | 0.00 | 0.00 | 0.02 | 3.40 | 0.00 | 3.41 |
---|
Huelva | 0.00 | 0.00 | 0.00 | 2.82 | 0.00 | 2.82 |
---|
Madrid | 0.00 | 0.00 | 0.00 | 3.73 | 4.04 | 7.77 | 14.87 |
---|
Burgos | 0.00 | 0.00 | 0.00 | 6.97 | 0.14 | 7.11 |
---|
others | 0.00 | 0.00 | 17.15 | 4.10 | 2.27 | 23.52 | 23.52 |
---|
Total (century) | 0.00 | 0.00 | 27.96 | 59.73 | 12.30 | 100.00 | 100.00 |
---|
Measures of lexical diversity
Measure | Description | Formula | Result |
---|
TTR | type-token ratio | | 0.027 |
---|
RTTR | Giraud's root type-token ratio | | 24.900 |
---|
CTTR | Carroll's corrected type-token ratio | | 17.607 |
---|
C | Herdan's C index | | 0.735 |
---|
S | Somer's S index | | 0.882 |
---|
M | Maas' index | | 0.036 |
---|
H | Honoré's index | | 2569.284 |
---|
K | Yule's K index | | 172.443 |
---|
D | Simpson's D index | | 0.017 |
---|
HTR | Hapax-token ratio | | 0.468 |
---|
DTR | Dis-token ratio | | 0.150 |
---|
VGR | Vocabulary growth rate | | 0.013 |
---|