Observations on toponymy along today's Bavarian-Romansh language border in the Tyrolean Oberland (with two maps). From a toponomastic point of view, one of the most interesting areas of Tyrol is the Tyrolean Oberland, and in particular the district of Landeck. One reason is that, in several respects explained in more detail below, it is both a historical and a present-day border area. Because border areas are always also contact areas, a toponymic fabric developed here over the centuries whose complexity has repeatedly attracted the interest of name researchers.
<p>The geographical distributions and frequencies of a total of 853 pasture names with Slavic (panel <b>A</b>), Romance (panel <b>B</b>), and Germanic (panel <b>C</b>) etymons are shown. Black dots indicate the locations of the pastures. Panel <b>D</b> shows the locations of the blood donors' municipalities of birth and the subdivision of the study area into two regions of former Romance (A) and Slavic (B) main settlement.</p>
Names perform a key function in texts: through their direct reference to extra-linguistic objects, they provide immediate information about who is doing what, who is affected by what, and which places are involved. Names therefore also play a key role in the computer-based processing of texts. For semantic technologies to be applied, names and definite descriptions must be marked up in texts. In information technology, this set of references to extra-linguistic objects is largely subsumed under the term "named entities" (NE). This paper discusses, first, the technical processes, methods and possible ways of representing the contexts of large sets of names/named entities and, second, the challenges that allonymic and orthographic variants of names pose for text processing. Third, the substantive focus is on the distinction between name and definite description in the Early New High German mining documents "Schwazer Berglehenbuch" and "Verleihbuch der Rattenberger Bergrichter", which provide information on individuals, places, mines and dates linked by the legal act of lending.
<p>The Austrian political district of Lienz (“East Tyrol”) is highlighted in orange. <b>A</b>: Austria, <b>G</b>: Germany, <b>I</b>: Italy.</p>
The files in this directory contain the January 2020 version of the Alpenwort Corpus, enriched with automated NER for places, persons and organisations and NEL for places and persons. Data are provided in delimited text format (.csv), RDF (.ttl) and JSON-LD (.json).
The unified gazetteer for this project was derived from 6 sources, referred to here as local gazetteers. Entries from these gazetteers were automatically matched to the Alpenwort Corpus, its Table of Contents and the OeAV Photo Index. All matches were integrated in the file "mount_place_name_gaz.txt". The place types present in these 6 gazetteers were also mapped to broader categories; this mapping is in the file "gaz_union_thes.txt". Details are available at semanticmountain.at.
The file contains persons of the Alpenwort Corpus extracted from the Table of Contents and the text, together with potential links to Wikidata that were generated through automated matching of first name and family name.
This paper outlines the construction of the corpus Alpenwort, a large, genre-based corpus of German texts on alpinism. We report on issues related to building the corpus from the Austrian Alpine Club Journal (1869–2010). First, we give a general description of our data and the project phases from digitization and annotation to publication. We then focus on the most interesting challenges that the diverse layouts and the extensive use of Fraktur typeface posed for optical layout recognition, optical character recognition (OCR) and post-correction. The corrected data was lemmatized and annotated with part-of-speech information, including named entities, as well as TEI-conformant metadata. The resulting 19.9-million-word corpus is designed to be queried using CQPweb and Hyperbase and can be accessed freely online. Lastly, we give a short roadmap of current and future expansions and improvements, as the corpus data has been and is being enhanced in follow-up projects.
In the project <em>Alpenwort. Korpus der Zeitschrift des Deutschen und Österreichischen Alpenvereins</em>, the almanac of the Austrian Alpine Club for 1869–1998 (Zeitschrift des Deutschen und Österreichischen Alpenvereins, ZAV) was digitized and annotated. The ZAV is a very important source, especially for Austria, which holds, by area, the largest share of the Alpine arc. In its first decades the magazine's contributions reflect the ongoing tourist and cartographic exploration of the Alps and the economic and scientific discoveries involved. During the 20<sup>th</sup> century, perspectives expanded to the mountains of the world. Globally relevant topics such as environmental and nature protection are discussed, as well as questions of regional identity and cultural heritage. The main goal of the project was to make this unique source accessible to the scientific community, enhanced by metadata conformant with CLARIN-DARIAH standards. All original PDFs of the ZAV j...
Papers by Gerhard Rampl