<%@LANGUAGE="JAVASCRIPT" CODEPAGE="CP_ACP"%> Untitled Document

 

ATLAS LINGÜÍSTICO (Y ETNOGRÁFICO) DE CASTILLA - LA MANCHA

 

página principal

 

 

AUTOMATIC CARTHOGRAPHY AND GEOLINGUISTIC DATA BASES

 

Francisco Moreno-Fernández
José Enrique Moreno-Fernández
Antonio García de las Heras


University of Alcalá, Spain

VERSIÓN EN ESPAÑOL

Introduction
Since 1996, the University of Alcalá has been making a remarkable effort to obtain a computer application capable of mapmaking, in an automatic way, materials previously registered in one or more databases. This effort has allowed us to incorporate our work into a systematic form of research which has been very much developed in other research centers, both in Europe and in America: we can refer to the computer systems developed for the Atlante linguistico del ladino dolomitico/centrale e dialetti limitrofi, of the University of Salzburg (Austria), in the computerization of the Atlante linguistico de la Sicilia, accomplished in the “Centro di Studi Filologici e Linguistici Siciliani” (Palermo, Italia), or in the work of the Linguistic Atlas of the Middle and South Atlantic States, of the University of Georgia (USA).
In the Spanish speaking countries, the first work which has been done within the automation field has been the Atlas Lingüístico y Etnográfico de Cantabria, directed by Manuel Alvar, though there is also news of the information processing in some of the materials of the Atlas Lingüístico de Hispanoamérica. The application developed in the University of Alcalá is called Atlante, and it has been designed for editing any linguistic atlas just with a basic adaptations.The purpose of this paper is to make a brief description of Atlante’s usefulness and characteristics.

 

Objectives of Atlante
The general objective of the Atlante program is the automation of cabinets work that lead to the attainment of a linguistic atlas, as well as, maximizing the use of all the linguistic information which is contained in a piece of work with these characteristics. To fulfill such an objective, it is necessary a database in which all the materials remain registered, a database which permits a rapid search of whatever information the researcher requires and the elaboration of several kinds of indexes. Furthermore, a mechanism is needed to permit us to transfer to a map the linguistic data contained in the base, either in the form of phonetic transcription, or in ordinary writing or through symbols.
The purpose of the Atlante application is not only to get to a computerized draft of a linguistic atlas that is can be represented on paper, but to offer the possibility of working and manipulating the linguistic information directly by computerized means. In other words, a mean has been sought to automate the conventional elaboration of linguistic atlases and permits us to present an atlas in printed paper or in computer supports, as the CD-ROM or on-line consultation.
When deciding the type of atlas we would wish to achieve, we have used as a reference one of the richest and complex models of those we know: the model of the Spanish linguistic atlases. These atlases, as it is known, include linguistic maps, ethnographic maps and linguistic-ethnographic maps, and the linguistic information is offered, some times through symbols - as it is usual in other computerized works - but most times, in phonetic transcription. The answers have a very variable extension at each point with the data distributed within the maps, outside the maps (on the margins), and in complementary plates. At the same time, different types of letters and characters of various nature alternate. The phonetic transcription is also presented following the alphabet of the Revista de Filología Española.
The characteristics of the work which has served us as a model, demanded the preparation of an application extremely flexible, capable of registering materials in phonetic transcription and in ordinary writing. This application must alternate alphabets in the same text, editing automatically very dense maps, but giving the researcher the opportunity of adjusting the definitive presentation of the maps according to his/her needs and wants. All of this is complemented with the development of the mentioned phonetic alphabet, but enriched with a long list of phonetic characters, indispensable for the transcription of numerous features of Spanish speeches.
The databases should be capable of allowing very diverse uses and manipulations, from the simple consultation of any chain of characters, to the indexes elaboration - including indexes of inverse forms- the incorporation of sound files, the incorporation of graphs, and the calculation of linguistic distances between survey points (dialectometry).

 

Main characteristics of Atlante
Atlante is a 32 bits operating system application designed for Windows 95. It has been developed using Visual Basic version 4.0. Creation and management of the Atlante’s databases have been made with the “Microsoft Access Jet Engine”. For running Atlante in a personal computer, we recommend at least the following hardware specifications: 32 meg. RAM and Pentium processor 100 MH or faster and hard drive enough to store the databases and the bmp 256 color files generated by Atlante.

The application uses three specific databases and their graphical projection on the map. These databases are the following:

Survey points or Community Database
This option is available from the main menu, once the application has been opened. Here it is where you store the name, the id code, the speakers’ name and the coordinates of each point on the map.
Speakers Database
This option is available from the main menu. In this base we will store the data concerning the speakers (profession, age, the parents’ born place, etc.)
Linguistic Information Database (RESPUESTAS)
This option is also available from the main menu. The linguistic data stored here appear in ordinary writing as well as in phonetic transcription. In this transcription it is possible to use any font installed in “Windows”.
The University of Alcalá has developed the font afe.ttf for representing the phonetics symbols established in the Revista de Filología Española. In any case you may use any available “International Phonetic Alphabet” font.
From the databases detailed, the most complex, from a technical point of view, and the richest, from a linguistic perspective, is “RESPUESTAS” (answers). Technically it is not a single database, but a set of bases, which will be more or less extensive depending on the contents you want to include: a general linguistic atlas can not be the same as a phonetic atlas, and it is not the same a 300 map linguistic atlas as one with 3000 maps. When we talk about “RESPUESTAS” database (answers) we just pretend to make the program’s structure a bit easier to understand.
The databases included in the “RESPUESTAS” database could be so many as linguistic levels in the atlas. For an atlas with just phonetic-phonological and lexical levels, only two databases should be necessary. On the other hand, every single database could be composed of tables that we call “Fields”. For example, the “Phonetic Database” could be divided into the “Vowel Phonetics” and “Consonant Phonetics” fields, and the lexical database could be divided into fields like “Human body”, “Farming Life”, “Atmospheric Phenomenon”, etc. Inside each field, the answers relating to certain objects, actions or concepts have been inscribed. The larger the number of items that make up each base, the more necessary the subdivision in tables or fields is, to insure a very fast and simple handling of the application.

The introduction of data into the “RESPUESTAS” database will be made using the form in a window: some data must be stored in writing and the rest in phonetic transcription. To introduce data in each field you must choose among the different options in the form (point, speaker, field).
Every other different possible selections are automatically showed. This selection is possible because inside the form we are provided with an update list of every survey points and speakers from our databases.
The linguistic information introduced in this form is stored in ordinary writing as well as phonetic transcription. The main reason to do it this way is to make searches for strings of characters easier. According to verb conjugation, it is also possible to have a verb form list for consulting and updating.
To introduce linguistic information, that is to say, the speakers answers in the questionnaire, we have two different helps: on the one hand, the form provides a list of answers stored previously; on the other hand, we have graphic help about the phonetic alphabet by pressing F1 and F2 keys.
To sum up, the forms and windows from where it is possible to update and consult the answers in the geolinguistic questionnaire provides updated information about survey points, speakers, the linguistic level and the field in which the answers are stored, in written form and in phonetic transcription.