Detta är urval av resurser på internet för dig som läser lingvistik. Vill du söka på egen hand så kan du använda sökverktyg på Internet
Ask-a-linguist
Ask-A-Linguist is a service provided by The LINGUIST List, an Internet network for professional linguists. Ask-A-Linguist is designed to be a place
where anyone interested in language or linguistics can ask a question and get the response of a panel of professional linguists
Bibliographic Databases in Lingustics
This page allows you to access a collection of Bibliographic Databases in Linguistics held by the CL/MT Research Group at the University of Essex
Blackwell Linguistics Resource Center
ELSNET is the European Network in Language and Speech
The long-term technological goal which unites the participants of ELSNET is to build multilingual speech and NL systems with unrestricted coverage of both spoken and written language
FLTeach. Foreign Language Teaching Forum
ILoveLanguages - Human languages page
A comprehensive catalog of language-related Internet resources. Whether you're looking for online language lessons, translating dictionaries, native literature, translation services, software,language schools, or just a little information on a language you've heard about, the HLP probably has something to suit your needs
The Linguist list
The aim of the list is to provide a forum where academic linguists can discuss linguistic issues and exchange linguistic information Eastern Michigan University, Wayne State University
Linguistics, Natural Language, and Computational Linguistics Meta-index A guide to the best linguistic resources on the web
Dmoz. Open Directory Project Language and linguistics
comp.speech Frequently Asked Questions This site provides a range of information on speech technology, including speech synthesis, speech recognition, speech coding, and related material
European Language Resources Association The European Language Resources Association (ELRA) was established as a non-profit organization in Luxembourg in February, 1995. The overall goal of ELRA is to provide a centralized organization for the validation, management, and distribution of speech, text, and terminology resources and tools, and to promote their use within the European telematics R&TD community
Eclectic Company. Language & Linguistics
ETHNOLOGUE. Languages of the World Internet Version 13th edition. Barbara F. Grimes, Editor. Consulting Editors: Richard S. and Pittman & Joseph E. Grimes. Copyright © 2000 SIL International. This electronic version of the Ethnologue, the Ethnologue Language Name Index, and the Ethnologue Language Family Index contains the entire text of the original printed volumes (except language maps)
ILoveLanguages iLoveLanguages is a comprehensive catalog of language-related Internet resources. The more than 2000 links at iLoveLanguages have been hand-reviewed to bring you the best language links the Web has to offer. Whether you're looking for online language lessons, translating dictionaries, native literature, translation services, software, language schools, or just a little information on a language you've heard about, iLoveLanguages probably has something to suit your needs
Language-Related Net Resources English Department of Mississippi State University
Ligações de Línguas e Linguística/Language and Linguistics Links An impressive list of linguistics and language links compiled by the Department of General Linguistics and Literary Theory, Universidade da Corunha, Spain. Particularly strong in its links to linguistics working papers
Linguist.de© 1999/2000 by Jan Wohlgemuth
The LINGUIST List Eastern Michigan University, Wayne State University. Links to academic resources for linguists
Linguistic Links Maintained by University of Rochester Linguistics Department
Linguistic Resources on the Internet SIL International
Linguistics. A Guide to Internet Resources
David Langenberg, University of Delaware Library
linguistics.de Erstellt am 01.03.2000 von Dirk Wiebel.
Linguistics, Natural Language, and Computational Linguistics Meta-index
A large collection of web sites about linguistic resources, theories, journals, societies, and (computational) linguistics departments and programs
The Linguist list Hosted by Eastern Michigan University and Wayne State University. Links to academic resources for linguists
Språkvetenskap på Internet
Innehåll: ordböcker, textarkiv, språkliga institutioner i orden,konferenser,bibliotek,,boklådor/antikvariat,tidskrifter, SGML
Sammanställd av Yvonne Cederholm, Institutionen för svenska språket, Göteborgs universitet
Virtual Library: Linguistics
A mega site for the field of linguistics. Included are links for linguistics research including texts, discussion lists, language resources and university linguistics departments. Listings for upcoming conferences are also provided as is an archive for selected scholarly papers.
Online Directory of ESL Resources This directory provides an online reference for the most useful ESL resources primarily for teachers and students, though other groups, such as administrators, business people, researchers, and parents, may find them helpful as well
Speech and Language Web Resources Sammanställd vid Kita Lab., Department of Information Science & Intelligent Systems, Faculty of Engineering, Tokushima University
EuroDicAutom
Eurodicautom is the multilingual terminological database of the European Commission's Translation Service
Lexicon of Linguistics. Editors: Jan Don, Johan Kerstens, Eddy Ruys
Utrecht Institute of Linguistics OTS Utrecht University. Cop. 1996-2000
Odin. Skandinavisk ordbog
Lexikon med cirka 10000 ord som skiljer mellan danska, norska och svenska. Den skandinaviska ordboken är utarbetad av Birgitta Lindgren, Skirne Helg Bruland, Allan Karker och Ståle Løland. Den är utgiven 1994 av Norstedts Förlag AB i samarbete med Nordiska språksekretariatet med ekonomiskt stöd av Nordiska ministerrådet
¨
yourDictionary.com
" ... founded in 1999 to provide the world's most comprehensive, and authoritative portal for language, and language-related products and services on the world wide web. yourDictionary.com has the widest and deepest set of dictionaries on the web (more than 1500 dictionaries representing more than 230 languages)."
The Homepage of Integrational Linguistics Integrational Linguistics (no relation to an approach by the same name developed by Roy Harris) is a specific approach to linguistics combining a comprehensive theory of language and a theory of grammars and providing a consistent framework for the analysis and description of arbitrary languages from any point of view that is linguistically relevant.
Charles S. Peirce The life and times of Charles Sanders Peirce. Hypertext editions of Peirce's writings. The Community of Inquirers. Information about mailing lists, events, organizations and individuals concerned with Peirce and his ideas.
Kultursemiotik - Visuell semiotik - Semiotisk teori Sammanställd av Göran Sonesson , Avdelningen för semiotik, Lunds universitet.
Links zur Linguistik: Semiotik Utg. Linse. Linguistik-Server Essen
Semiotics for Beginners, Daniel Chandler
Semiotics. University of Colorado at Denver. School of Education
Sites of Significance for Semiotics
Psycholinguistics
All contents copyright © 2000, SUNY Oswego
The Psychology of Language Page of Links University of Memphis.
CHILDES Bibliography The main CHILDES Bibliography is an electronic database of about 30,000 bibliographic references in the field of child language research.
Early Language Intervention This resource site is designed to help speech-language pathologists who work with children who have developmental language disorders. Comp. By Judith Johnston
The North American Association for the History of the Language Sciences(NAAHoLS)
The Computational Historical Linguistics Project: Reconstructing the evolutionary history of natural languages The Computational Historical Linguistics is a joint research project of the Computer and Information Science Department and the Linguistics Department at the University of Pennsylvania.
IE Documentation Center The University of Texas at Austin ( The Indo-European (IE) Documentation Center, supported by the Diebold Foundation, is designed as a service to scholars and members of the public interested in Indo-European language and culture. Our long-range goal is to provide a searchable index of materials.
Lehmann's Reader: A Reader in Nineteenth Century Historical Indo-European Linguistics - Anthology of important works of nineteenth-century historical Indo-European linguistics, edited and translated by W. P. Lehmann, 1967
Interactional Sociolinguistics. Useful Links
Department of English, City University of Hong Kong
Language and gender page The Language and Gender Page provides information and resources about language and gender studies, an interdisciplinary field with connections to anthropology, cultural studies, education, ethnic studies, linguistics, literary studies, psychology, sociology, and women's studies, among others
Language futures Europé This site collects links on the language futures of Europe - on language policy, multilingualism, global language structures, and the dominance of English. It starts with a comment on the structures of language; then links to texts and essays; and then sections on EU policy, national policies, and research sites; and finally the 'monolingual movement' in the United States.
Linguistic Rights The MOST Clearing House on Linguistic Rights is designed to provide tools for legislators, decision-makers, researchers and other representatives of both governmental and non-governmental organizations to monitor the transition to democracy in multicultural and multi-ethnic societies. It provides an overview of the most
Studies on Gay & Lesbian Language
Maintained by Gregory Ward/Gregory Greenman II
The ACL NLP/CL Universe
The ACL home page http://www.aclweb.org contains information about the association as well as information on becoming a member.
The NLP/CL Universe is a Web catalog/search engine that is devoted to Natural Language Processing and Computational Linguistics Web sites. It exists since March 18, 1995.
Academic departments and institutes
Companies and corporate research labs
Various resources - books, bibliographies
Conferences
The Association for Computational Linguistics
The Association for Computational Linguistics is THE international scientific and professional society for people working on problems involving natural language and computation
Centre for Computational Linguistics (CCL) , the Katholieke Universiteit Leuven
Survey of the State of the Art in Language Technology. Web edition. Edited by: Ron Cole (Editor in chief). Cambridge University Press and Giardini 1997 the WWW version of a comprehensive book on all aspects of language technology written by more than 50 international experts in the field; funded by the European Commission and the National Science Foundation: a successful result of USAmerican - EUropean scientific cooperation (The book is published by Giardini and Cambridge University Press)
Human Computer Interaction and Language Engineering (HUMLE) Stockholm The HUMLE lab at Swedish Institute of Computer Science consists of approximately 20 computer scientists, linguists, psychologists, interaction designers and communication researchers. We are interested in computer mediated social interaction between people, navigation in information spaces, service infrastructures, language engineering and believable characters. Some of the information technology we prototype, develop, and investigate is mobile
Computing Research Laboratory (CRL)
Computing Research Laboratory. Related Web Sites
Hans Uszkoreit: Computational Linguistics& Language Technology on the Web
Computerlinguistik - Sprach- und Übersetzungstechnologie WWW-Seiten zur Computerlinguistik am Fachbereich Sprachen der FH Köln mit kurzer Einführung in die Computerlinguistik in Deutsch und Englisch (Titel: "Computer und Übersetzen" / "Computers and Translation") sowie einer kleinen (ständig gepflegten) Link-Liste.
Linguistic Data Consortium
The Linguistic Data Consortium is an open consortium of universities, companies and government research laboratories. It creates, collects and distributes speech and text databases, lexicons, and other resources for research and development purposes. The University of Pennsylvania is the LDC's hostinstitution
Natural Language Processing (NLP)
Statistical natural language processing and corpus-based computational linguistics An annotated list of resources
WWW Information for Speech/Acoustics Research
Annotated Bibliography of Contemporary Research in Tense, Grammatical Aspect, Aktionsart, and Related Areas
Contragram For a number of years the University of Ghent's departments of English, French, and Dutch have been engaged in joint research projects in the area of contrastive grammar. The research group evolving from this collaboration was recently christened the CONTRAGRAM group. They publish a newsletter with the same name, which reports on the results of the research and also contains other information of interest to contrastive grammarians.
FrameNet The Berkeley FrameNet project is creating an online lexical resource for English, based on frame semantics and supported by corpus evidence
The functional grammar information system
Head-driven Phrase Structure Grammar
HPSG is a constraint-based, lexicalist approach to grammatical theory that seeks to model human languages as systems of constraints on typed feature structures
Head-Driven Phrase Structure Grammar
Information for Systemic-Functional Linguists Mick O'Donnell
Role and Reference Grammar (RRG)
University at Buffalo The State University of New York Department of Linguistics
WORD GRAMMAR
Word Grammar is a theory of language structure which Richard (= Dick) Hudson has been building since the early 1980's
The XTAG Project
Page maintained by Anoop Sarkar
XTAG is an on-going project to develop a wide-coverage grammar for English using a lexicalized Tree Adjoining Grammar (TAG) formalism. XTAG also serves as an system for the development of TAGs and consists of a parser, an X-windows grammar development interface and a morphological analyzer
yourDictionary.com
The Web of On-line Dictionaries is now yourDictionary.com, the web's most authoritative and comprehensive language portal
Acoustics and Speech
by Philip Rubin, Haskins Laboratories
Center for Spoken Language Understanding stellt umfangreiche Materialien zur maschinellen Verarbeitung gesprochener Sprache zur Verfügung, darunter (a) Tutorials und Demos zur Spracherkennung und Sprachsynthese, (b) Tools zur Erforschung gesprochener Sprache und Mensch-Maschine-Kommunikation, (c) die Beschreibung kostenlos lieferbarer großer Corpora von Telefongesprächen in mehr als zwanzig Sprachen, (d) zahlreiche online verfügbare Publikationen sowie (e) die Hypertextversion des ausführlichen Survey of the State of the Art in Human Language Technology von 1996
The International Phonetic Association
Links zur Linguistik: Phonetik/Phonologie Utg.: Linse. Linguistik-Server Essen
More Phonetics Resources. Compiled and maintained by Frank Gooding
Phonetics Resources
Sammanställd av George L. Dillon, University of Washington
Phonology & the web. Links Matthew Brooks
Praat: doing phonetics by computer
The computer program Praat is a research, publication, and productivity tool for phoneticians.This comprehensive speech analysis, synthesis, and manipulation package includes general numerical and statistical stuff, is built on a general-purpose GUI shell for handling objects, and produces publication-quality graphics.
Praat has been developed by Paul Boersma and David Weenink at the Institute of Phonetic Sciences of the University of Amsterdam, The Netherlands
Proceedings of the Institute of Phonetic Sciences, Amsterdam IFA Proceedings 15 (1991-
Reproduction of The International Phonetic Alphabet (Revised to 1993, Updated 1996)
Speech Internet Dictionary Welcome to the Speech Internet Dictionary (SID). The aim of SID is to provide concise definitions of technical terms used in phonetics, phonology, speech and hearing science and allied disciplines.
Speech Literature and References Utg. av Instituut voor Fonetische Wetenschappen, Universiteit van Amsterdam
Speech Research
Perceptual Science Laboratory at the University of California - Santa Cruz
Distributed Morphology Bibliography
Sammanställd av Rolf Noyer
Lexeme-Morpheme Base Morphology (LMBM)
Sammanställd av Robert Beard, Bucknell University
Lexikonaufbau und Morphologie-Analyseverfahren Dies ist die Übersichts- und Startseite für Materialien zur Vorlesung: Lexikonaufbau und Morphologie-Analyseverfahren wie sie im Sommersemester 1999 an der Universität Zürich angeboten wird (Dozent: Dr. Martin Volk, Computerlinguistik). Die Materialien wurden im Sommersemester 1996 erarbeitet und in den folgenden Jahren jeweils aufdatiert.
Multifunctional Morphological Dictionary Analysiert und erzeugt sowohl Flexions- als auch Ableitungsformen englischer und deutscher Wörter.
The Project on Annotated Bibliography of Contemporary Research in Tense, Grammatical Aspect, Aktionsart, and Related Areas Professor Robert I. Binnick, Division of Humanities, University of Toronto at Scarborough, Scarborough, ON Canada
Autolexical Theory
by Eric Schiller
Dependency-Based Approaches to Natural Language Syntax
Unofficial Links and Notes on LFG/OT
by Joan Bresnan
Autolexical Theory by Eric Schiller
Bibliography of linguistics papers dealing with lexical semantics by Heidi Harley
Bibliography on possessives and related problems By Yura Lander
Dependency-Based Approaches to Natural Language Syntax
LFG Morphosyntax. Some recent work on Morphology, Syntax and the Morphology/Syntax Interface Sammanställd av Louisa Sadler, Essex University
Resources for Research on Genitives/Possessives and Beyond By Barbara Partee
Unofficial Links and Notes on LFG/OT by Joan Bresnan
The Minimalist Syntax Archives The Arizona Minimalist Syntax Archives (AMSA) contains pdf and postscript files of important and recent papers written within the Minimalist Framework
A step-by-step introduction to the Government and Binding theory of syntax By Cheryl A. Black A textbook-length explanation of Chomsky's theory. PDF file.
Semantic Links Sammanställd av semanticsarchive.net Rubriker: Conferences and Events, Mailing List, Journals, Book Sereies, and Conference Proceedings, Organizations, Other Online Paper Archives, Bibliographies, Web Resources
Semantics on the Web Kai von Fintel's list of semantics and pragmatics links on the Web includes home pages of individual researchers (many with on-line papers available)
Semanticsarchive.net Site For exchanging papers of interest to natural language semanticists; also includes links to other semantics resources
.PHRASEOLOGY. A Catalogue of Multilingual Resources on the Internet
NORNA. Links to Nordic name-research sites The Nordic cooperative committee for onomastic research (NORNA) is an association of Nordic scholars specialising in research into names.
Patrick Hanks and Flavia Hodges A Dictionary of First Names, Oxford University Press 1990
Links zur Linguistik: Sprachkartographie und Dialektforschung Utg. Linse. Linguistik-Server Essen
The Creolist Archives Home Page
Bilingualism Database This database is intended for use by those interested or involved in the field of Bilingualism regardless of whether they are professionals, researchers, students or parents of bilingual children. The database can be accessed via this website, as can a number of webpages containing useful information on bilingualism. The database is drawn from resources reflecting the availability of research material from the US, the United Kingdom, Canada and Australia.
Center for the Cognitive Science of Metaphor Online
The Metaphor Home Page This site contains opinions and resources about the linguistic and conceptual phenomena we refer to as Metaphor and Analogy. But rather than approach the phenomena from the traditional perspectives offered by philosophy and psychology, this site offers a computational perspective and uses tools and ideas from the field of Artificial Intelligence to provide further insights into what happens when we use and comprehend metaphor.
The Metaphor and Metonymy Group The Metaphor and Metonymy Group was formed in 1995-1996 and consists of Zazie Todd at the Department of Psychology, University of Leicester, Dr. Brigitte Nerlich and Prof. David D Clarke at the University of Nottingham, and a number of associate members elsewhere. We hope people interested in metaphor and metonymy (from a developmental, diachronic, cognitive, rhetorical etc. perspective) can use our web site to exchange news and views.
Metaphor in Scientific Thinking Sammanställd av Ray C. Paton, Department of Computer Science,University of Liverpool
The Forest of Rhetoric: silva rhetoricae This online rhetoric, provided by Dr. Gideon Burton of Brigham Young University, is a guide to the terms of classical and renaissance rhetoric. Sometimes it is difficult to see the forest (the big picture) of rhetoric because of the trees (the hundreds of Greek and Latin terms naming figures of speech, etc.) within rhetoric. This site is intended to help beginners, as well as experts, make sense of rhetoric, both on the small scale (definitions and examples of specific terms) and on the large scale (the purposes of rhetoric, the patterns into which it has fallen historically as it has been taught and practiced for 2000+ years).
Rhetoric and Composition This Web page is intended to list a variety of resources useful to rhetoricians. While many rhetoric and composition pages on the Web are written in conjunction with writing centers or specialize in computer-mediated communication, this page also has links to works of classical rhetoric, articles on literacy and education, and a few miscellaneous but useful things--how to suscribe to some highly-trafficked mailing lists and links to glossaries of rhetorical terms, for example.
Informationen zur Gesprächsanalyse
Webb's World Translation Resources Compiled by Lynn Web
xlation.com Webmasters: Dyran and Robert Altenburg
MACHINE TRANSLATION: An Introductory Guide Doug Arnold, Lorna Balkan, Siety Meijer, R.Lee Humphreys, Louisa Sadler Here you will find the text of the book in a Web Browsable form. A printable (PostScript) version of the book can be downloaded
American Sign Language Linguistic Research Project The ASLLRP includes investigation of the syntactic structure of ASL and development of multimedia tools to facilitate access to and analysis of primary data for sign language research. Each of these projects is currently funded by the National Science Foundation.
International Bibliography of Sign Language Compiled and managed by Guido H. G. Joachim - Siegmund Prillwitz - Thomas Hanke, Institute of German Sign Language and Communication of the Deaf University of Hamburg
Sign language sites on the World Wide Web This page is maintained by Onno Crasborn, Leiden Sign Phonology Group
Eurolang Eurolang is a new information service which aims to cover issues related to language diversity within the EU and to the development of the Europe of the regions. The objectives of Eurolang are to supply national and regional media with news of general interest about Europe's linguistic diversity. It concentrates on minority and regional language matters and news from European Institutions which affect the minority communities of Europe. Eurolang serves national media across Europe as well as minority and regional language media. It covers day-to-day news as well as supplying longer reports on issues of particular interest to European readers.
European minority (or minoritized!) languages
Foundation for Endangered Languages. links
General information on minority languages
Language futures Europé This site collects links on the language futures of Europe - on language policy, multilingualism, global language structures, and the dominance of English. It starts with a comment on the structures of language; then links to texts and essays; and then sections on EU policy, national policies, and research sites; and finally the 'monolingual movement' in the United States.
Linguistic Diversity: Terralingua's Internet Resource List on Language Endangerment, Survival, and Revitalization Terralingua maintains this inventory of institutions and organizations (academic, governmental, non-governmental (NGO), international, national, regional, local) dedicated to the study and especially the *maintenance* of the indigenous and minority languages of the world (including, but not limited to, endangered languages), as well as to fostering linguistic diversity in general.
Swedish Institute for CS (SICS), Kista
The Natural Language Software Registry (NLSR) The Natural Language Software Registry (NLSR) is a concise summary of the capabilities and sources of a large amount of natural language processing (NLP) software available to the NLP community. It comprises academic, commercial and proprietary software with specifications and terms on which it can be acquired clearly indicated.
Annotate: The NEGRA/TIGER annotation Annotate is a tool for efficient semi-automatic annotation of corpus data. It facilitates the generation of context-free structures and additionally allows crossing edges. Functions for the manipulation of such structures are provided. Terminal nodes, non-terminal nodes, and edges are labeled. In the NEGRA project, these labels are used for parts-of-speech and morphology (terminal nodes), phrase categories (non-terminal nodes), and grammatical functions (edges). Type and number of labels are defined by the user. Annotated corpora are stored in a relational database. Annotate has a specified interface for communication with external taggers and parsers.
The CHORUS demo system: Processing of underspecified representations
DORIS. Discourse Oriented Representation and Inference System by Johan Bos
The MIDAS. system: Multiple Inference-Based Dialogue Analysis System
The British National Corpus (BNC) The British National Corpus (BNC) is a 100 million word collection of samples of written and spoken language from a wide range of sources, designed to represent a wide cross-section of current British English,both spoken and written.
CHILDES: Child Language Data Exchange System The CHILDES system provides tools for studying conversational interactions. These tools include a database of transcripts, programs for computer analysis of transcripts, methods for linguistic coding,and systems for linking transcripts to digitized audio and video.
COLT. The Bergen Corpus of London Teenage Language The Bergen Corpus of London Teenage Language (COLT) is the first large English Corpus focusing on the speech of teenagers. It was collected in 1993 and consists of the spoken language of 13 to 17-year-old teenagers from different boroughs of London. The complete corpus, half a million words, has been orthographically transcribed and word-class tagged, and is a constituent of the British National Corpus. A pilot-version consisting of 151 texts is now available on the Internet. You can search for any word, collocation of words or letter combination by means of the TACTweb software. The search program can also show the distribution of an item in relation to factors such as age, sex, socioeconomic class, location etc.
Corpus Linguistics This website is maintained by Michael Barlow, Rice University. A large list of electronic corpora in several different languages, as well as links to other resources for corpus analysis. This is a linguistics-oriented, rather than literature-oriented, site.
An Electronic Corpus of Ingrian Finnish Welcome to the Joensuu-Bergen pilot project for making a corpus of spoken Ingrian Finnish available via WWW. The material presented here is from the project Language Contacts in the Northeastern Regions of the Baltic Sea, led by Professors Ilkka and Muusa Savijärvi at the University of Joensuu.
Electronic Texts and Publishing Resources. A Library of Congress Internet Resource Page
The Electronic Text Center The Electronic Text Center's holdings include approximately 51,000 on- and off-line humanities texts in twelve languages, with more than 350,000 related images (book illustrations, covers, manuscripts, newspaper pages, page images of Special Collections books, museum objects, etc.)
Electronic Text Collections in Western European Literature This page lists Internet sources for literary texts in the western European languages other than English.
English Language Corpora and Corpus resources This page lists centres and projects from which language corpora (chiefly English language) are readily available. Please let us know of any projects we've forgotten, or not yet discovered. It also includes links to resources of general interest for those working on corpus linguistics
Euralex Resources - Corpora and Dictionaries EURALEX is the European Association for Lexicography: an international association which was founded in 1983, with the aims of furthering all aspects of the broad field of lexicography, and of promoting the exchange of ideas and information
European Language Resources Association The European Language Resources Association (ELRA) was established as a non-profit organization in Luxembourg in February, 1995. The overall goal of ELRA is to provide a centralized organization for the validation, management, and distribution of speech, text, and terminology resources and tools, and to promote their use within the European telematics community
Göteborg spoken language corpora
Some of the above will be Multimodal. You will need a password to get the files.
1.Göteborg Spoken Language Corpus (Kernel Corpus - adult 1st language Swedish), 1.2 million words
(transcriptions, browser)
2.Adult language learners of Swedish (transcriptions)
3.Child language corpus (Swedish and Scandinavian), 0.5 million words including the adults (lexicon
files, Childes files)
4.Aphasic, deaf and dyslexic speakers
5.Child (3-6 years old) language corpus, 94 children, 260 000 words, Lisbeth Hedelin's material
(transcriptions)
6.Non-Swedish adult spoken language corpus
Chinese (70 000 words)
Bulgarian (25 000 words) (transcriptions)
Arabic
English, 10 000 words (transcriptions), BNC
Finnish
Italian, 3000 words (transcriptions)
Norwegian, 140 000 words (transcriptions)
Spanish
7.WOZ Corpus, Bionic
8.Intercultural communication corpus
9.Educational progress - 416 interviews, 2 million words, Kjell Härnqvist's material (transcriptions)
ICAME. International Computer Archive of Modern and Medieval English ICAME is an international organization of linguists and information scientists working with English machine-readable texts. The aim of the organization is to collect and distribute information on English language material available for computer processing and on linguistic research completed or in progress on the material, to compile an archive of English text corpora in machine-readable form, and to make material available to research institutions. The archive mentioned in the name resides at the Norwegian Computing Centre for the Humanities (NCCH) in Bergen, Norway. This acts as a distribution centre for computerized English-language corpora and corpus-related software. ICAME publishes the ICAME Journal which appears at least once a year, with articles and information about English computer corpora. There is also an electronic information service. Conferences, usually in May/June each year, have been arranged since 1979.
Linguistic Data Consortium The Linguistic Data Consortium is an open consortium of universities, companies and government research laboratories. It creates, collects and distributes speech and text databases, lexicons, and other resources for research and development purposes. The University of Pennsylvania is the LDC's host institution. The LDC was founded in 1992 with a grant from the Advanced Research Projects Agency (ARPA), and is partly supported by grant IRI-9528587 from the Information and Intelligent Systems division of the National Science Foundation.
Linguistic exploration Linguistic Exploration is a mode of investigation in (computational) linguistics involving empirical research on complex, dynamic, multimodal datasets through the combination of traditional field methods with new technologies for storing and analyzing linguistic data. The languages under study may range from the undescribed to the well-studied, and the investigator may operate in a village or a laboratory. The focus is the documentary and exploratory mode of research, generating reusable language resources, and developing new techniques for working with continually evolving datasets.
NEGRA corpus. A Syntactically Annotated Corpus of German Newspaper Texts The NEGRA corpus consists of approximately 176,000 tokens (10,000 sentences) of German newspaper text, taken from the Frankfurter Rundschau as contained in the CD "Multilingual Corpus 1" of the European Corpus Initiative. It is based on approx. 60,000 tokens that were tagged for part-of speech at the Institut für maschinelle Sprachverarbeitung, Stuttgart. This corpus was extended, tagged with part-of-speech and completely annotated with syntactic structures. The corpus was created in the projects NEGRA (DFG Sonderforschungsbereich 378, Projekt C3) and LINC (Universität des Saarlandes) in Saarbrücken.
On-line books An index of on-line literature, mainly English but with a sizable list of links to books in other languages and to specialty archives. A good starting point.
Oxford Text Archive The OTA works closely with members of the Arts and Humanities academiccommunity to collect, catalogue, and preserve high-quality electronic texts for research and teaching. The OTA currently distributes more than 2500 resources in over 25different languages, and is actively working to extend its catalogue of holdings.
Språkbanken En språklig referensdatabas vid Göteborgs Universitet . The Bank of Swedish - A Linguistic Reference Database of Göteborg University
Statistical natural language processing and corpus-based computational linguistics: An annotated list of resources Resources page for statistical NLP and corpus linguistics. Christopher Manning's thorough and lightly annotated list. Tools, Corpora, Dictionaries, Lexical/morphological resources, Courses, Syllabi, and other Educational Resources, Mailing lists, Other stuff on the Web
W3-Corpora Site These pages have been created as part of the W3-Corpora Project at the University of Essex. Lots of information about corpus linguistics including searchable corpora (more for noncommercial users who register)
Keenan, Ed The following papers are available for reading online (requires Adobe Acrobat Reader for viewing). Explaining the Creation of Reflexive Pronouns in English - May 2001. A Semantic Characterization of the Definiteness Effect - March 2001. A Quantitative Study of Voice in Malagasy - November 2000. With Cecile Manorohanta. Reciprocals in Malagasy - October 2000. With Jean Paulin Razafimamonjy. Quantification in English is Inherently Sortal - May 1999. To appear in History of Philosophy and Logic. Determiners, Adjectives and a Query of van Benthems - March 1999. Raising from NP in Malagasy - January 1998. With Baholisoa Ralalaoherivony.
CogPrints. Cognitive Sciences Eprint Archive CogPrints, an electronic archive for papers in any area of Psychology, Neuroscience, and Linguistics, and many areas of Computer Science (e.g., artificial intelligence, robotics, vison, learning, speech, neural networks), Philosophy (e.g., mind, language, knowledge, science, logic), Biology (e.g., ethology, behavioral ecology, sociobiology, behaviour genetics, evolutionary theory), Medicine (e.g., Psychiatry, Neurology, human genetics, Imaging), Anthropology (e.g., primatology, cognitive ethnology, archeology, paleontology), as well as any other portions of the physical, social and mathematical sciences that are pertinent to the study of cognition.
Johns Hopkins Cog. Sci. on-line papers
Linguistics Working Papers Directory by Cascadilla Press
Semanticarchive.net For exchanging papers of interest to natural language semanticists. Archiving a paper is not considered a form of publication, but instead is analogous to circulating a manuscript, preprint, or offprint. Therefore it is not appropriate to cite a paper as appearing on the semantics archive.
UCLA Linguistics on-line papers
University College London on-line papers
Univ. of Delaware Linguistics on-line papers & dissertations
University of Edinburgh on-line Occasional Papers in Linguistics
University of Maryland, Department of linguistics. Papers on Minimalism