Description

Book Synopsis
This book describes new methodological and technological approaches to corpus building and presents recent research based on the Norwegian Newspaper Corpus, a large monitor corpus of contemporary Norwegian compiled through daily harvesting of web newspapers. The book gives an overview of the corpus and its system architecture, and presents tools used for tasks such as text harvesting, annotation, topic classification, and the extraction and frequency profiling of new words and phrases. Among the innovative technologies is Corpuscle, a corpus query engine and management system that is flexible enough to handle very large corpora efficiently. The individual research contributions based on the corpus explore different aspects of Norwegian, including the occurrence of anglicisms, neologisms and terminology, and the use of metonymy and metaphor in newspaper language. The book also describes an innovative method of applying correspondence analysis and implicational analysis to investigate interdependencies between morphosyntactic variants.

Table of Contents
1. Building a large corpus based on newspapers from the web (by Andersen, Gisle)
2. Part I. Exploiting the web as a corpus - Methods and tools
3. Corpuscle - a new corpus management platform for annotated corpora (by Meurer, Paul)
4. OBT+stat: A combined rule-based and statistical tagger (by Johannessen, Janne Bondi)
5. Exploring corpora through syntactic annotation (by Rosen, Victoria)
6. Collocations and statistical analysis of n-grams: Multiword expressions in newspaper text (by Lyse, Gunn Inger)
7. Automatic topic classification of a large newspaper corpus (by Hagen, Thomas M.)
8. A data-driven approach to anglicism identification in Norwegian (by Losnegaard, Gyri Smordal)
9. Part II. Corpus-based case studies
10. A corpus-based study of the adaptation of English import words in Norwegian (by Andersen, Gisle)
11. Norm clusters in written Norwegian (by Dyvik, Helge)
12. Lexical neography in modern Norwegian (by Fjeld, Ruth Vatvedt)
13. Ash compound frenzy: A case study in the Norwegian Newspaper Corpus (by De Smedt, Koenraad)
14. Financial jargon in a general newspaper corpus (by Kristiansen, Marita)
15. Metonymic extension and vagueness: Schengen and Kyoto in Norwegian newspaper language (by Halverson, Sandra L.)
16. Spatial metaphors in present-day Norwegian newspaper language (by Breivik, Leiv Egil)
17. Doing historical linguistics using contemporary data (by Andersen, Oivin)
19. Subject index

Exploring Newspaper Language: Using the web to

Product form

£78.85

Includes FREE delivery

RRP £83.00 – you save £4.15 (5%)

A Hardback by Gisle Andersen

Out of stock



    Publisher: John Benjamins Publishing Co
    Publication Date: 23/03/2012
    ISBN13: 9789027203540
    ISBN10: 9027203547


    © 2025 Book Curl
