Results for 'comparable corpora'

971 found
Order:
  1.  12
    Lexicogrammar and textometrics: Identification and visualisation of typical lexico-grammatical patterns in two comparable corpora of legal French.Christopher Gledhill, Stéphane Patin & Maria Zimina - 2017 - Corpus 17.
    Nous examinons ici les schémas lexico-grammaticaux (LG) associés à trois verbes supports (« effectuer », « réaliser », « procéder ») dans deux corpus juridiques comparables (un corpus de directives européennes rédigées en français et leur transposition en France dans les textes de lois). A cette fin, nous utilisons Le Trameur, un logiciel de textométrie qui permet une comparaison systématique des données textuelles en mobilisant plusieurs méthodes statistiques exploratoires : analyse des fréquences par partie, spécificités, segments répétés, co-occurrences, poly-cooccurrences, etc. (...)
    No categories
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  2.  8
    Conversion and the lexicon: comparing evidence from corpora and experimentation.Laura Teddiman - 2012 - In Dagmar Divjak & Stefan Thomas Gries (eds.), Frequency effects in language representation. Boston: De Gruyter Mouton. pp. 235--254.
    No categories
    Direct download  
     
    Export citation  
     
    Bookmark  
  3.  18
    Discourse on climate and energy justice: a comparative study of Do It Yourself and Bootstrapped corpora.Camille Biros, Caroline Rossi & Inesa Sahakyan - 2018 - Corpus 18.
    This article offers a descriptive and analytic view of the different stages leading to the constitution of a corpus that is representative of the issues of climate and energy justice. Overall, the corpus contains over five million words and gathers reports, newsletters and web-pages dealing with the most equitable ways of moving to a low-carbon future in the aim of limiting climate change. It can be divided into six sub-corpora, according to types of discourse communities, and methods of constitution. (...)
    No categories
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark   1 citation  
  4. Comparing Methods for Single Paragraph Similarity Analysis.Benjamin Stone, Simon Dennis & Peter J. Kwantes - 2011 - Topics in Cognitive Science 3 (1):92-122.
    The focus of this paper is two-fold. First, similarities generated from six semantic models were compared to human ratings of paragraph similarity on two datasets—23 World Entertainment News Network paragraphs and 50 ABC newswire paragraphs. Contrary to findings on smaller textual units such as word associations (Griffiths, Tenenbaum, & Steyvers, 2007), our results suggest that when single paragraphs are compared, simple nonreductive models (word overlap and vector space) can provide better similarity estimates than more complex models (LSA, Topic Model, SpNMF, (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark   3 citations  
  5.  17
    Small bi- and multilingual spoken corpora – a theoretical and methodological challenge.Inga Hennecke - 2018 - Corpus 18.
    Cette contribution présente des types de petits corpus très hétérogènes, qui ne sont toujours pas clairement définis : les petits corpus oraux bilingues et plurilingues. Elle envisage d’abord de donner un aperçu de petits corpus et discute l’insécurité terminologique du domaine des corpus bi- et plurilingues. Ensuite, les bases de données de petits corpus bilingues Talkbank, HZSK, Sketchengine et CLAPOTY sont brièvement décrites pour discuter les possibilités de visualisation ainsi que de mise en disposition des données de corpus. Finalement, un (...)
    No categories
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  6.  36
    The early days of contemporary philosophy of science: novel insights from machine translation and topic-modeling of non-parallel multilingual corpora.Christophe Malaterre & Francis Lareau - 2022 - Synthese 200 (3):1-33.
    Topic model is a well proven tool to investigate the semantic content of textual corpora. Yet corpora sometimes include texts in several languages, making it impossible to apply language-specific computational approaches over their entire content. This is the problem we encountered when setting to analyze a philosophy of science corpus spanning over eight decades and including original articles in Dutch, German and French, on top of a large majority of articles in English. To circumvent this multilingual problem, we (...)
    No categories
    Direct download (3 more)  
     
    Export citation  
     
    Bookmark  
  7.  11
    A comparative analysis of English for academic purposes teachers’ interactive metadiscourse across the British and Chinese contexts.Xinxin Wu & He Yang - 2022 - Frontiers in Psychology 13.
    This exploratory research compares the interactive metadiscourse use by native English-speaking English for academic purposes writing teachers in the United Kingdom and their non-native counterparts in the Chinese contexts. The analysis is based on a self-compiled corpus, including two sub-corpora, which were composed of instructor contributions to classroom discourse: eight sessions of EAP lessons from the Chinese context and eight sessions of EAP lessons from the British context. Adopting an interpersonal model of metadiscourse, the two sub-corpora were compared (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  8.  22
    A comparative analysis of the U.S. and China’s mainstream news media framing of coping strategies and emotions in the reporting of COVID-19 outbreak on social media.Rita Gill Singh & Cindy le YaoSing Bik Ngai - 2022 - Discourse and Communication 16 (5):572-597.
    This study compares the coverage of coping strategies and emotions portrayed in news regarding COVID-19 by The New York Times in the U.S. and People’s Daily of China via social media. By employing corpus assisted discourse analysis to scrutinize the text corpora, our study uncovered prominent keywords and themes. Findings indicate that a comprehensive range of themes relating to coping strategies was more common in People’s Daily while a relatively smaller number of themes was apparent in The New York (...)
    Direct download  
     
    Export citation  
     
    Bookmark  
  9.  35
    Using corpus methodology for semantic and pragmatic analyses: What can corpora tell us about the linguistic expression of emotions?Ulrike Oster - 2010 - Cognitive Linguistics 21 (4):727-763.
    The aim of this paper is to explore some of the possibilities, advantages and difficulties of corpus-based analyses of semantic and pragmatic aspects of language in one particular field, namely the linguistic expression of emotion concepts. For this purpose, a methodological procedure is proposed and an exemplary analysis of the emotion concept “fear” in English is performed. The procedure combines Kövecses' lexical approach and Stefanowitsch's metaphorical pattern analysis with additional concepts from corpus linguistics such as semantic preference and semantic prosody. (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark   6 citations  
  10.  80
    Automatic Extraction of Property Norm‐Like Data From Large Text Corpora.Colin Kelly, Barry Devereux & Anna Korhonen - 2014 - Cognitive Science 38 (4):638-682.
    Traditional methods for deriving property-based representations of concepts from text have focused on either extracting only a subset of possible relation types, such as hyponymy/hypernymy (e.g., car is-a vehicle) or meronymy/metonymy (e.g., car has wheels), or unspecified relations (e.g., car—petrol). We propose a system for the challenging task of automatic, large-scale acquisition of unconstrained, human-like property norms from large text corpora, and discuss the theoretical implications of such a system. We employ syntactic, semantic, and encyclopedic information to guide our (...)
    Direct download  
     
    Export citation  
     
    Bookmark  
  11.  25
    Complexity in organoleptic paths of motion in the genre of craft beer reviews: a comparative study of Spanish and English.David Clarke - 2019 - Dissertation, Dublin City University
    The study of how languages differ in their portrayal of motion events has received much attention since Talmy provided the first detailed account of the phenomenon. Interest has extended from real, or factive motion, to imagined or fictive motion, and from there to metaphorical motion, in which experience in one sensory domain is understood in terms of motion. Studies of metaphorical motion have, however, concentrated so far on a limited number of sensory domains, principally vision, and drawn data from a (...)
    No categories
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  12. Managing Noise in Part-of-Speech Tagging for Extremely Low-Resource Languages: Comparing Strategies for Corpus Collection and Annotation in Dagur and Alsatian.Delphine Dolińska Bernhard - 2025 - Corpus 26 (26).
    Although Dagur and Alsatian represent two typologically distant language families, they share several similarities: both languages are endangered, do not have a unified spelling system, and have few available digital corpora. Given these challenges, the main aim of this article is to compare the noise in corpora for these languages and its impact on part-of-speech (POS) annotation and tagging. We first discuss what strategies can be used to reduce the noise due to spelling inconsistencies observed during corpus collection, (...)
    No categories
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  13.  13
    A Comparative Corpus-Based Study on the Political Discourse of the U.S. Presidents: Obama and Trump.Arta Toçi & Enes Ismeti - 2022 - Seeu Review 17 (2):71-86.
    The aim of this research is to analyze the political discourse and the language they used in public addresses provided by two former presidents of the United States, specifically, President Barak Obama and President Donald Trump. This research reflects on the material that has been collected for several months which aimed to contribute on the analysis of the corpus, distinctions and similarities, as well as their attitudes towards the public opinion. The objectives of this study are mainly empirical and theoretical. (...)
    No categories
    Direct download  
     
    Export citation  
     
    Bookmark  
  14.  50
    In Quest of Sufficient Equivalence. Polish and English Insolvency Terminology in Translation. a Comparative Study.Aleksandra Matulewska - 2014 - Studies in Logic, Grammar and Rhetoric 38 (1):167-188.
    The paper deals with the problem of translating selected insolvency terminology from Polish into English and from English into Polish. The re- search corpora encompassed the Insolvency Act 1986 as amended and Ustawa z dnia 28 lutego 2003. Prawo upadłościowe i naprawcze [the Act on Polish Insolvency and Rehabilitation Law of 28th February 2003 as amended]. The research methods included: the comparison of parallel texts, the method of axiomatisation of the legal linguistic reality, the termino- logical analysis of the (...)
    Direct download (4 more)  
     
    Export citation  
     
    Bookmark   6 citations  
  15.  53
    (1 other version)Examining evaluativity in legal discourse: a comparative corpus-linguistic study of thick concepts.Pascale Willemsen, Lucien Baumgartner, Severin Frohofer & Kevin Reuter - 2023 - In Stefan Magen & Karolina Prochownik (eds.), Advances in Experimental Philosophy of Law. New York, NY: Bloomsbury Academic. pp. 192-214.
    How evaluative are legal texts? Do legal scholars and jurists speak a more descriptive or perhaps a more evaluative language? In this paper, we present the results of a corpus study in which we examined the use of evaluative language in both the legal domain as well as public discourse. For this purpose, we created two corpora. Our legal professional corpus is based on court opinions from the U.S. Courts of Appeals. We compared this professional corpus to a public (...)
    No categories
    Direct download (4 more)  
     
    Export citation  
     
    Bookmark   3 citations  
  16.  33
    Primary metaphors: Importance as size and weight in a comparative perspective.Ning Yu, Lu Yu & Yue Christine Lee - 2017 - Metaphor and Symbol 32 (4):231-249.
    This is a linguistic study of two primary metaphors with the same target-domain concept, “IMPORTANCE IS SIZE” and “IMPORTANCE IS WEIGHT,” in English and Chinese. It is suggested that these two primary metaphors are derived from the OBJECT image schema, abstracted from our embodied, sensorimotor experience, especially our visual and tactile perception, in dealing with physical objects in everyday life. The study focuses on size and weight adjectives in both languages and on linguistic evidence in two areas: their lexicalizations of (...)
    No categories
    Direct download (5 more)  
     
    Export citation  
     
    Bookmark   1 citation  
  17. From Book to Text: Towards a Comparative History of Philologies.Christian Jacob & Juliet Vale - 1999 - Diogenes 47 (186):4-22.
    Our methods of research, duly elaborated hereafter, would benefit from being applied to the realm of the East. For that matter, the examination of Syriac, Armenian, Coptic or Arabic manuscripts does not differ in the least from that of a Greek or Latin manuscript. The rules developed by classical philologists are just as valid for the study of the Maxims of Phtahhotep and the Precepts of Kagemeni…Alphonse Dain (1975), Les Manuscrits (Paris, Les Belles Lettres)One of the objects of a comparative (...)
    No categories
    Direct download (3 more)  
     
    Export citation  
     
    Bookmark  
  18.  15
    Online disagreement in WhatsApp groups: A comparative study of Spanish family members and work colleagues.Lucía Fernández-Amaya - 2021 - Discourse and Communication 15 (5):542-558.
    The purpose of this paper is to compare disagreement in two different WhatsApp groups: one for members of the same family, and another for work colleagues. After the analysis, 427 instances of disagreement were identified in the family group, and 161 in the interactions between work colleagues. The most common strategy in both corpora is ‘Giving opposite opinions’. Nevertheless, the rest of the results present very significant dissimilarities, most notably the higher presence of disagreement in the family WhatsApp group. (...)
    No categories
    Direct download  
     
    Export citation  
     
    Bookmark  
  19.  9
    À la recherche de la valeur causale des conjonctions as et since dans de grands corpus, dans le but de les comparer.Bénédicte Guillaume - 2018 - Corpus 18.
    This article deals with English causal subordinate clauses introduced by since or as. Both these markers may convey different meanings according to contextual variations, and can express temporal relations, from which their causal value is derived. The semantic closeness between as and since whenever they express a causal relation makes it necessary to harvest a large number of attested examples in order to compare and contrast them. The recourse to large on-line corpora such as the British National Corpus gives (...)
    No categories
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  20.  23
    Framing Gender in the Coverage of Protests: Arab Women’s Uprisings in English and German Press.Zahra Mustafa-Awad, Majdi Sawalha, Monika Kirner-Ludwig & Duaa Tabaza - 2023 - International Journal for the Semiotics of Law - Revue Internationale de Sémiotique Juridique 36 (6):2501-2521.
    We report on the first stage of a project on the representations of gender in the coverage of the Arab Spring by Western media. We focus on designing comparable corpora to examine Arab women’s depiction in English and German news during the uprisings. The English corpus is composed of reports published by _The Guardian and The New York Times_. The German corpus consists of articles collected from _Der Spiegel, Die Welt_, _Die Zeit, Frankfurter Allgemeine Zeitung,_ and _Süddeutsche Zeitung_. (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  21.  43
    Using semantic tagging to examine the American Dream and the Chinese Dream.Zhide Hou - 2019 - Semiotica 2019 (227):145-168.
    This paper uses Wmatrix to generate semantic tagging to compare corpora of media representations between the American Dream and the Chinese Dream. The USAS tagger is used to assign the semantic field tags to the America Dream Corpus and the Chinese Dream Corpus. The motivation of this study is to replicate the studies using an automated and inclusive method based on semantic tagging. 295–324), and more importantly, to conduct a broad semantic categorization on both national dreams so as to (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  22.  17
    Discourses about independence: A corpus-based analysis of discourse prosodies in Spanish and Catalan newspapers.Marcello Giugliano - 2022 - Discourse and Communication 16 (5):525-550.
    The present study is a corpus-assisted analysis of discourses on the Catalan movement of independence in the Catalan and Spanish press. A referendum for Catalan independence was held on 1 October 2017 but was not approved by the central Spanish government and was declared illegal. Spanish and Catalan newspapers differed considerably in their treatment and representation of the conflict, which also drew the attention of the international press to Catalonia. My analysis interconnects theoretical perspectives from linguistic critical discourse studies, cognitive (...)
    No categories
    Direct download  
     
    Export citation  
     
    Bookmark  
  23.  33
    Genre Variation and Changes in Frame Sequences Across Cultures: The Case of Criminology RA Abstracts in English and French.Silvia Cavalieri & Chiara Preite - 2017 - Studies in Logic, Grammar and Rhetoric 49 (1):37-53.
    Though not as widely studied as the Research Article, the abstract has attracted increasing interest among researchers over last decades. A number of contrastive or comparative studies of abstracts in English and other languages have already been carried out considering mainly the hard sciences and some soft sciences such as linguistics and history, however no cross-cultural analyses have been conducted so far between RA abstracts in English and RA abstracts in French published in the legal field. This paper seeks to (...)
    No categories
    Direct download (4 more)  
     
    Export citation  
     
    Bookmark  
  24.  27
    Making Referents Seen and Heard Across Signed and Spoken Languages: Documenting and Interpreting Cross-Modal Differences in the Use of Enactment.Sébastien Vandenitte - 2022 - Frontiers in Psychology 13:784339.
    Differences in language use and structures between signed and spoken languages have often been attributed to so-called language “modality.” Indeed, this is derived from the conception that spoken languages resort to both the oral-aural channel of speech and the visual-kinesic channel of visible bodily action whereas signed languages only resort to the latter. This paper addresses the use of enactment, a depictive communicative strategy whereby language users imitate referents in signed and spoken languages. Reviewing comparative research on enactment, this paper (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  25. A Corpus Study of "Know": On the Verification of Philosophers' Frequency Claims about Language.Nat Hansen, J. D. Porter & Kathryn Francis - 2019 - Episteme 18 (2):242-268.
    We investigate claims about the frequency of "know" made by philosophers. Our investigation has several overlapping aims. First, we aim to show what is required to confirm or disconfirm philosophers’ claims about the comparative frequency of different uses of philosophically interesting expressions. Second, we aim to show how using linguistic corpora as tools for investigating meaning is a productive methodology, in the sense that it yields discoveries about the use of language that philosophers would have overlooked if they remained (...)
    Direct download (9 more)  
     
    Export citation  
     
    Bookmark   16 citations  
  26.  55
    Social Media and Language Processing: How Facebook and Twitter Provide the Best Frequency Estimates for Studying Word Recognition.Herdağdelen Amaç & Marelli Marco - 2017 - Cognitive Science 41 (4):976-995.
    Corpus-based word frequencies are one of the most important predictors in language processing tasks. Frequencies based on conversational corpora are shown to better capture the variance in lexical decision tasks compared to traditional corpora. In this study, we show that frequencies computed from social media are currently the best frequency-based estimators of lexical decision reaction times. The results are robust and are still substantial when we control for corpus size.
    Direct download (3 more)  
     
    Export citation  
     
    Bookmark   2 citations  
  27.  13
    Constructions as Resources for Interaction: Lists in English and Spanish Conversation.Ivo Sánchez-Ayala - 2003 - Discourse Studies 5 (3):323-349.
    The comparative study of synchronic occurrences of a particular construction in interactional corpora from different languages can provide a complete understanding of the development of the construction as a response to some basic intimately associated set of interactional moves and cognitive demands. A comparative formal analysis of lists in Spanish and English conversations shows that in both languages the construction has developed a stylized intonation pattern based on the holding of tones by means of lengthening of nuclear syllables, a (...)
    No categories
    Direct download  
     
    Export citation  
     
    Bookmark   3 citations  
  28.  16
    Sadness Expressions in English and Chinese: corpus linguistic contrastive semantic analysis.Ruihua Zhang - 2014 - New York: Bloomsbury Academic.
    This book reports on the contrastive-semantic investigation of sadness expressions between English and Chinese, based on two monolingual general corpora and a parallel corpus. The exploration adopts a unique theoretical approach which integrates corpus-linguistic theories on meaning (as a social construct, usage and paraphrase) with a corpus-linguistic lexical model. It employs a new complex but workable methodology which combines computational tools with manual examination to tease meaning out of corpus evidence, to compare and contrast lexical items that do not (...)
    Direct download  
     
    Export citation  
     
    Bookmark  
  29.  1
    Terminologie de la couleur bleue en diachronie longue selon Google Livres Ngram Viewer.Agnieszka K. Kaliska - 2025 - Corpus 26 (26).
    The digitization of ancient texts and tools for processing written corpora now make it possible to refine linguistic research in long diachrony. The aim of this analysis is to use the Ngram Viewer to study changes in the frequency of blue and its shades (studied through various terms, names of colors, pigments and dyes, e.g. bleu céleste, bleu roi, bleu de Paris, safre) in the French sub-collection of the Google Books corpus, which includes works published between 1500 and 2022. (...)
    No categories
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  30.  9
    Things that Place Names Do.Carola Lentz & Matthias Egeler - 2022 - Anthropos 117 (2):453-466.
    Drawing on corpora from West Africa and Iceland, the article presents a fieldwork-based comparative exploration of ‘things that place names do’. Treating toponyms as performative elements of culture, we have observed striking parallels as well as differences in the uses of place names in both regions. Place names communicate spatial orientation; play an important role in the commemoration of people and events; mark claims of possession; support the construction of identity; sacralize landscapes; and voice moral reprimands. They can provide (...)
    No categories
    Direct download  
     
    Export citation  
     
    Bookmark  
  31.  23
    Polish, Greek and Cypriot Civil Procedure Terminology in Translation. A Parametric Approach.Karolina Gortych-Michalak - 2017 - Studies in Logic, Grammar and Rhetoric 49 (1):73-88.
    The paper discusses the problem of translating selected Civil Procedure terminology from Greek into Polish and from Polish into Greek. The research material includes corpora of normative acts and more precisely those, which regulate Civil Procedure of Poland, Greece and the Republic of Cyprus. The research methodology is based on the concept of parameterisation, according to which the legal linguistic reality becomes axiomatic. Then the set of relevant dimensions and parameters is extracted. The set of parameters are a tool (...)
    No categories
    Direct download (4 more)  
     
    Export citation  
     
    Bookmark  
  32.  74
    Recurrent neural network-based models for recognizing requisite and effectuation parts in legal texts.Truong-Son Nguyen, Le-Minh Nguyen, Satoshi Tojo, Ken Satoh & Akira Shimazu - 2018 - Artificial Intelligence and Law 26 (2):169-199.
    This paper proposes several recurrent neural network-based models for recognizing requisite and effectuation parts in Legal Texts. Firstly, we propose a modification of BiLSTM-CRF model that allows the use of external features to improve the performance of deep learning models in case large annotated corpora are not available. However, this model can only recognize RE parts which are not overlapped. Secondly, we propose two approaches for recognizing overlapping RE parts including the cascading approach which uses the sequence of BiLSTM-CRF (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark   10 citations  
  33.  38
    Thirty years of Artificial Intelligence and Law: the second decade.Giovanni Sartor, Michał Araszkiewicz, Katie Atkinson, Floris Bex, Tom van Engers, Enrico Francesconi, Henry Prakken, Giovanni Sileno, Frank Schilder, Adam Wyner & Trevor Bench-Capon - 2022 - Artificial Intelligence and Law 30 (4):521-557.
    The first issue of Artificial Intelligence and Law journal was published in 1992. This paper provides commentaries on nine significant papers drawn from the Journal’s second decade. Four of the papers relate to reasoning with legal cases, introducing contextual considerations, predicting outcomes on the basis of natural language descriptions of the cases, comparing different ways of representing cases, and formalising precedential reasoning. One introduces a method of analysing arguments that was to become very widely used in AI and Law, namely (...)
    Direct download (3 more)  
     
    Export citation  
     
    Bookmark   4 citations  
  34.  23
    Visual and Affective Multimodal Models of Word Meaning in Language and Mind.Simon De Deyne, Danielle J. Navarro, Guillem Collell & Andrew Perfors - 2021 - Cognitive Science 45 (1):e12922.
    One of the main limitations of natural language‐based approaches to meaning is that they do not incorporate multimodal representations the way humans do. In this study, we evaluate how well different kinds of models account for people's representations of both concrete and abstract concepts. The models we compare include unimodal distributional linguistic models as well as multimodal models which combine linguistic with perceptual or affective information. There are two types of linguistic models: those based on text corpora and those (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark   6 citations  
  35.  20
    Deep Text Mining for Automatic Keyphrase Extraction from Text Documents.Muhammad Abulaish, Jahiruddin & Lipika Dey - 2011 - Journal of Intelligent Systems 20 (4):327-351.
    Due to existence of a huge amount of textual data either on the World Wide Web or in textual databases like PubMed, the development of novel automatic keyphrase extraction methods has emerged as one of the key research problems in recent past. Consequently, a number of machine learning techniques, mostly supervised, have been proposed to extract keyphrases from text documents. But, one of the main bottlenecks that hinders the success of such systems is the requirement of annotated corpora for (...)
    No categories
    Direct download  
     
    Export citation  
     
    Bookmark  
  36.  60
    Bootstrapping the lexicon: a computational model of infant speech segmentation.Eleanor Olds Batchelder - 2002 - Cognition 83 (2):167-206.
    Prelinguistic infants must find a way to isolate meaningful chunks from the continuous streams of speech that they hear. BootLex, a new model which uses distributional cues to build a lexicon, demonstrates how much can be accomplished using this single source of information. This conceptually simple probabilistic algorithm achieves significant segmentation results on various kinds of language corpora - English, Japanese, and Spanish; child- and adult-directed speech, and written texts; and several variations in coding structure - and reveals which (...)
    Direct download (5 more)  
     
    Export citation  
     
    Bookmark   12 citations  
  37.  18
    Vocabulary Demands of Informal Spoken English Revisited: What Does It Take to Understand Movies, TV Programs, and Soap Operas?Hung Tan Ha - 2022 - Frontiers in Psychology 13.
    The article presents a methodological update on the lexical profile of informal spoken English with the emphasis on movies, television programs, and soap operas. The study analyzed Mark Davies’s mega-corpora with data containing approximately 625 million words and employed Paul Nation’s comprehensive and up-to-date British National Corpus/Corpus of Contemporary American English wordlists. Data from the analyses showed that viewers would need a vocabulary knowledge at 3,000 and 5,000 words frequency levels to understand 95 and 98% of the words in (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark   2 citations  
  38.  28
    An integrated explicit and implicit offensive language taxonomy.Barbara Lewandowska-Tomaszczyk, Anna Bączkowska, Chaya Liebeskind, Giedre Valunaite Oleskeviciene & Slavko Žitnik - 2023 - Lodz Papers in Pragmatics 19 (1):7-48.
    The current study represents an integrated model of explicit and implicit offensive language taxonomy. First, it focuses on a definitional revision and enrichment of the explicit offensive language taxonomy by reviewing the collection of available corpora and comparing tagging schemas applied there. The study relies mainly on the categories originally proposed by Zampieri et al. (2019) in terms of offensive language categorization schemata. After the explanation of semantic differences between particular concepts used in the tagging systems and the analysis (...)
    No categories
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark   7 citations  
  39. Perspective-taking and intersubjectivity in oral narratives of people with a schizophrenia diagnosis: a cognitive linguistic viewpoint analysis.José Sanders, Simon A. Claassen, Kobie van Krieken & S. Linde van Schuppen - 2023 - Cognitive Linguistics 34 (2):197-229.
    Disruptions in theory of mind faculties and the ability to relate to an intersubjective reality are widely thought to be crucial to schizophrenic symptomology. This paper applies a cognitive linguistic framework to analyze spontaneous perspective-taking in two corpora of stories told by people with a schizophrenia diagnosis. We elicited natural narrative language use through life story interviews and a guided storytelling task and analyzed the linguistic construal of viewpoint in these stories. For this analysis, we developed a reliable and (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  40.  28
    Converging evidence: Bringing together experimental and corpus data on the association of verbs and constructions.Stefan Th Gries, Beate Hampe & Doris Schönefeld - 2005 - Cognitive Linguistics 16 (4):635-676.
    Much recent work in Cognitive Linguistics and neighbouring disciplines has adopted a so-called usage-based perspective in which generalizations are based on the analysis of authentic usage data provided by computerized corpora. However, the analysis of such data does not always utilize methodological findings from other disciplines to avoid analytical pitfalls and, at the same time, generate robust results. A case in point is the strategy of using corpus frequencies. In this paper, we take up a recently much debated issue (...)
    Direct download (3 more)  
     
    Export citation  
     
    Bookmark   10 citations  
  41.  20
    Explorations into the social contexts of neologism use in early English correspondence.Tanja Säily, Eetu Mäkelä & Mika Hämäläinen - 2018 - Pragmatics and Cognition 25 (1):30-49.
    This paper describes ongoing work towards a rich analysis of the social contexts of neologism use in historical corpora, in particular the Corpora of Early English Correspondence, with research questions concerning the innovators, meanings and diffusion of neologisms. To enable this kind of study, we are developing new processes, tools and ways of combining data from different sources, including the Oxford English Dictionary, the Historical Thesaurus, and contemporary published texts. Comparing neologism candidates across these sources is complicated by (...)
    Direct download (3 more)  
     
    Export citation  
     
    Bookmark   3 citations  
  42.  12
    Cultural Conceptualization of Congratulatory Happy Events in British English and Turkish: A Cross-Cultural Perspective.Hümeyra Can & Çiler Hatipoğlu - 2023 - Journal of Cognition and Culture 23 (3-4):289-309.
    This study investigates the cultural conceptualization of congratulatory happy events in British English and Turkish and discusses them cross-culturally. A lexical search was carried out in various corpora from the newspaper genre using the verbs congratulate in English and its dictionary counterparts tebrik etmek and kutlamak in Turkish along with their various lexical forms, which not only report the act of congratulating but also perform it. The results of the study showed that there were cultural differences and similarities in (...)
    No categories
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  43.  23
    Culturally Immersed Legal Terminology on the Example of Forest Regulations in Poland, The United Kingdom, The United States of America and Germany.Paula Trzaskawka & Joanna Kic-Drgas - 2021 - International Journal for the Semiotics of Law - Revue Internationale de Sémiotique Juridique 34 (5):1483-1513.
    The importance of forests is reflected in the national forest legislation which has been developed and implemented in European countries over recent years. Due to regional and national specificities, forest regulations include culturally immersed terms specific to the described area. The aim of this paper is to analyses the culturally driven legal terms existing in specific legal regulations concerning forestry in Germany, the United Kingdom, the United States of America and Poland, and identify possible ways of translating them. In order (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark   1 citation  
  44.  18
    Warning, or Manipulating in Pandemic Times? A Critical and Contrastive Analysis of Official Discourse Through the English and Spanish News.María Ángeles Orts & Chelo Vargas-Sierra - 2022 - International Journal for the Semiotics of Law - Revue Internationale de Sémiotique Juridique 35 (3):903-935.
    Focusing on media discourse and adopting a Critical Discourse Analysis—linguistic and rhetorical—perspective, this paper explores the role of the media in influencing citizens’ behaviour towards the COVID-19 crisis. The paper evaluates the set of potentially persuasive lexical items and emotional implicatures used by two quality newspapers, i.e. The Guardian and El País, to report on the pandemic during the three waves—the periods between the onset and trough of virus contamination—that occurred until March 2021. A representative, ad-hoc, comparable corpus was (...)
    Direct download (3 more)  
     
    Export citation  
     
    Bookmark  
  45.  16
    ‘Lose weight, save the NHS’: Discourses of obesity in press coverage of COVID-19.Gavin Brookes - 2022 - Critical Discourse Studies 19 (6):629-647.
    This article examines the discourses that are used by the British press to represent obesity in its coverage of coronavirus disease 2019 (COVID-19). Obesity is understood to be a risk factor for COVID-19, with people with obesity being more likely to die from the virus. This study adopts a corpus-based approach to Critical Discourse Studies and utilises a novel approach to keyword analysis, based on comparing analysis corpora against two reference corpora in order to yield keywords that are, (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark   2 citations  
  46.  9
    The Relevance of Defixiones Iudiciariae for Early Greek Rhetoric (and Vice Versa).Jan Rothkamm - 2017 - Hermes 145 (1):113-117.
    More than a century after the publication of the two main corpora of curse tablets by Richard Wünsch and Auguste Audollent,1 defixiones are now a relatively well-established field of study. New collections, finding lists, and monographs have appeared over the last twenty-five years.2 Still, the relation to the Near Eastern sources, although noticed from the very beginning,3 has never been explored in full. Nor have there been many attempts to use the text on the tablets for investigations that go (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  47.  29
    Knowledge-How Attribution in English and Japanese.Shun Tsugita, Yu Izumi & Masaharu Mizumoto - 2021 - In Karyn L. Lai (ed.), Knowers and Knowledge in East-West Philosophy: Epistemology Extended. Springer Nature. pp. 63-90.
    This chapter presents two cross-linguistic studies of knowledge-how attributions that compare English and Japanese speakers. The first study investigates the felicity judgements of ordinary people about knowledge-how sentences, where we find a large difference in judgements about the sentences in which a person lacks an ability to perform a certain action but is nevertheless attributed the relevant knowledge of how to perform that action. The second study investigates the frequency of the natural occurrences of knowing-how constructions in English and Japanese (...)
    Direct download  
     
    Export citation  
     
    Bookmark   1 citation  
  48.  23
    Assessing Classification Reliability of Conditionals in Discourse.Alex Reuneker - 2023 - Argumentation 37 (3):397-418.
    Conditional constructions (if–then) enable us to express our thoughts about possible states of the world, and they form an important ingredient for our reasoning and argumentative capabilities. Different types and argumentative uses have been distinguished in the literature, but their applicability to actual language use is rarely evaluated. This paper focuses on the reliability of applying classifications of connections between antecedents and consequents of conditionals to discourse, and three issues are identified. First, different accounts produce incompatible results when applied to (...)
    Direct download (3 more)  
     
    Export citation  
     
    Bookmark  
  49.  23
    Reconsidering subjectification from the perspective of animal signalling.Nikolaus Ritt, Andreas Baumann, Eva Zehentner & Alexandra Zöpfl - 2020 - Evolutionary Linguistic Theory 2 (2):138-152.
    This paper discusses the view that subjectifications are primarily motivated by speakers’ need for self-expression. Approaching the issue from the perspective of animal signalling, we propose that semantic subjectifications are at least equally likely to reflect evaluations and attitudes read into utterances by listeners who attempt to read speakers’ minds. We compare speaker-based and listener-based theories with regard to their predictions, sketch ways in which they can be tested and report findings from first attempts at doing so. First, we report (...)
    Direct download (3 more)  
     
    Export citation  
     
    Bookmark  
  50. A Corpus-based Cognitive Linguistic Analysis of Pre-existing Knowledge of Scientific Terminology: The Case of English Energy and Arabic طَاقَة (ṭāqa).Hicham Lahlou - 2020 - Arab World English Journal for Translation and Literary Studies 4 (1):3-13.
    The present paper aims to broaden the current understanding of students’ misconception of scientific terminology by identifying the gaps between Arabic and English scientific terminologies and between everyday language and scientific language. The paper compares the polysemy, prototypes, and motivating factors of English energy with those of Arabic طَاقَة (ṭāqa), with more focus on students’ prior knowledge. The study employs Lakoff’s (1987) idealized cognitive models and Rosch’s (1975) prototype theory to reveal the radial members of both categories, i.e., energy and (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark   1 citation  
1 — 50 / 971