3 Collecting Frozen Sentences Many frozen sentences, especially those that are most usual or most obviously idiomatic, have already been collected both in general and in specialized dictionaries of ‘idioms’.In these dictionaries, frozen sentences are usually undistinguished from other types of multiword expressions, such as compound nouns, adverbs, prepositions and conjunctions; proverbs, etc.In order to build this Lexicon-Grammar of frozen sentences of European Portuguese, several sources were used, including (general and specialized) dictionaries, and these were completed with information retrieved from newspapers, magazines, the web, etc. and our knowledge as native speakers of Portuguese. Basically, Mello (1986), Santos (1990), Simões (1993), Moreira (1996), and Neves (2000).The electronic dictionary of frozen sentences of Brazilian Portuguese (Vale, 2001) was also consulted, but many of those sentences either do not exist if European Portuguese or else they present substantial syntactical and lexical differences, so that a detailed comparative study is in order. Many sentences were checked against corpora of different nature, using the Linguateca resources (www.linguateca .pt), and web browsers.
4 Defining Frozen Sentences no easy definitionheterogeneous expressionsFrozen sentences are elementary sentences where the main verb and at least one of its argument noun-phrases are distributionally constraint, and usually the global meaning of the expression cannot be calculated from the individual meaning of its component elements when they are used independently(M. Gross 1982, 1989, 1996; G. Gross 1996; Baptista et al 2003; Ranchhod 2003).
5 For that reason, the whole expression must be taken as a complex, multiword lexical unit: (1) O João fechou-se em copas(lit:John closed himself in hearts; John isolated himself)the verb-object combination (fechar-copas) is frozen. One cannot replace copas (hearts) by other noun of the same lexical paradigm:* O João fechou-se em (espadas + paus + ouros)(lit: John closed himself in spades + clubs + diamonds)
6 Classification of Frozen Sentences The formal framework of M. Gross (1982, 1989, 1996) was adopted to classify frozen sentences.The classification is based on sentence structure, number and type of noun phrases (NP) attached to the main verb, their frozen (C) or free nature (N), as well as the syntactic properties of the construction.Table 1 (next) shows some formal classes, their internal structure, an illustrative example, and the approximate number of sentences collected so far.See Leclère 2002, for an updated overview of the current status of French Lexicon-Grammar
7 Table 1. Classification of frozen sentences (extract) StructureExampleSizeC1N0 V C1O Pedro matou a galinha dos ovos de ouro800CANN0 V (C de N)1 = C1 a N2O Pedro arrefeceu os ânimos (de Maria =à Maria)200CDNN0 V (C de N)1O Pedro queria a cabeça da Maria100CP1N0 V Prep C1O Pedro bateu com a porta900CPNN0 V Prep (C de N)1O Pedro foi aos cornos do JoãoC1PNN0 V C1 Prep N2O Pedro arrastou a asa à Maria400CNP2N0 V N1 Prep C2O Pedro tirou o relógio do prego350C1P2N0 V C1 Prep C2O Pedro deitou mãos à obraCPPN0 V Prep C1 Prep C2O Pedro foi de cavalo para burroCPPNN0 V C1 Prep C2 Prep C3O Pedro deitou o bebé fora com a água do banho50Total3,500Frozen sentences with sentential subjects (C0Q, C5) or objects (C6), or with frozen subject noun phrases (C0) were not considered in this paper. N and C stand for free or frozen noun phrases, respectivelly; N0 is the subject, N1 , N2 and N3 the first, second and third complement; V is the verb and Prep a preposition.
8 Classification of Frozen Sentences Compared with figures reported for other languages:French (+20,000; M.Gross 1996),Spanish (3,500; Mogorrón-Huerta 2002),Modern Greek (4,500; Fotopoulou 1993)Brazilian Portuguese (3,500; Vale 2001),it is clear that our own lists are still far from complete and they should, in fact, be completed, probably using other corpus-based methods for lexical acquisition (McKeown & Rodev 2000, Mutsimoto 2003).
9 Format of DictionaryThe Lexicon-Grammar may be viewed as an electronic dictionary of frozen sentences.The electronic dictionary is composed of several matrices, one per formal class.In these matrices, each line is a frozen sentence and the columns contain the lexical elements of the sentence and their syntactic (distributional and transformational) proper-ties.The set of matrices constitutes the lexicon-grammar of frozen sentences.
10 Table 2. Class CPN (extract) N0=:NhumN0=:N-humVVseNegObrigPrepDetCN1=: NhumN1=: N-humde N = a Nde N = PossExample+-<acabar>comaraçaO Pedro acabou com a raça da Maria<atirar>ospésO Pedro atirou-se aos pés da Maria<chegar>calcanharesO Pedro não chega aos calcanhares da Maria<cortar>emcasacaO Pedro cortava na casaca da Maria<ir>astrombasO Pedro foi às trombas do JoãocantigaO Pedro foi na cantiga da MariacaraA Maria foi à cara do Pedro<pegar>deixaO Pedro pegou na deixa da Maria<rir>O Pedro riu na cara da MariaO Pedro riu-se na cara da Maria<sair>deopeloO salário sai-lhe do pelo<subir>cabeçaA fama subiu à cabeça do Pedro<viver>sombraO Pedro vive na sombra da MariaTable 2. Class CPN (extract)
11 distributional constraints on free NP (e.g. ±Nhum) Syntactic Propertiesdistributional constraints on free NP (e.g. ±Nhum)intrinsically reflexive constructions (Vse): reflex Pro cannot be zeroed nor replaced by free NP (not referent to the subject)(2) O Pedro atirou-se aos pés da Maria(lit: Peter threw himself to the feet of Mary, ‘Peter humbled himself before Mary)*O Pedro atirou (E + o João) aos pés da Maria(lit: Peter threw E/John to the feet of Mary)
12 Syntactic Properties (continued) obligatory negation (NegObrig):(3) O Pedro não chega aos calcanhares da Maria(lit: Peter does not get to the heels of Mary)‘Peter is not a match for Mary’?*O Pedro chega aos calcanhares da Maria(lit: Peter gets to the heels of Mary)
13 Syntactic Properties (continued) dative NP restructuring (Leclère 1995):(4a) O Pedro foi às trombas do João = ao João(lit: Peter went to_the snouts of/to John, ‘Peter beat John’)(4b) O Pedro foi-lhe (= ao João) às trombas.but in some sentences with the a similar syntactic structure, the reduction to a dative Pro is not possible:(5) O Pedro foi na cantiga do João/ *ao João/ *-lhe(lit: Peter went in_the song of John)‘Peter was persuaded by John’s ill-intended words’
14 Syntactic Properties (continued) reduction of free NP to an oblique pronoun and reduction of de N to a possessive pronoun:(5) O Pedro foi na cantiga do João [Pro_Obl]= O Pedro foi na cantiga dele [Pro_Pos]= O Pedro foi na sua cantigabut in some cases, the reduction to a possessive is blocked:(4) O Pedro foi às trombas do João [Pro_Obl]= ? O Pedro foi às trombas dele [Pro_Pos]= *? O Pedro foi às suas trombas.
15 Syntactic Properties (continued) Other relations: Conversion-like transformation (G.Gross 1989; Baptista 1997)(6) O Pedro foi às trombas ao João (CP1)(lit: Peter went to the snouts to John, ‘Peter beat John’)= (7) O João apanhou nas trombas do Pedro (CPP)(lit: John got on the snouts from Peter, ‘John was beaten by Peter’)- permutation of free NP around frozen elements;- replacement of V ir (active) by apanhar (passive);- change of Prep1
16 - similar variants of V as in Vsup Npred entering the Conversion transformation (G. Gross 1989; described for E_PT by Baptista 1997, among others)(8) O João levou/comeu nas trombas do Pedro(lit: John took/ate on the snouts from John, ‘John was beaten by Peter’)
17 Free vs frozen sentences (Modes of freezing) In many frozen sentences, V presents the same syntactic structure as in free sentencesN0 desaparecer de Nloc (‘to disappear from’)(9) O João desapareceu do mapa(lit: John disappeared from the map, ‘John went away/escaped’)Many sentences, however, show structures with frozen complements unrelated to the basic structure(s) of their free constructions:(10) Esta aldeia não vem no mapa (lit:This village does not come on the map, ‘it is not very important’)NB: verb vir/‘to come’ does not accept a em Nloc complement
18 Often, these have an adverbial-like status: locative: Sentences with frozen prepositional complements (CP1,CPN, CPP), in particular, show frozen complements of very diverse syntactic and semantic natureOften, these have an adverbial-like status:locative:(11) O João veio a terreiro <dizer Que_F>(lit.: John came to yard <to say that_S>‘John went public <to say that_S>’)causative:(12) O João pagou pela língua(lit.: John paid for the tongue, ‘John was punished for saying something’)
19 Classification problems complex sentencesfrozen modifiers(13) O João voltou à vaca fria (CP1) (lit. John returned to the cold cow, ‘return to a difficult subject/problem’) *O João voltou à vaca que (era/estava) fria */?* A vaca (era/estava) friaLG formalism (tables) integrate these frozen modifiers, but other types of complexity can hardly be represented in this fashion
20 Classification problems (continued) frozen subordinate clauses- relative clausesO João comeu o pão que o Diabo amassou/ O João comeu um pão#o Diabo amassou esse pão
21 Classification problems (continued) frozen adverbs(14) O João nasceu com o rabo virado para a lua (CP1) (lit: John was born with the bottom turned to the moon John was always very fortunate in everything in his life)/ o João nasceu# o rabo do João estava virado para a luaconstraint co-reference (João – rabo)
22 Classification problems (continued) constraint co-reference:(15) A fama subiu à cabeça do Pedro(lit. The fame went up to the head of Peter)*A fama do João subiu à cabeça do Pedro(lit. The fame of John went up to the head of Peter)
23 pseudo-transitive predicative constructions: (16) O João/A Maria não se deu por achado/a (CP1)(lit. John/Mary did not give him-/herself by found_ms/fs, ‘he/she didn’t stop him/herself from doing something’)*/#O João/A Maria (era+estava) achado/a(John/Mary was found_ms/fs)(17) O João deu o trabalho/a tarefa por terminado/a(‘John considered the work_ms/task_fs as finished_ms/fs’)O João considerou o trabalho/a tarefa terminado/aO trabalho/A tarefa estava terminado/a(‘The work_ms/task_fs was finished_ms/fs’)
24 Coordinated PPs and/or NPs (18) O João agradou a gregos e a troianos (CPP)(lit.: John pleased to Greeks and Trojans, ’John pleased everybody’)º O João agradou a gregosº O João agradou a troianosIn this case, Prep2 can be zeroed:O João agradou a gregos e troianosbut this is not always the case:(19) O João e a Maria brincaram aos papás e às mamãs (CPP)(lit.: John and Mary wanted to play daddy and mummy, ‘wanted to have sex’)*?O João e a Maria brincaram aos papás e mamãs
25 Concluding remarks and perspectives same methodology and formal criteria as in M. Gross (1982,1989) and other LG teamsdata comparable to that of other languagesimprove lexical coverageattentive to these ‘shoehorn’ solutions, especially complex frozen sentencesimprove classification towards more homogenous classesexperiments on corpora both for lexical acquisition and semi-automatically add information regarding variants
26 References Acknowledgement : Fundação Calouste Gulbenkian. Araújo-Vale, Oto, Expressões Cristalizadas do Português do Brasil: Uma Proposta de Tipologia (Ph.D. Thesis). Araquara (Brazil): UNESP.Chacoto, Lucília, Estudo e Formalização das Propriedades Léxico-Sintácticas das Expressões Fixas Proverbiais. (M.A. Thesis). Lisbon: FLUL.Fotopoulou, Aggeliki, Une classification des phrases à compléments figés en grec moderne. (PhD Thesis). Paris : Univ. Paris 8.Gaatone, David, A quoi sert la notion d’ «expression figée» ?, in Buvet, P.-A., D. le Pesant, M. Mathieu-Colas (eds.), Lexique, Syntaxe et Sémantique, BULAG (hors série), Besançon : Centre Lucien Tesnière/PUFC, pp. 295‑308.Gross, Gaston, Degrée de figement des noms composés. Langages 90. Paris : Larousse, ppGross, Gaston, Les Expressions Figées en Français. Paris: Ophrys.Gross, Maurice Une classification des phrases ‘figées’ du français. Revue Québécoise de Linguistique Montréal : UQAM, p. 151‑185.Gross, Maurice Les nominalisations d’expressions figées. Langue Française 69, Paris: Larousse, pp. 64‑84.Gross, Maurice Les limites de la phrase figée. Langages 90. Paris: Larousse, ppGross, Maurice Les expressions figées : une description des expressions françaises et ses conséquences théoriques. Rapport Téchnique 8. Paris : LADL-Univ. Paris 7 / CERIL.Gross, Maurice Lexicon-Grammar. in K. Brown and J. Miller (eds.). Concise Encyclopedia of Syntactic Theories. Cambridge: Pergamon, pp. 244‑259.Jurafsky, Daniel and James H. Martin, 2000, Speech and Language Processing. New Jersey: Prentice Hall.Leclère, Christian, Sur une restructuration dative. Language Research Seoul: LRI- Seoul National Univ, pp. 179‑198.Leclère, Christian, Organization of the Lexicon-Grammar of French Verbs, Linguisticae Investigationes 25-1, Amesterdam: John Benjamins Pub. Co., pp. 29‑48.McKeown, Kathleen R. and Dragomir Rodev, 2000, Collocations, in Dale, R., H. Moisl and H. Sommers (eds.) Handbook of Natural Language Processing. New York: Marcel Dekker Inc., ppMejri, Salah, Le figment lexical. Description linguistique et structuration sémantique. La Manouba (Tunis) : Pub. Fac. Lettres.Mel’cuk, I, La phraseologie et son rôle dans l’enseignment / apprentissage d’une langue étrangère. ELA, Didier Érudition, ppMello, Fernando R., Nova Recolha de Provérbios Portugueses e Outros Lugares-Comuns (2nd. ed.). Lisbon: Ed. Afrodite.Mogorrón-Huerta, Pedro, La expressividad en las locuciones verbales españolas y francesas. Alicante: Pub. Univ. Alicante.Moreira, António, Provérbios Portugueses. Lisbon : Ed. Notícias.Mutsimoto, Yuji, Lexical Knowledge Acquisition, in Miktov, R. (ed.) The Oxford Handbook of Computational Linguistics. Oxford: OUP, pp. 395‑413.Neves, Orlando, Dicionário de Expressões Correntes (2nd. ed.) Lisbon: Ed. Notícias.Ranchhod, Elisabete, Cristina Mota, Jorge Baptista, A Computational Lexicon for Automatic Text Parsing, Proceedings of SIGLEX’99: ACL/NScF, pp. 74‑80.Ranchhod, Elisabete M., O lugar das expressões ‘fixas’ na gramática do Português. in Castro, I. and I. Duarte (eds.), Razão e Emoção, vol. II, Lisbon: INCM, pp. 239‑254.Santos, António, Novos Dicionários de Expressões Idiomáticas. Lisbon: João Sá da Costa.Silberztein, Max, Dictionnaires électroniques et analyse automatiques de textes : le système Intex. Paris : Masson.Silberztein, Max, Intex Manual.Simões, Guilherme A., Dicionário de Expressões Populares Portuguesas. Lisbon: D. Quixote.Acknowledgement : Fundação Calouste Gulbenkian.