| 1965 | - | Aspects of the Theory of Syntax - Chomsky | :: | 88 |
| 1980 | - | Rules and Representations - Chomsky | :: | 55 |
| 1990 | - | Finding structure in time - Elman | :: | 49 |
| 1995 | - | The minimalist program - Chomsky | :: | 38 |
| 1987 | - | Foundations of cognitive grammar: Theoretical Prerequisites - Langacker | :: | 37 |
| 1993 | - | Learning and development in neural networks: The importance of starting small - Elman | :: | 33 |
| 1981 | - | Roots of language - Bickerton | :: | 32 |
| 1987 | - | Knowledge of Language: Its Nature, Origin and Use - Chomsky | :: | 32 |
| 1984 | - | The Language Bioprogram Hypothesis - Bickerton | :: | 30 |
| 1989 | - | Learnability and Cognition: The acquisition of Argument Structure - Pinker | :: | 22 |
| 1984 | - | Language Learnability and Language Development - Pinker | :: | 21 |
| 1986 | - | Learning internal representations by error propagation - Rumelhart,Hinton,Williams | :: | 18 |
| 1990 | - | Maturational Constraints on Language Learning - Newport | :: | 17 |
| 1970 | - | Derivational Complexity and Order of Acquisition in Child Speech - Brown,Hanlon | :: | 13 |
| 1981 | - | Government and Binding - Chomsky | :: | 13 |
| 1994 | - | Introduction to Government and Binding Theory - Haegeman | :: | 12 |
| 1994 | - | Impairments of tense in a familial language disorder - Gopnik | :: | 7 |
| 1979 | - | Syntactic Theory and the Projection Problem - Baker | :: | 6 |
| 1979 | - | On understanding grammar - Givon | :: | 6 |
| 1986 | - | Serial Order: a Parallel Distributed Processing Approach - Jordan | :: | 6 |
| 1988 | - | Creole languages and the bioprogram - Bickerton | :: | 6 |
| 1998 | - | WordNet: An Electronic Lexical Database - Fellbaum | :: | 5 |
| 1988 | - | The `No Negative Evidence' Problem: How Do Children Avoid Constructing an Overly General Grammar - Bowerman | :: | 5 |
| 1988 | - | On the proper treatment of connectionism - Smolensky | :: | 5 |
| 1982 | - | Functionalist approaches to grammar - Bates,MacWhinney | :: | 4 |
| 1988 | - | Encoding sequential structure in simple recurrent networks - Servan-Schreiber,Cleeremans,McClelland | :: | 4 |
| 1993 | - | Formal Semantics - Cann | :: | 3 |
| 1984 | - | Syntax: A functional-typological introduction - Givon | :: | 3 |
| 1986 | - | On learning the past tense of English verbs - Rumelhart,McClelland | :: | 3 |
| 1986 | - | Distributed Representations - Hinton,McClelland,Rumelhart | :: | 3 |
| 1997 | - | Bootstrapping Word Boundaries: A Bottom-up Corpus-Based Approach to Speech Segmentation - Cairns,Shillcock,Chater,Levy | :: | 2 |
| 1991 | - | Connectionism and the Mind: An Introduction to Parallel Processing in Networks - Bechtel,Abrahamsen | :: | 2 |
| 1986 | - | An Introduction to Cognitive Grammar - Langacker | :: | 2 |
| 1990 | - | Connectionism and Cognitive Linguistics - Harris | :: | 2 |
| 1988 | - | The mechanisms of `construction grammar' - Fillmore | :: | 2 |
| 1987 | - | Competition, variation and language learning - Bates,MacWhinney | :: | 2 |
| 1970 | - | Language development: Form and function in emerging grammars - Bloom | :: | 2 |
| 1987 | - | Connectionist Learning Procedures - Hinton | :: | 2 |
| 1996 | - | When/Why/Of What is Less More - Joyce | :: | 2 |
| 1980 | - | The critical period and feral children - Curtiss | :: | 2 |
| 1997 | - | Probabilistic Constraints in Acquisition - Allen | :: | 2 |
| 1997 | - | Argument Structures without Lexical Entries - Allen | :: | 2 |
| 1994 | - | Learning long-term dependencies with gradient is difficult - Bengio,Simard,Frasconi | :: | 1 |
| 1985 | - | Lexicalization patterns - Talmy | :: | 1 |
| 1986 | - | A Parallel Network that Learns to Read Outloud - Sejnowski,Rosenberg | :: | 1 |
| 1988 | - | The Story of `Over': Polysemy, Semantics and the Structure of the Lexicon - Brugman | :: | 1 |
| 1989 | - | A connectionist approach to the story of over - Harris | :: | 1 |
| 1992 | - | A theory of the child's theory of mind - Fodor | :: | 1 |
| 1988 | - | Cognitive topology and lexical networks - Brugman,Lakoff | :: | 1 |
| 1986 | - | Parallel Distributed Processing - McClelland,Rumelhart,McClelland | :: | 1 |
| 1972 | - | The Projection Problem: How is a Grammar to be Selected - Peters | :: | 1 |
| 1982 | - | Connectionist models and their properties - Feldman,Ballard | :: | 1 |
| 1984 | - | Parallel computations for controlling an arm - Hinton | :: | 1 |
| 1982 | - | Space grammar, analysability, and the English passive - Langacker | :: | 1 |
| 1986 | - | Learning distributed representations of concepts - Hinton | :: | 1 |
| 1986 | - | Parallel Distributed Processing - Rumelhart,Rumelhart | :: | 1 |
| 1990 | - | Semantic Structures, MIT Press, Cambridge, MA - Jackendoff | :: | 1 |
| 1997 | - | English syntax and argumentation - Aarts | :: | 1 |
| 1942 | - | Learning to speak after six and one half years of silence - Mason | :: | 1 |
| 1984 | - | The acquisition of the dative alternation: unlearning overgeneralisations - Mazurkewich,White | :: | 1 |
| 1993 | - | Modelling the Effects of Processing Limitations on the Acquisition of Morphology: the Less is More Hypothesis - Goldowsky,Newport | :: | 1 |
| 1994 | - | Unaccusativity: At the SyntaxLexical Semantics Interface, MIT Press, Cambridge, MA - Levin,Rappaport-Hovav | :: | 1 |
| 1986 | - | Mechanisms of Sentence Processing: Assigning Roles to Constituents of Sentences - McClelland,Kawamoto | :: | 1 |
| 1987 | - | Resolving a learnability paradox in the acquisition of the verb lexicon - Pinker | :: | 1 |
| 1981 | - | The Story of Over - Brugman | :: | 1 |
| 1981 | - | An interactive activation model of context effects in letter perception: Part I - McClelland,Rumelhart | :: | 1 |
| 1984 | - | Active zones - Langacker | :: | 1 |