behaviorism
Zombies, AI and Temporal Lobe Epilepsy : towards a universal consciousness and behavioral grammar?
I was recently reading an article on Zombies about how the Zombie argument has been used against physicalism and in consciousness debates in general, and one quote by Descartes at the beginning of the article captured my attention :
Descartes held that non-human animals are automata: their behavior is explicable wholly in terms of physical mechanisms. He explored the idea of a machine which looked and behaved like a human being. Knowing only seventeenth century technology, he thought two things would unmask such a machine: it could not use language creatively rather than producing stereotyped responses, and it could not produce appropriate non-verbal behavior in arbitrarily various situations (Discourse V). For him, therefore, no machine could behave like a human being. (emphasis mine)
To me this seems like a very reasonable and important speculation: although we have learned a lot about how we are able to generate an infinite variety of creative sentences using the generative grammar theory of Chomsky (I must qualify, we only know how to create a new grammatically valid sentence-the study of semantics has not complimented the study in syntax – so we still do not know why we are also able to create meaningful sentences and not just grammatically correct gibberish like “Colorless green ideas flow furiously” : the fact that this grammatically correct sentence is still interpretable by using polysemy , homonymy or metaphorical sense for ‘colorless’, ‘green’ etc may provide the clue for how we map meanings -the conceptual Metaphor Theory- but that discussion is for another day), we still do not have a coherent theory of how and why we are able to produce a variety of behavioral responses in arbitrarily various situations.
If we stick to a physical, brain-based, reductionist, no ghost-in-the-machine, evolved-as-opposed-to-created view of human behavior, then it seems reasonable that we start from the premise of humans as an improvement over the animal models of stimulus-response (classical conditioning) or response-reinforcement (operant conditioning) theories of behavior and build upon them to explain how and what mechanism Humans have evolved to provide a behavioral flexibility as varied, creative and generative as the capacity for grammatically correct language generation. The discussions of behavioral coherence, meaningfulness, appropriateness and integrity can be left for another day, but the questions of behavioral flexibility and creativity need to be addressed and resolved now.
I’ll start with emphasizing the importance of response-reinforcement type of mechanism and circuitry. Unfortunately most of the work I am familiar with regarding the modeling of human brain/mind/behavior using Neural Networks focuses on the connectionist model with the implicit assumption that all response is stimulus driven and one only needs to train the network and using feedback associate a correct response with a stimulus. Thus, we have an input layer for collecting or modeling sensory input, a hidden association layer and an output layer that can be considered as a motor effector system. This dissociation of input acuity, sensitivity representation in the form of input layer ; output variability and specificity in the form of an output layer; and one or more hidden layers that associate input with output and may be construed as an association layer maps very well to our intuitions of a sensory system, a motor system and an association system in the brain to generate behavior relevant to external stimuli/situations. However, this is simplistic in the sense that it is based solely on stimulus-response types of associations (the classical conditioning) and ignores the other relevant type of association response-reinforcement. Let me clarify that I am not implying that neural networks models are behavioristic: in the form of hidden layers they leave enough room for cognitive phenomenon, the contention is that they not take into account the operant conditioning mechanisms. Here it is instructive to note that feedback during training is not equivalent to operant-reinforcement learning: the feedback is necessary to strengthen the stimulus-response associations; the feedback only indicates that a particular response triggered by the particular stimuli was correct.
For operant learning to take place, the behavior has to be spontaneously generated and based on the history of its reinforcement its probability of occurrence manipulated. This takes us to an apparently hard problem of how behavior can be spontaneously generated. All our life we have equated reductionism and physicalism with determinism, so a plea to spontaneous behavior seems almost like begging for a ghost-in-the-machine. Yet on careful thinking the problem of spontaneity (behavior in absence of stimulus) is not that problematic. One could have a random number generator and code for random responses as triggered by that random number generator. One would claim that introducing randomness in no way gives us ‘free will’, but that is a different argument. What we are concerned with is spontaneous action, and not necessarily, ‘free’ or ‘willed’ action.
To keep things simple, consider a periodic oscillator in your neural network. Lets us say it has a duration of 12 hours and it takes 12 hours to complete one oscillation (i.e. it is a simple inductor-capacitor pair and it takes 6 hours for capacitor to discharge and another 6 hours for it to recharge) ; now we can make connections a priori between this 12 hr clock in the hidden layer and one of the outputs in the output layer that gets activated whenever the capacitor has fully discharged i.e. at a periodic interval of 12 hours. Suppose that this output response is labeled ‘eat’. Thus we have coded in our neural networks a spontaneous mechanism by which it ‘eats’ at 12 hour durations.
Till now we haven’t really trained our neural net, and moreover we have assumed a circuitry like a periodic oscillator in the beginning itself, so you may object to this saying this is not how our brain works. But let us be reminded that just like normal neurons in the brain which form a model for neurons in the neural network, there is also a suprachiasmatic nuclei that gives rise to circadian rhythms and implements a periodic clock.
As for training, one can assume the existence of just one periodic clock of small granularity, say 1 second duration in the system, and then using accumulators that code for how many ticks have elapsed since past trigger, one can code for any arbitrary periodic response of greater than one second granularity. Moreover, one need not code for such accumulators: they would arise automatically out of training from the other neurons connected to this ‘clock’ and lying between the clock and the output layer. Suppose, that initially, to an output marked ‘eat’ a one second clock output is connected (via intervening hidden neuron units) . Now, we have feedback in this system also. Suppose, that while training, we provide positive feedback only on 60*60*12 trials (and all its multiples) and provide negative feedback on all other trials, it is not inconceivable to believe that an accumulator neural unit would get formed in the hidden layer and count the number of ticks that come out of the clock: it would send the trigger to output layer only on every 60*60*12 th trial and suppress the output of the clock on every other trial. Viola! We now have a 12 hour clock (which is implemented digitally using counting ticks) inside our neural network coding for a 12 hour periodic response. We just needed to have one ‘innate’ clock mechanism and using that and the facts of ‘operant conditioning’ or ‘response-reinforcement’ pairing we can create an arbitrary number of such clocks in our body/brain. Also, please notice the fact, that we need just one 12 hour clock, but can flexibly code for many different 12 hour periodic behaviors. Thus, if the ‘count’ in accumulator is zero, we ‘eat’; if the count is midway between 0 and 60*60*12, we ‘sleep’. Thus, though both eating and sleeping follow a 12 hour cycle, they do not occur concurrently, but are separated by a 6 hour gap.
Suppose further, that one reinforcement that one is constantly exposed to and that one uses for training the clock is ‘sunlight’. The circadian clock is reinforced, say only by the reinforcement provided by getting exposed to the mid noon sun, and by no other reinforcements. Then, we have a mechanism in place for the external tuning of our internal clocks to a 24 hour circadian rhythm. It is conceivable, that for training other periodic operant actions, one need not depend on external reinforcement or feedback, but may implement an internal reinforcement mechanism. To make my point clear, while ‘eat’ action, i.e. a voluntary operant action, may get generated randomly initially, and in the traditional sense of reinforcement, be accompanied by intake of food, which in the classical sense of the word is a ‘reinforcement’; the intake of food, which is part-and-parcel of the ‘eat’ action should not be treated as the ‘feedback’ that is required during training of the clock. During the training phase, though the operant may be activated at different times (and by the consequent intake of food be intrinsically reinforced) , the feedback should be positive only for the operant activations inline with the periodic training i.e. only on trials on which the operant is produces as per the periodic training requirement; and for all other trails negative feedback should be provided. After the training period, not only would operant ‘eat’ be associated with a reinforcement ‘food’: it would also occur as per a certain rhythm and periodicity. The goal of training here is not to associate a stimulus with a response ( (not the usual neural networks association learning) , but to associate a operant (response) with a schedule(or a concept of ‘time’). Its not that revolutionary a concept, I hope: after all an association of a stimulus (or ‘space’) with response per se is meaningless; it is meaningful only in the sense that the response is reinforced in the presence of the stimulus and the presence of the stimulus provides us a clue to indulge in a behavior that would result in a reinforcement. On similar lines, an association of a response with a schedule may seem arbitrary and meaningless; it is meaningful in the sense that the response is reinforced in the presence of a scheduled time/event and the occurrence of the scheduled time/event provides us with a reliable clue to indulge in a behavior that would result in reinforcement.
To clarify, by way of an example, ‘shouting’ may be considered as a response that is normally reinforcing, because of say its being cathartic in nature . Now, ‘shouting’ on seeing your spouse”s lousy behavior may have had a history of reinforcement and you may have a strong association between seeing ‘spouse’s lousy behavior’ and ‘shouting’. You thus have a stimulus-response pair. why you don’t shout always, or while say the stimuli is your ‘Boss’s lousy behavior’, is because in those stimulus conditions, the response ‘shouting’, though still cathartic, may have severe negative costs associated, and hence in those situations it is not really reinforced. Hence, the need for an association between ‘spouse lousy behavior’ and ‘shouting’ : only in the specific stimulus presence is shouting reinforcing and not in all cases.
Take another example that of ‘eating’, which again can be considered to be a normally rewarding and reinforcing response as it provides us with nutrition. Now, ‘eating’ 2 or 3 times in a day may be rewarding; but say eating all the time, or only on 108 hours periodicity may not be that reinforcing a response, because that schedule does not take care of our body requirements. While eating on a 108 hours periodicity would impose severe costs on us in terms of under nutrition and survival, eating on 2 mins periodicity too would not be that reinforcing. Thus, the idea of training of spontaneous behaviors as per a schedule is not that problematic.
Having taken a long diversion, arguing for a case for ‘operant conditioning’ based training of neural networks, let me come to my main point.
While ‘stimulus’ and the input layer represent the external ‘situation’ that the organism is facing, the network comprising of the clocks and accumulators represent the internal state and ‘needs’ of the organism. One may even claim, a bit boldly, that they represent the goals or motivations of the organism.
A ‘eat’ clock that is about to trigger a ‘eat’ response, may represent a need to eat. This clock need not be a digital clock, and only when the 12 hour cycle is completed to the dot, an ‘eating’ act triggered. Rather, this would be a probabilistic, analog clock, with the ‘probability’ of eating response getting higher as the 12 hour cycle is coming to an end and the clock being rest, whenever the eating response happens. If the clock is in the early phases of the cycle (just after an eating response) then the need for eating (hunger) is less; when the clock is in the last phases of the cycle the hunger need is strong and would likely make the ‘eating’ action more and more probable.
Again, this response-reinforcement system need not be isolated from the stimulus-response system. Say, one sees the stimulus ‘food’, and the hunger clock is still showing ‘medium hungry’. The partial activation of the ‘eat’ action (other actions like ‘throw the food’, ignore the food, may also be activated) as a result of seeing the stimulus ‘food’ may win over other competing responses to the stimuli, as the hunger clock is still activating a medium probability of ‘hunger’ activation and hence one may end up acting ‘eat’. This however, may reset the hunger clock and now a second ‘food’ stimulus may not be able to trigger ‘eat’ response as the activation of ‘eat’ due to ‘hunger clock’ is minimal and other competing actions may win over ‘eat’.
To illustrate the interaction between stimulus-response and response-reinforcement in another way, on seeing a written word ‘hunger’ as stimulus, one consequence of that stimulus could be to manipulate the internal ‘hunger clock’ so that its need for food is increased. this would be simple operation of increasing the clock count or making the ‘need for hunger’ stronger and thus increasing the probability of occurrence of ‘eat’ action.
I’ll also like to take a leap here and equate ‘needs’ with goals and motivations. Thus, some of the most motivating factors for humans like food, sex, sleep etc can be explained in terms of underlying needs or drives (which seem to be periodic in nature) and it is also interesting to note that many of them do have cycles associated with them and we have sleep cycles or eating cycles and also the fact that many times these cycles are linked with each other or the circadian rhythm and if the clock goes haywire it has multiple linked effects affecting all the motivational ‘needs’ spectrum. In a mainc pahse one would have low needs to sleep, eat etc, while the opposite may be true in depression.
That brings me finally to Marvin Minsky and his AI attempts to code for human behavioral complexity.
In his analysis of the levels of mental activity, he starts with the traditional if, then rule and then refines it to include both situations and goals in the if part.

To me this seems intuitively appealing: One needs to take into account not only the external ‘situation’, but also the internal ‘goals’ and then come up with a set of possible actions and maybe a single action that is an outcome of the combined ‘situation’ and ‘goals’ input.
However, Minsky does not think that simple if-then rules, even when they take ‘gaols’ into consideration would suffice, so he posits if-then-result rules.
To me it is not clear how introducing a result clause makes any difference: Both goals and stimulus may lead to multiple if-then rule matches and multiple actions activation. These action activations are nothing but what Minsky has clubbed in the result clause and we still have the hard problem of given a set of clauses, how do we choose one of them over other.
Minsky has evidently thought about this and says:
What happens when your situation matches the Ifs of several different rules? Then you’ll need some way to choose among them. One policy might arrange those rules in some order of priority. Another way would be to use the rule that has worked for you most recently. Yet another way would be to choose rules probabilistically.
To me this seems not a problem of choosing which rule to use, but that of choosing which response to choose given several possible responses as a result of application of several rules to this situation/ goal combination. It is tempting to assume that the ‘needs’ or ‘gaols’ would be able to uniquely determine the response given ambiguous or competing responses to a stimulus; yet I can imagine a scenario where the ‘needs’ of the body do not provide a reliable clue and one may need the algorithms/heuristics suggested by Minsky to resolve conflicts. Thus, I see the utility of if-then-result rules: we need a representation of not only the if part (goals/ stimulus) in the rule; which tells us what is the set of possible actions that can be triggered by this stimulus/ situation/ needs combo; but also a representation of the results part of the rule: which tells us what reinforcement values these response(actions) have for us and use this value-response association to resolve the conflict and choose one response over the other. This response-value association seems very much like the operant-reinforcement association, so I am tempted once more to believe that the value one ascribes to a response may change with bodily needs and rather is reflective of bodily needs, but I’ll leave that assumption for now and instead assume that somehow we do have different priorities assigned to the responses ( and not rules as Minsky had originally proposed) and do the selection on the basis of those priorities.
Though I have posited a single priority-based probabilistic selection of response, it is possible that a variety of selection mechanisms and algorithms are used and are activated selectively based on the problem at hand.
This brings me to the critic-selector model of mind by Minsky. As per this model, one needs both critical thinking and problem solving abilities to act adaptively. One need not just be good at solving problems- one also has to to understand and frame the right problem and then use the problem solving approach that is best suited to the problem.

Thus, the first task is to recognize a problem type correctly. After recognising a problem correctly, we may apply different selctors or problem solving strategies to different problems.
He also posits that most of our problem solving is analogical and not logical. Thus, the recognizing problem is more like recognizing a past analogical problem; and the selecting is then applying the methods that worked in that case onto this problem.
How does that relate to our discussions of behavioral flexibility? I believe that every time we are presented with a stimulus or have to decide how to behave in response to that stimulus, we are faced with a problem- that of choosing one response over all others. We need to activate a selection mechanism and that selection mechanism may differ based on the critics we have used to define the problem. If the selection mechanism was fixed and hard-wired then we wont have the behavioral flexibility. Because the selection mechanism may differ based on our framing of the problem in terms of the appropriate critics, hence our behavioral response may be varied and flexible. At times, we may use the selector that takes into account only the priorities of different responses in terms of the needs of the body; at other times the selector may be guided by different selection mechanisms that involve emotions and values us the driving factors.
Minsky has also built a hierarchy of critics-selector associations and I will discuss them in the context of developmental unfolding in a subsequent post. For now, it is sufficient to note that different types of selection mechanisms would be required to narrow the response set, under different critical appraisal of the initial problem.
To recap, a stimulus may trigger different responses simultaneously and a selection mechanism would be involved that would select the appropriate response based on the values associated with the response and the selection algorithm that has been activated based on our appraisal of the reason for conflicting and competing responses. while critics help us formulate the reason for multiple responses to the same stimuli, the selector helps us to apply different selection strategies to the response set, based on what selection strategy had worked on an earlier problem that involved analogous critics.
One can further dissociate this into two processes: one that is grammar-based, syntactical and uses the rules for generating a valid behavioral action based on the critic and selector predicates and the particular response sets and strategies that make up the critic and selector clause respectively. By combining and recombining the different critics and selectors one can make an infinite rules of how to respond to a given situation. Each such rule application may potentially lead to different action. The other process is that of semantics and how the critics are mapped onto the response sets and how selectors are mapped onto different value preferences.
Returning back to the response selection, given a stimulus, clearly there are two processes at work : one that uses the stored if-then rules (the stimulus-response associations) to make available to us a set of all actions that are a valid response to the situation; and the other that uses the then-result rules (and the response-value associations, that I believe are dynamic in nature and keep changing) to choose one of the response from that set as per the ‘subjective’ value that it prefers at the moment. This may be the foundation for the ‘memory’ and ‘attention’ dissociations in working memory abilities used in stroop task and it it tempting to think that the while DLPFC and the executive centers determine the set of all possible actions (utilizing memory) given a particular situation, the ACC selects the competing responses based on the values associated and by selectively directing attention to the selected response/stimuli/rule.
Also, it seems evident that one way to increase adaptive responses would be to become proficient in discriminating stimuli and perceiving the subjective world accurately; the other way would be to become more and more proficient in directing attention to a particular stimulus/ response over others and directing attention to our internal representations of them so that we can discriminate between the different responses that are available and choose between them based on an accurate assessment of our current needs/ goals.
This takes me finally to the two types of consciousness that Hughlings-Jackson had proposed: subject consciousness and object consciousness.
Using his ideas of sensorimotor function, Hughlings-Jackson described two “halves” of consciousness, a subject half (representations of sensory function) and an object half (representations of motor function). To describe subject consciousness, he used the example of sensory representations when visualizing an object . The object is initially perceived at all sensory levels. This produced a sensory representation of the object at all sensory levels. The next day, one can think of the object and have a mental idea of it, without actually seeing the object. This mental representation is the sensory or subject consciousness for the object, based on the stored sensory information of the initial perception of it.
What enables one to think of the object? This is the other half of consciousness, the motor side of consciousness, which Hughlings-Jackson termed “object consciousness.” Object consciousness is the faculty of “calling up” mental images into consciousness, the mental ability to direct attention to aspects of subject consciousness. Hughlings-Jackson related subject and object consciousness as follows:
The substrata of consciousness are double, as we might infer from the physical duality and separateness of the highest nervous centres. The more correct expression is that there are two extremes. At the one extreme the substrata serve in subject consciousness. But it is convenient to use the word “double.”
Hughlings-Jackson saw the two halves of consciousness as constantly interacting with each other, the subjective half providing a store of mental representations of information that the objective half used to interact with the environment.
Further,
The term “subjective” answers to what is physically the effect of the environment on the organism; the term “objective” to what is physically the reacting of the organism on the environment.
Hughlings-Jackson’s concept of subjective consciousness is akin to the if-then representation of mental rules.One needs to perceive the stimuli as clearly as possible and to represent them along with their associated actions so that an appropriate response set can be activated to respond to the environment. His object consciousness is the attentional mechanism that is needed to narrow down the options and focus on those mental representations and responses that are to be selected and used for interacting with the environment.
As per him, subject and object consciousness arise form a need to represent the sensations (stimuli) and movements (responses) respectively and this need is apparent if our stimulus-response and response-reinforcement mappings have to be taken into account for determining appropriate action.
All nervous centres represent or re-represent impressions and movements. The highest centres are those which form the anatomical substrata of consciousness, and they differ from the lower centres in compound degree only. They represent over again, but in more numerous combinations, in greater complexity, specialty, and multiplicity of associations, the very same impressions and movements which the lower, and through them the lowest, centres represent.
He had postulated that temporal lobe epilepsy involves a loss in objective consciousness (leading to automatic movements as opposed to voluntary movements that are as per a schedule and do not happen continuously) and a increase in subjective consciousness ( leading to feelings like deja-vu or over-consciousness in which every stimuli seems familiar and triggers the same response set and nothing seems novel – the dreamy state). These he described as the positive and negative symptoms or deficits associated with an epileptic episode.
It is interesting to note that one of the positive symptom he describes of epilepsy, that is associated with subjective consciousness of third degree, is ‘Mania’ : the same label that Minsky uses for a Critic in his sixth self-consciousness thinking level of thinking. The critic Minsky lists is :
Self-Conscious Critics. Some assessments may even affect one’s current image of oneself, and this can affect one’s overall state:
None of my goals seem valuable. (Depression.)
I’m losing track of what I am doing. (Confusion.)I can achieve any goal I like! (Mania.)
I could lose my job if I fail at this. (Anxiety.)Would my friends approve of this? (Insecurity.)
Interesting to note that this Critic or subjective appraisal of the problem in terms of Mania can lead to a subjective consciousness that is characterized as Mania.
If Hughlings-Jackson has been able to study epilepsy correctly and has been able to make some valid inferences, then this may tell us a lot about how we respond flexibly to novel/ familiar situations and how the internal complexity that is required to ensure flexible behavior, leads to representational needs in brain, that might lead to the necessity of consciousness.
Chomsky Vs. Skinner : a role for behaviorist ideas in Language Acquisition
Chomsky, in a classical paper, discusses Skinner’s book Verbal Behavior and the associated attempts of behaviorists to explain Language Acquisition as just another complex behavior learned entirely through behaviorist mechanisms of classical and operant conditioning.
Chomsky himself clarifies the difference between cognitive and behaviorist explanations as follows:
It is important to see clearly just what it is in Skinner’s program and claims that makes them appear so bold and remarkable, It is not primarily the fact that he has set functional analysis as his problem, or that he limits himself to study of observables, i.e., input-output relations. What is so surprising is the particular limitations he has imposed on the way in which the observables of behavior are to be studied, and, above all, the particularly simple nature of the function which, he claims, describes the causation of behavior. One would naturally expect that prediction of the behavior of a complex organism (or machine) would require, in addition to information about external stimulation, knowledge of the internal structure of the organism, the ways in which it processes input information and organizes its own behavior. These characteristics of the organism are in general a complicated product of inborn structure, the genetically determined course of maturation, and past experience. …… The differences that arise between those who affirm and those who deny the importance of the specific “contribution of the organism” to learning and performance concern the particular character and complexity of this function, and the kinds of observations and research necessary for arriving at a precise specification of it. If the contribution of the organism is complex, the only hope of predicting behavior even in a gross way will be through a very indirect program of research that begins by studying the detailed character of the behavior itself and the particular capacities of the organism involved.
It would be prudent for me to clarify at the outset, that I am a Cognitivist and definitely see the merits of Chomsky’s arguments and the inadequacy of the potentially misguided attempts of Skinner and other behaviorists to apply the behaviorist concepts and results derived from animal studies to the study of semantics or how words get associated with a particular meaning and are used in particular contexts – either due to their prior association with a stimulus (stimulus control…something like classical conditioning in which the word ‘red’ gets associated with the property redness of an object and the internal visual response or qualia of redness that is produced automatically in response to the stimulus redness causes a conditioned association between “red’ and the qualia redness) or because the word or sentence was reinforced variably through various mechanisms like self-reinforcement, reinforcement-by-way-of-praise etc.
I definitely do not concur with Skinner’s arguments and definitions, and Chomsky show to some extent an understanding of the behaviorist concepts (especially in section II), but he also at times shows his profound lack of appreciation of finer subtleties of behaviorist concepts. For example:
In the book under review, response strength is defined as “probability of emission” (22). This definition provides a comforting impression of objectivity, which, however, is quickly dispelled when we look into the matter more closely. The term probability has some rather obscure meaning for Skinner in this book.9 We are told, on the one hand, that “our evidence for the contribution of each variable [to response strength] is based on observation of frequencies alone” (28). At the same time, it appears that frequency is a very misleading measure of strength, since, for example, the frequency of a response may be “primarily attributable to the frequency of occurrence of controlling variables” (27). It is not clear how the frequency of a response can be attributable to anything BUT the frequency of occurrence of its controlling variables if we accept Skinner’s view that the behavior occurring in a given situation is “fully determined” by the relevant controlling variables.
Here Chomsky has mixed and made a mess of the two separate concepts and processes in behaviorism- classical and operant conditioning. In the above paragraph, the definition of response in terms of ‘probability of occurrence’ is in terms of operant conditioning – wherein responses are autonomously generated by an organism irrespective of any stimulus (leave aside the case of discriminating stimulus as of now) that is present – for e.g. a bar-press- and based on the reinforcing stimulus that is presented to the organism , post response, the response strength or probability that the response would occur, autonomously, in future , increases. This is mixed up with the earlier concept of stimulus control (or classical conditioning) wherein controlling variables (or conditioned stimulus) relevant to a situation lead to an utterance or verbal behavior. This determining of verbal behavior due to presence of a a conditioned stimulus (reflexive language) would be a different mechanism form that used in deliberative language , wherein, an utterance is produced voluntarily and in defiance of its surrounding stimuli, but the probability of that occurrence is in proportion to its history of reinforcement. By mixing the two concepts, Chomsky just manages to show his ignorance and lack of appreciation of the behaviorist concepts/ mechanisms.
But my gripe with Chomsky is more for the change in focus that he has managed to pull off, with the study of semantics taking a backseat to the study of grammar or syntax. In my limited comprehension, I am unable to appreciate, how concepts of Universal Grammar, however much relevant and innate, could be a substitute for a proper analysis of language acquisition in terms of an ability to not only mastered the grammar, but also the semantics. Grammar or Grammar acquisition, per se, does not inform much about the actual and most relevant aspects of language acquisition- viz. semantics and pragmatics.
Addressing semantics, would be a task for a later day (and perhaps for a more capable person than me), but today I would like to tentatively propose a role for behaviorist concepts of reinforcement or operant conditioning as relevant to the general ability to understand and produce language and also to the general difference in talkativeness (and listening-ness, if there exists such a concept) between different people
Language acquisition should be broken into two components – a language understanding (or hearing) component and another language production (or speaking) component. It is a fact that the first component related to language understanding develops prior to language production. Also, it should be kept in mind that language is essentially a two person activity, with the utterance of one acting as (reinforcing) stimulus for the other and the utterance of another acting as a response.
The Hearing (or language understanding) activity:
This language component is used for understanding the meaning of utterances (say spoken language as opposed to written or depicted using sign language) and is relatively independent of language production.
Response is parsing the spoken sentence into words and by analyzing the syntax and meaning of the words constructing a mental image of the intention, beliefs, knowledge and possible behavior of the person who spoke the sentence and integrate that knowledge with the representation and expectation of the world in general.
Reinforcing stimulus is observing the behavior of the person who spoke the sentence to be in accordance with that earlier constructed expectation and prediction (the hearing response). It is assumed that an external act (whether negative or positive) that is in accord with an internal expectation would be rewarding in the sense that it would satisfy and reduce the internal drive to know in general – and to know the future in particular. Alternately, it can be posited that the state of not knowing clearly about the future is a state of unbearable tension and the uncertainty associated with the world is a negative stimulus (property) associated with the world. By hearing and understanding a sentence uttered by someone else, some of this aversive stimulus (uncertainty) is removed and thus by negative reinforcement (removal of an aversive stimulus) any act of hearing (or understanding…or refining the predictions regarding the world) is inherently rewarding irrespective of whether the actual outcome is as per the constructed expectations. Positive reinforcement of having the expectation met would result in strengthening of the hearing response. This is a general strengthening of the hearing response (or the response of creating expectations from heard utterances) and is independent of the actual content of that expectation. Thus, if an effort to construe meaning from an utterance was followed by a positive reinforcement of having that meaning verified, then the propensity of construing meaning from utterances would increase in strength. It is posited that this behaviorist mechanism is one of the strong motivating factor that encourages a child to understand the language of its parents/ society.
Although as adults, parsing sentences into words and extracting meaning from it seems automatic to us, for a child extracting meaning from a string of syllables is a very effort full activity, and the fact that doing so leads to positive reinforcement would encourage the child to pay attention to the hearing and understanding activity and increase their habit strength. The alternate mechanism to such a behavioristically mediated hearing acquisition could be claiming that development of language understanding is under genetic control and is similar to imprinting or genetic unfolding. This claim is weakened by an ability of mature adults to learn a foreign language. Thus, if this mechanism uses imprinting alone, it should be possible only under a critical period of childhood and not amenable to acquisition in adulthood. the fact that children are able to learn second languages faster and better than adults and some evidence form study of feral children as to a critical period necessary for first language acquisition, point to a mixed role of genetic factors like imprinting and behaviorist factors like reinforcement of the ‘predicting the world capability’.
The Speaking (or language production) activity:
This language component is used for production of meaningful utterances (say spoken language as opposed to written or depicted using sign language) and follows the relevant stage of language comprehension.
Response, in this case, would be constructing a valid, informative sentence by piecing together words that denote the shared meaning of objects and situations and uttering a valid meaningful sentence directed towards another listener. The intention for the utterance could be pedagogic (informing or teaching a fact to someone about whom you care), instrumental (using the person spoken to as a tool to achieve desired outcome), empathetic (sharing thoughts, feeling etc with the other person) or of some other kind.
Reinforcing stimulus, in this case would be observing the behavior of spoken-to person and discovering that the relevant information/ facts have been conveyed and understood properly. This reinforcing stimulus, can take the form of either observing the actual behavior, inline with the intended meaning of the utterance, or can be as subtle as deciphering the facial expressions of the listener for signs of understanding. In elongated verbal conversations, a verbal utterance by the listener, may serve as a reinforcing stimulus, and substitute for the outward behavior/ understanding expression (This for example is relevant in telephonic conversations and is one of the reasons children learn to speak on telephones later than they learn talking to adults face-to-face). The stimulus is reinforcing because it satisfies an earlier drive to control (use the other person as a tool for ones ends), the drive to share (the drive for belongings and intimacy) or the drive to inform (pedagogic drive).
Speaking, or constructing valid sentences by stringing syllables together, is again an effortful activity, and though as an adult it may seem effortless, strong motivations have to be present in childhood, for development of proper language production capabilities. The reinforcing stimulus, of having one’s intentions met, by observing the behaviour of the listener, provides the required incentive and mechanism whereby the habit strength of generating meaningful utterances is strengthened.
How to test for this theory:
It is clear from above discussion, that Hearing or language understanding predominantly relies on the drive for meaning or for predicting the world as its guiding mechanism, whereby the speaking or language production relies on other mechanisms involving drive for control , empathy and instruction.
I was recently introduced to Terror Management Theory (TMT) while reading a post by Mixing Memory on how TMT may influence Art Appreciation and I believe a similar study can be used for determining the ability of language understanding to provide meaning.
Specifically, if some subjects are primed with thoughts of death (as opposed to a neutral control topic) , then may exhibit a stringer drive for subsequent activities that give rise to a sense of meaning. This manipulation could be in the form of thinking of the September 11 attacks which increase mortality salience or by asking the participants to read the following instructions designed to increase their mortality salience:
Please briefly describe the emotions that the thought of your own death arouses in you.
Jot down, as specifically as you can, what you think will happen to you physically as you die and once you are physically dead.
The other half of the participants should be made to respond to similar instructions, but in reference to an upcoming exam rather than death.
Afterwards, both groups should be allowed an activity that involves language understanding (say listening to a meaningful audio radio program or conversation) and one that does not involve language understanding (say painting or sketching a drawing). The respondents should then be asked which activities (language comprehension related or visual painting related) they found more satisfying or meaningful. If those who had high mortality salience also showed a preponderant satisfaction by indulging in language comprehension related activities as opposed to control group and control task, then this would be a strong indicator of the importance of meaning formation in the motivation for language comprehension. A particular confound here is the second task ( as per the Mixing Memory task, Art may also serve as a Meaning generator and hence not be a suitable control task and should be replaced by a meaningless task like repetitive manual action task) and it should be ensured that this task does not involve Meaning generation. One control that seems appropriate is language production, as the mechanism underlying that is posited to be different from Meaning acquisition. Thus, the control activity could be related to language production (say allowing the participants to make an extempore speech on a topic for 20 minutes).
Finally, I would like to highlight a real life experiment. Those who participate in a ten day Vipassana Meditation camp are not allowed to speak for those ten days. As such, the amount they hear is also limited to some morning/ evening hymns (that may involve more music than language) and apart from that no other hearing or language understanding takes place. After the ten day speaking fast, when the participants talk to each other, one finds great meaning in the conversations. This may be a case of reduction of the meaning drive, after its prolonged starvation.
Also, the traits like loquaciousness may be explained partially in terms of the different underlying needs for control, empathy, instruction etc that give rise to the talking behavior, as well as the particular history of reinforcement that the subject has undergone, thus making that trait subject to both genetic and environmental influences.
To end on a lighter note, please note the Mixing Memory’s evaluation of such studies linking TMT and Art.
I’ve never really hung out in a social psychology laboratory, but here is how I picture a typical day in one. There are some social psychologists sitting around, drinking some sort of exotic tea, and free associating. One psychologist will say the name of a random social psychological theory, and another will then throw out the first thing that comes into his or her head. They’ll write each of these down, and the associations will then become the basis for their next several research projects. OK, so that’s probably not really what’s going on, and I suppose there’s a more scientific method to the social psychologist’s madness, but occasionally I come across a study that makes me wonder. And the great thing about having a blog is that I get to write about it when I do. Today’s example: terror management theory and modern art
I am, at present, camping in the filed of Social Psychology and thus take the privilege of suggesting a more bizarre study that could possibly prove what we may all intuitively know – that the motivation for hearing something is because we derive meaning from it! (Remember the cocktail party effect, wherein we are able to selectively listen to the conversation of interest- or one that is most meaningful to us). As the say, no research is that abstruse as to not get funded. So all you students out there, anyone care to conduct such a research (and prove me right)!

Recent Comments