I just received this email from Netflix:

As of the submission by team "BellKor's Pragmatic Chaos" on June 26, 2009 18:42:37 UTC, the Netflix Prize competition entered the "last call" period for the Grand Prize. In accord with the Rules, teams have thirty (30) days, until July 26, 2009 18:42:37 UTC, to make submissions that will be considered for this Prize. Good luck and thank you for participating!

Indeed, BellKor's Pragmatic Chaos -- the newly formed coalition of 4 teams -- broke the 10% barrier to 1M dollar prize in the Netflix competition. Here is the post that describes how the team was named. Congratulations to the team!

As stated in the email, this submission starts the "last call" 30-day period to provide a final solution for the prize. Whoever has the best submission by the end of the last call period wins the Prize.

Few days ago I attended one of the Google Seattle Tech Talks; this one was given by Helen Greiner, iRobot's co-founder and chairman. She was visiting Seattle to sign the licensing agreement for UW's Seagliders and stopped at Google Kirkland to talk about iRobot and its robots.

Her talk was interesting, not so much for the descriptions of robots that iRobot develops, but mostly for the history of the company and their learning experience as they were trying to design and build robots and find their market to sell products to survive. This somewhat reminded me the story of Webmind AI Engine (Waking Up from the Economy of Dreams), albeit with a happy ending as iRobot is a live and profitable company (although its stock has been trading recently close to 52-week low).

There was a brief Q-and-A session in the end and I had an opportunity to ask a few questions, one of them was "When do you see your robots becoming more intelligent?" Helen's response was that intelligence is a complex term, but their robots are intelligent to some degree: they don't fall off the stairs, can sense objects around them, interact with virtual walls, can re-dock themselves, and, most importantly, they clean carpets and floors and do a good job at that. Essentially -- what follows is obviously my interpretation -- they are well designed for a particular task and there is no obvious need or an easy way for them to become more intelligent.

As I own several generations of Roombas I'm familiar with how they operate and what they are capable of. Still, I struggled to counter Helen's definition of intelligence without falling prey to Tesler's Theorem -- "AI is whatever hasn't been done yet" (Godel, Escher, Bach: an Eternal Golden Braid, p.601). Do Roombas possess even limited intelligence? I do not believe so and here is why.

It's not so much the ability to solve a problem that is critical to intelligence as it is the ability to learn how to solve problems. Most of the tests for intelligence seem to be focused on testing ability to solve problems, relying on the assumption that if you can solve those problems you learned the ability to solve them, hence you're intelligent. A better way to test intelligence would be have some sort of dynamic test when you learn to solve problems as part of the test and the progress you make would define your level of intelligence. Otherwise we don't really test intelligence so much as we test experience.

Wang From NARS to a Thinking Machine, p.2: "...whether the system is intelligent, or how intelligent it is, is not determined by what the system can do, but what it can learn to do."

Roombas don't learn; they don't improve their actions. For example, I have 5-inch space between floor transition in my kitchen and thick carpet in my living room and Roomba keeps getting stuck there (literally; it can't move and needs to be picked up). With more intelligence, it could possibly learn to avoid those spots or maybe go through them using a different direction. They also don't acquire new actions and don't generate actions to do their job more effectively. Even their initial "experience" is not acquired, but hard wired based on experience of its designers. While there are noticeable differences between the first and last generations of Roomba -- more autonomy, ability to dock itself and to sense objects and slow down before hitting an obstacle, and other things -- this is still not enough to call it intelligent.

The more difficult question is, if we add more sensors, mapping capabilities, working memory, and other components, would it make it intelligent then? I will not attempt to answer this question here, but will do a separate post on how I define intelligence and what it may take to make an intelligent robot.

PLoS Computational Biology journal has published Ten Simple Rules for Getting Published. My favorite is number 7:

7. Start writing the paper the day you have the idea of what questions to pursue

Your papers should be concise, and impart as much information as possible in the least number of words. Maintain a good bibliographic database as you go, and read the papers in it.

I'm using CiteULike and have my own list with 110+ articles and books, most of which I have read. Still, I am struggling with the fact that I don't have a good way of capturing my thoughts. A getting-things-done-like system may help, but its focus is on managing tasks and things to do, rather than a bunch of poorly organized and (not yet) related thougths and ideas. Yes, I have a paper notebook, but it doesn't capture granularity of thought I'm after. I also leave many notes on papers I read and no good place to transfer them to. The system covered in How to make a complete map of every thought you think looks very promising:

You'll really have clear idea of what kinds of thought are going through your head. You'll really understand your ideas. And you'll also see connections that you were never consciously aware of. You'll see a structure and a pattern in your life. Your goals and psychology will become clearer to you. You'll be clearer too about what you don't understand. You'll be familiar with your mental terrain. Addictive clarity. Vast clarity. Exceptional clarity. Extraordinary clarity.

The book covers materials, general principles, intra-, extra-subject architecture, theory of notebooks (!) and has everything to get you started. You just need to have patience to do a lot of work to keep it going.

Here is another good article packed with useful and insightful tips for Ph.D. students

Your task now is to get to the central result as fast as possible. [...] There should be nothing before the main result that a reader does not need to know in order to understand the main result.

And another great advise on taking responsibility for your writing:

Much bad writing comes down to trying to avoid responsibility for what you're saying. That's why people resort to passive sentences, "it should be noted that", poor organization with literature first and your idea last, and so on. Take a deep breath, and take responsibility for what you're writing.

By the way, are you working on one of the most important problems in your field?

Unsolved problems of neuroscience:

  • Awareness: What is the neuronal basis of subjective experience, wakefulness, alertness, arousal and attention? What is its function?
  • Perception: How does the brain transfer sensory information into coherent, private percepts? What are the rules by which perception is organized? What are the features/objects that constitute our perceptual experience of internal and external events? How are the senses integrated? Is face perception special (e.g. inert)? What is the relationship between subjective experience and the physical world?
  • Learning and Memory: Where do our memories get stored and how are they retrieved again? How can learning be improved? What is the difference between explicit and implicit memories? How plastic is the mature brain?
  • Development: How and why did the brain evolve (the way it did)? What are the molecular determinants of individual brain development?
  • Sleep: What is the function of sleep? Why do we dream? What are the underlying brain mechanisms? What is its relation to anesthesia?
  • Cognition and Decisions: How and where does the brain evaluate reward value and effort (cost) to modulate behavior? How does previous experience alter perception and behavior? What are the genetic and environmental contributions to brain function?
  • Language: How is it implemented neurally? What is the basis of semantic meaning?
  • Diseases: What are the neural bases (causes) of mental diseases like psychotic disorders (e.g. mania, schizophrenia), Parkinson's disease, Alzheimer's disease or addiction? Is it possible to recover loss of sensory or motor function?

"Ten Unsolved Questions of Neuroscience", an upcoming book by David Eagleman and Patricia Churchland, presents a similar list (I couldn't find any more information about this book other than what's mentioned here):

  • How is information coded in neural activity?
  • How are memories stored and retrieved?
  • How do brains balance plasticity against retention?
  • What is the brain’s extensive baseline activity about?
  • How should we neurally characterize emotions?
  • What is intelligence?
  • How is time managed in nervous systems?
  • Why do brains sleep and dream?
  • How do the highly specialized subsystems of the brain coordinate with one another?
  • What is the neural substrate of consciousness?

I have my own list of questions to think about.

Robot Hardware

Artificial eyes to "monitor areas over 360°" (I wish I knew how area over 360° looks like) in real time, touch sensors that allow feel out the likeness of Abraham Lincoln on a penny, and alcohol-powered muscles that are 100 times stronger than natural muscles, able to do 100 times greater work per cycle and produce, at reduced strengths, larger contractions than natural muscles. What's next?

[via Robots.net and Cognitive Daily]

Microsoft just announced Microsoft Robotics Studio -- a Windows-based environment that includes robotics development platform and lightweight services oriented runtime. The Studio provides simulation runtime and relies on the web services application model to interact with available services (this is especially close to my heart as the two books that I co-authored for O'Reilly were about webservices).

The toolkit currently supports LEGO Mindstorms (both RCX and NXT) and Fischertechnik interfaces.

The discussion group that covers Robotics Studio already has several interesting threads. From one of the messages (slightly edited for presentability); unfortunately I don't see any way to link to a particular message, but you can search for several "Absolute Horror" messages:

Robotics Studio will take your focus OFF the real problem - how to integrate timing into your design. Time is THE most important thing when you design a robot. You can have a dumb cap and resistor running the whole show as in Beam robotics and get great results. The further you abstract your framework and the more you rely on the simulation the more damage you will do to the community. What we really need is a BUS not a framework. And a way to hook up that bus to a PC. We do not need a framework without a bus. Robotics Studio is trying to create a distributed environment where timing will be explicit. These concepts are only applicable to the simple machines most people mistakenly call robots -- these are NOT robots. These are toys. [...] Instead learn the following:

  • Everything you can get your hands on regarding oscillations and timing related to oscillations. Forget about traditional NNs - they are made of Layers (just like Ogres - watch shrek 1 for a good explanation) If they weren't made of layers (backpropagation is only good for non-temporal patterns) they just might work... Statistics (corrleation especially). Embodyment. Compliance. Types of learning (Supervised, Unsupervised, Reinforced). Hardware such as Bus design, servos, H-bridges, sensors. Forget PID control - your bot has to learn it otherwise it will not integrate well with the rest of the system.
  • One thing I want to tell about amature robotics - the only way you get a complex behaviour is by increasing the number of actuators/sensors. The only way you can do it is by making the devices more affordable. The only way you can do that is by making the devices simpler and standardizing them. Robotics Studio wants to add XML processing to devices which contain 25-128 bytes of ram such as pics... and waist my precious CPU cycles on parsing it. I got one word: forgettaboutit.

Cofing4Fun also has a page dedicated to programming for LEGO Mindstorms (although no articles that cover the new NXT version yet). All this should run on top of Visual Studio Express, which is free.

Questions to Ponder

Questions to ponder after reading of Cognition and Reality, The Quest for Consciousness, and The Feeling of What Happens.

  • How does it happen that different people notice different aspects of the same situation?
  • Why are some potions of the retinal input treated as belonging to the same object, others as independent?
    • May it have something to do with results of co-occurrence analysis?
  • Why do we often seem to perceive the meanings of events rather than their detectable surface features?
  • How are successive glances at the same scene "integrated"?
  • Why is perceiving almost always accurate, given inadequacies of the retinal image?
  • If percepts are constructed, why are they usually accurate?
  • What kind of cognitive structure does perception require?
  • What happens when we choose what to see and how do we learn to see better?
  • How are illusions and errors possible if perception is simply the pickup of information that is already available and specific?
  • How do we pickup unanticipated information?
  • What happens when a new object enters the field of view for the first time?
  • How is it decided whether to use an existing schema or to develop a new one (exploitation vs. exploration)?
  • How can anything at all be seen in a brief flash if perception is a temporally extended activity?
  • How is schema modified by new information? What elements of it are being modified? Anticipation? Motor program?
  • Is development of the schema from the general to the specific, from undifferentiated to precise or in the opposite direction?
  • Why do introspective reports suggest that the meaning is available first, and the stimulus details only later or not at all?
    • This is probably because introspection starts from the highest level schema of all active schemata
  • How does schema come to exist?
    • Some may be innate, but for those that are not, what's the mechanism? Co-occurrence?
    • The perceptual cycle must occur before it can develop
  • Can we perceive something not having an appropriate schema?
    • Maybe only at the lowest possible level, which will allow for bootstrapping based on some mechanism, like co-occurrence.
  • Which schema develops earlier: the more complex "high-level" one, or the more simple "low-level" schema? (for example, shape vs. smile)
  • How do schemata get co-activated? When they are related? Include one another? Not related?
  • Does a schema for a concept (for example, "number") get easier and faster activated than a schema for an object that is part of that category (for example, number "five")? How do they influence each other?
  • Is it possible to deploy more than one schema at a time?
    • Yes, multiple schemata can be active, even though they have different level of activation (and influence of other schemata); based on that level of activation they compete with other schemata for resources (gaze, effectors, and so on)
  • What does presence of expected/unexpected information in the environment does to the schema? In terms of activation? In terms of its modification?
  • Why is it difficult to pick up information from two different messages/streams at the same time?
  • Is it possible to attend to two things at once?
  • What are the limits of automatic mental activity?
  • Why/how does perception depend on a skill? What skill is that?
    • Motor program that supports co-activation of other programs...
  • Are schemata being influenced (activated/modified) by unattended information?
  • Is there a single mechanism responsible for our cognitive limitations? What are the limitations? (The Magical Number Seven)
  • What types of conflicts may arise from activation of two schemata? How do those conflicts get resolved/arbitrated?
  • Is there some impediment to the parallel development of independent, but similar schemata? Does existing schema "canalize" incoming information and experience effectively preventing the second (similar) schema to be formed?
  • Is consciousness an aspect of activity or an independently definable mechanism?
  • Is it possible to think without being conscious?
  • How does the subject know whether the present content of his consciousness originated with an external stimulus?
  • Does imagery appear when pickup of information is delayed or interrupted?
  • If images are anticipations rather than pictures, what's going on when we describe them?
  • How can we imagine things we don't expect, for example, things we know cannot happen?
    • Anticipations concern things that only might come to pass rather than those things whose existence is already established
  • How does detachment of images from the immediate context come about?
    • This detachment occurs inevitably in at least one situation with which we are all familiar: locomotion
    • Any delay between the anticipation and the pickup creates a state of unfulfilled perceptual readiness, and the inner aspect of that active shema is a mental image
  • How are cognitive maps (and other types of schemata) acquired?
  • What sorts of information do they incorporate at various stages in their development?
  • How are they altered by experience?
  • Under what conditions are they forgotten?
  • Where many schemata exist, what distinguishes the right one?
  • How schemata stored in memory? What represents long-term memory?
    • Repeated representation of the same material constitute a regularity to be detected
    • Use of schemata accounts for remembering
    • Forgetting occurs whenever the present inputis not specific enough to select a schema unequivocally
  • What's the difference between perception and imagery?
    • Perception is a cyclic activity that includes an anticipatory phase; imagery is anticipation occurring alone
  • What schemata are we born with? Schemata sensitive to expressions of emotion? Intentions?
  • How words come to refer to objects?
    • Is it required to be "engaged in two perceptual cycles at once"? (Cognition and Reality, p.164)
    • Are those words embedded in the schema or related to/associated with it?
  • What is the principal function of grammatical structure of a sentence?
    • Is it to help the listener to develop proper anticipations that may span many seconds? (Ibid, p.167)
  • How does one describe what one sees?
  • Why do we feel that we know what we're going to say, but only in some general way?
  • How does anticipations get detached from the stimulus information that generated them to become things we imagine?
  • Who has more freedom in their actions: adult or infant?
  • How do we perceive emotion of feeling in others?
    • Perceiver needs to have schemata to pick this information up
    • How much do these schemata depend on social experience?
  • Why do we (sometimes) hear multiple voices arguing?
  • How is anticipated image formed? What specifically is being anticipated (out of many possibilities)?
  • How is movie-in-the-brain generated? How does the brain generate the sense of an owner and observer for that movie?
  • What's the relationship between consciousness and emotion? (Emotion And Consciousness)
  • Are there any non-conscious feelings? Are we conscious of all our emotions and feelings?
  • What distinguishes emotion from feeling?
    • Emotions are external indication of feelings
    • Emotions are experienced as feelings
    • Emotions play signaling role; feelings facilitate learning and motivate behavior (on a larger scale) or produce a specific behavior (on a smaller scale) like freezing
      • "[Pleasure] is related to the clever anticipation of what can be done not to have a problem." "[Nature] seduces us into good behavior." (The Feeling of What Happens, p.78)
    • Expression of emotion drives feeling, which in turn supports expression of emotion; chicken and egg
  • Can we control/suppress emotions/feelings?
  • Why do we need to be conscious to have feelings or express emotions?
  • What's the difference between sensing pain and knowing you have pain?
  • What distinguishes zombies from normal (non-zombie) organisms? Automatism? Lack of intentions? Lack of continuity of purpose? Always-on learning (zombie learning)?
  • What are the properties of consciousness?
    • William James (cf. The Feeling of What Happens, p.126): it's personal, selective, continuous, and pertains to objects rather than itself
  • Why is there significant delay (up to 500ms; Libet's experiments) between registering of a sensation and conscious experience of it?
  • How does always changing mind preserve its identity? What is is that provides mind with this core that is always the same? Is "self" a concept as any other concept and is built based on interactions with other individuals (culture and meme links)?

Modeling tools and simulators

What would I want from a framework or a simulator?

  1. It needs to implement biologically plausible models
  2. It needs to be fast enough to model tens of thousands on neurons with 1ms step (#1 and #2 usually don't come together)
  3. It needs to provide an interface for developing simulations and monitoring their execution with real-time visualization (ideally as good as Neural Viewer does) and network activity diagrams
  4. It needs to be easily configurable. All network parameters and their visualization should be compactly specified
  5. It should support various sensory and motor systems
  6. It should provide a comprehensive set of networks to play with

When I was a kid I liked playing blind chess with my dad and my brother. This is the type of chess when you make moves without looking at the board at all (you just tell your opponent what piece to move and where). I couldn't beat my father (to my defence I couldn't beat him very often in regular chess either), but I could play through end-game with 30 moves or more.

Now I want to compare this skill with my inability to remember a 10-digit phone number after hearing it one time or copy a 9-digit loan number between two computer screens from memory (I need to look at it at least twice). We all know about a span of immediate memory and its limit of seven plus or minus two items (The Magical Number Seven), which (partially) explains my difficulties with numbers. I just don't see how to reconcile it with my ability to play blind chess.

For those who never played blind chess, here is a brief summary of what you need to do. You need to 1) keep the current position in your head (initially 32 pieces on 64 squares), 2) be able to evaluate this position, and 3) consider possible moves and your opponent's responses (for any serious game this analysis needs to be done for several iterations/levels).

One possible explanation is that blind chess has more to do with mental imagery and less with working memory. While imagery is definitely involved here, I don't think this solves the problem. For starters, I don't imagine seeing the board as you'd see it in real life. The board that I see has no texture and no color; even squares are not colored black and white. I can "paint" it any color I want, but this will require some effort and "by default" it's completely feature-less. Much to my own suprise (this is the first time I examine this aspect of my own mental imagery), pieces too have no colors and even no shapes. I just "know" that this square has my pawn and that square has opponent's bishop. As with board colors, I can make them look anything I want, but this doesn't change the fact that by default they don't have any look. It's difficult to explain, but I just know conceptually that this is my piece on this square. Which makes me think that this has less to do with mental image and more with working memory that not only stores all those concepts and their relationships, but is also involved in analysis of the position and future moves.

Another possible explanation is that each of those tasks involves a different type of working memory; alternatively, there may be no separation, but working memory may have different task-specific capacity limits. For example, this article provides seven views on this capacity limit:

  1. There are capacity limits but that they are in line with Miller's 7+2.
  2. Short-term memory is limited by the amount of time that has elapsed rather than by the number of items that can be held simultaneously.
  3. There is no special short-term memory faculty at all; all memory results obey the same rules of mutual interference, distinctiveness, etc.
  4. There may be no capacity limits per se but only constraints such as scheduling conflicts in performance and strategies for dealing with them.
  5. There are multiple, separate capacity limits for different types of material.
  6. There are separate capacity limits for storage versus processing
  7. Capacity limits exist, but they are completely task-specific, with no way to extract a general estimate.

Yet another explanation is that somehow I trained myself to store and process all this information. I don't buy this argument simply because I have been dealing with short sequences of digits for much longer than with blind chess, but without any significant progress.

I think it has something to do with the fact that "chunks" that I'm trying to remember in the first case (digits in a phone/loan number) are not related, while in the second case they are tightly related to each other. Even more importantly, what needs to be captured is a sequence of digits, rather than a group. This may be caused by mutual interference between elements in the sequence; as a result the sequence requires significant effort to maintain itself. If those number can be somehow related to each other (for example, they may be similar to a social security number) it may be much easier to remember by capturing "social security number with some modifications...". I think this also supports the idea of a hierarchical organization of (working) memory as while the chess play requires to maintain significantly larger number of elements, they all seem to be maintained "inside" that concept without much bear on working memory. Yes, you need to focus and pay attention when you play blind chess, but I'd argue that the effort required is probably less than when you need to remember a 9-10 digit sequence for 30 seconds.

Note that this effort doesn't seem to be proportional to the number of pieces to maintain in memory: it's easier to keep the position in memory in the beginning of the game when all 32 pieces are present rather than during end game where only few pieces may be left. This may be related to the fact that the initial position is more constrained (has fewer degrees of freedom) than any end game position, which makes it easier to track.

2 years 3 weeks

I'm watching Andrey who is trying to reach across a small table to grab his toy car. The car is too far and while the table is small he keeps trying even though he can't reach it and it would be very easy for him to go around the table and take the car. Still, he persists in using his (not working) method even after I tell him about the alternative.

This reminded me about what Douglas Hofstadter called a difference between physical distance and problem distance (Godel, Escher, Bach: an Eternal Golden Braid, p.612). When a dog wants to get his favorite bone that is just few feet away, but on another side of a fence with a gate not too far away he has two choices: 1) run up to the fence, stand next to it and bark; or 2) get to the open gate and double back to the bone. While the second options seems to be counter-intuitive (as it increases the distance to the bone), it actually brings the dog closer to it.

What needs to "click" in dog's brain for this to happen? What's that new level of abstraction that allows to solve this problem? What does this solution in some abstract "problem space" have to do with the real material world, where this problem is being solved in a similar way?

Last time this happened six weeks ago. After brief one-time experience with a real fence Andrey knows how to fetch a toy he wants even though he may need initially to move away from his target. I continue being amazed how quickly he learns even from one-time experiences.

Chris Chatham posted a brief overview of the classic working memory model and offered a new diagram for that model. There are several interesting points, but I'd like to emphasize two of them: processing and memory are two sides of the same coin and gating functions are likely to be present at every intersection of arrows on the working memory diagram.

Having said that, I side with O'Reilly and his collaborators who proposed a completely different working memory model outlined in their article Banishing the Homunculus: making working memory work. The model is based on tripartite architecture, which is composed of the posterior cortex (PC) that performs majority of "automatic" sensory and motor processing, the hippocampus (HC) that is responsible for rapid learning that binds together arbitrary information, and the prefrontal cortex and the basal gandlia (PFC/BG) system that maintains internal contextual information (PFC), which can be dynamically updated by the BG.

In this model working memory is defined as an emergent property of the interactions between these three brain areas. These mechanisms not only support those basic memory functions that are generally associated with working memory, but also those controlled processing functions that are typically associated with a "central executive". The critical difference between this model and the model covered in Chris's post is that all information is viewed to be distributed in a relatively stable configuration throughout the cortext with working memory being represented by the controlled activation of those distributed representations. Authors' view is that working memory and executive function are two sides of the same coin, based on the fact that processing and memory functions are typically distributed within and performed by the same neural substrates.

As a side note, while a framework that separates working and long-term memory may look familiar and computantionally appealing, think about "simple" operation of copying a concept from long-term to working memory. What mechanism would support this copying that (to properly reproduce the concept) would need to include not only sensory and motor information associated with it, but also (potentially) large number of related concepts? How would "activation" of such a copy look like?

The authors also identify six key functional demands underlying working memory and describe how those are addressed in the proposed model:

  • Rapid updating
  • Robust maintenance
  • Multiple, separate working memory representations
  • Selective updating
  • Top-down biasing of processing
  • Learning what and when to gate

There is one more interesting point in the paper that I'll expand on later:

...it is likely that working memory may represent a kind of phylogenetic extension of the same kinds of mechanisms that underlie all forms of complex motor coordination and planning. (emphasis mine)

update 2006/03/20: More on working and long-term memory processes and assessment from Intelligence Testing.

2 years 1 week

Andrey is playing with his older brother, Daniil, who is 10 years old. Daniil is asking him to say words (in Russian) -- "say, water; say, pipe; say, boat" -- and Andrey is repeating those words after his older brother. Now it's Andrey's turn: "say, (then after a 3 second delay, mimicking his brother) ...lamp; say, (after looking around) ...room; say, ...car" (much to our surprise as there wasn't any car around him). It looked like he was thinking about what to ask next and was doing search in his memory.

Overall, Andrey appears to be very proficient with words. He can easily say 4-syllable words, 4-5 word sentences, requests like "louder" (when listening to music in a car) and descriptions like "the smallest" (showing a small toy he is playing with).

1 month

Alysa's hand movements look more meaningful. She is not trying to reach for anything yet, but her movements look more purposeful maybe because they are longer, not as random as they used to be, and sometimes follow her eyes (probably just by accident). Her eye-hand coordination has also improved considerably. When she is awake she prefers positions where she can observe what's happening around her.

4 weeks

Alysa at the rosy age of four weeks

Best Robots Ever

First issue of Wired in 2006 published a list of The 50 Best Robots Ever. Unfortunately, my favorite robot, Johnny-5, was not on the list, even though the list includes many movie robots. I found it on The Ten Best Movie Robots list.

Out of those robots that were on the list, my favorite is Genghis, a six-legged walking robot created by Rodney Brooks and his team and covered with much detail in Brooks's book Flesh and Machines. Its unique (for its time) feature was that it was based on the subsumption architecture (described in more detail in How to build complete creatures), which provides an incremental method for building robot control systems that link perception to action. This control system is implemented as a set of layers, with each layer progressively implementing more and more complex behavior; robot's macro behavior (following people) arises from many independent micro behaviors (moving legs, balance control, steering, and the like). Conflict resolution happens at the motor command level rather than at the sensor or perception level resulting in coherent, smooth, and, as Brooks calls it in his book, "lifelike" behavior. However, contrary to what the Wired article says, the robot doesn't learn; this behavior is "innate" and doesn't change with time.

Those of you who are interested in first hand experience, can start from one of the kits on the Top 10 Robot Christmas Gift Ideas compiled by the robots.net folks.

update 2006/03/20: Not really best robots, but still related to this topic. A brief overview of various military robots from Developing Intelligence

2 years

Andrey on his second birthday

The functioning of an organism in a complex and unpredictable environment is supported by two complimentary processing systems, cognition and emotion. While the cognitive system is responsible for interpreting and making sense of the world, the emotion system is responsible for evaluating and judging events to assess their overall value with respect to the organism (Robot Emotion: A Functional Perspective; p.2). Paul Ekman in Basic Emotions chapter (Handbook of Cognition and Emotion; ch.3) offers a slightly different perspective: "...the primary function of emotion is to mobilize the organism to deal quickly with important interpersonal encounters, prepared to do so by what types of activity have been adaptive in the past."

More specifically, functions of emotions in animals (including humans) can be summarized as follows (based on Neural Networks and Brain Function; pp.138-40):

  1. Elicitation of automatic responses. Emotion may elicit an autonomic (for example, a change in a heart rate) or endocrine (for example, the release of adrenaline) response that prepares the body for action.
  2. Flexibility of reinforcement. Emotional states allow a simple interface between sensory inputs and motor outputs, because only the valence of of the stimulus to which attention is being paid needs to be passed to the motor system, rather than a full representation of the sensory world. In addition to that, when a stimulus elicits an emotional state, we can flexibly choose any appropriate response, which is more flexible than simply learning a fixed behavioural response to a stimulus.
  3. Motivation. For example, fear learned by stimulus-reinforcement association formation provides the motivation for actions performed to avoid noxious stimuli.
  4. Communication. Animals can communicate their emotional state to others. What is the role of mirror neurons?
  5. Storage of memories. Emotion may facilitate the storage of memories. This may be advantageous in that storing many details of the prevailing situation when a strong reinforcer is delivered may be useful in generating apropriate behavior in similar situations in the future. This may be facilitated in several ways: a) by bringing more attention to important situations the animal may be storing more information, b) by guiding attention toward what's important and away from distractions one may be storing more relevant information and c) by involving emotions stronger associations can be built.
  6. Evaluation of memories. The current mood state can affect the cognitive evaluation of events or memories.
  7. Memory recall: Emotion may trigger the recall of memories stored in neoroctical representations. Amygdala backprojections to the cortex could perform this for emotion in a way analogous to that in which the hippocampus could implement the retrieval of recent memories in the neocortex.
  8. Better performance. The slowing of cognitive processes caused by negative emotions may enable more careful and deliberate scrutiny of self and circumstances, allowing the individual to gain a new perspective to help improve performance in the future. Negative effect allows us to think in a highly focused way while positive effect allows us to think more creatively and to make broader associations.

Paul Ekman in Handbook of Cognition and Emotion identifies 15 basic emotions: amusement, anger, contempt, contentment, disgust, embarrassment, excitement, fear, guilt, pride in achievement, relief, sadness/distress, satisfaction, sensory pleasure, and shame. He also discusses enjoyment and surprise, but doesn't include those as basic emotions as they don't "subserve patterns of motor behavior which were adaptive for each of these emotions, preparing the organism for quite different actions." Also, interest "may be better regarded as a cognitive state rather than an emotion..." This list of emotions also doesn't include love, hate, grief, and jealousy as these are "emotional plots, more specific, more enduring than the basic emotions, specific contexts in which a number but not all of the basic emotions can be expected to occur." Ekman also defines moods -- which have different causes and last much longer, and are highly saturated with emotions -- and personaly traits, such as hostility.

Cynthia Breazel and Rodney Brooks in Robot Emotion: A Functional Perspective provide descriptions for a number of basic emotions that were implemented in the robot Kismet and gives details of that implementation (pp.17-20)

  1. Anger. Anger serves to mobilize and sustain energy and vigorous activity at high levels. If it often elicited when progress toward a goal is hindered or blocked. This mobilizes an organism to try alternative strategies.
  2. Disgust. Disgust is manifested as a distancing from some object, event, or situation, and can be characterized as a rejection of an unwanted stimulus.
  3. Fear. The function of fear is to motivate avoidance of escape from a dangerous situation.
  4. Joy. The emotion of joy is believed to heighten openness to experience. It often arises upon the success of achieving a goal or in the pleasure of mastery, exhibited even by very young children. The expression of joy operates as a universally recognizable signal of readiness for friendly interaction.
  5. Sorrow. Experiencing of sadness is causing to slow the cognitive and motor systems, which enables one to reflect upon a disappointing performace to gain a new perspective that will help improve future performance. The expression of sorrow communicates to others that one is in trouble and increases likelyhood that the others will feel sympathy and lend assistance.
  6. Interest. Interest motivates exploration, learning, and creativity. It mobilizes the creature for engagement and interaction. It serves as a mechanism of selective attention that keeps the creature focused on a particular object, person, or situation, and away from other distractions that impinge upon its senses.
  7. Boredom. Boredom is an emotion that arises when the organism is not stimulated for a while. Hofstadter has an interesting note about boredom in Godel, Escher, Bach (p.621): "you get bored with something not when you have exhausted its repertoire of behavior, but when you have mapped out the limits of the space that contains its behavior."

update 2006/03/02: Just noticed that Chris Chatham also posted some interesting (as always) information on this subject on his weblog earlier this month: Emotional Robotics and Simulating Emotion.

update 2006/03/21: This article by Edmund Rolls has an interesting diagram that put emotions on a scale from rage to relief and from terror to ecstasy.

3 weeks

Alysa maintains much better head control now and can turn her head while sleeping.

10 days

Alysa is well coordinating her eyes; this wasn't the case even two days ago.

You and Your Research

You and Your Research
Richard Hamming

One of the characteristics of successful scientists is having courage. Once you get your courage up and believe that you can do important problems, then you can. If you think you can't, almost surely you are not going to.

  1. What are the most important problems in your field?
  2. Are you working on one of them?
  3. Why not?

When you are famous it is hard to work on small problems.

``Knowledge and productivity are like compound interest.'' Given two people of approximately the same ability and one person who works ten percent more than the other, the latter will more than twice outproduce the former. The more you know, the more you learn; the more you learn, the more you can do; the more you can do, the more the opportunity - it is very much like compound interest.

...it is not sufficient to do a job, you have to sell it. [...] There are three things you have to do in selling. You have to learn to write clearly and well so that people will read it, you must learn to give reasonably formal talks, and you also must learn to give informal talks.

The people who do great work with less ability but who are committed to it, get more done that those who have great skill and dabble in it, who work during the day and go home and do other things and come back and work the next day. They don't have the deep commitment that is apparently necessary for really first-class work. They turn out lots of good work, but we were talking, remember, about first-class work.

...some of the reasons why so many people who have greatness within their grasp don't succeed are: they don't work on important problems, they don't become emotionally involved, they don't try and change what is difficult to some other situation which is easily done but is still important, and they keep giving themselves alibis why they don't.

If you read all the time what other people have done you will think the way they thought. If you want to think new thoughts that are different, then do what a lot of creative people do - get the problem reasonably clear and then refuse to look at any answers until you've thought the problem through carefully how you would do it, how you could slightly change the problem to be the correct one.

The present growth of knowledge will choke itself off until we get different tools. I believe that books which try to digest, coordinate, get rid of the duplication, get rid of the less fruitful methods and present the underlying ideas clearly of what we know now, will be the things the future generations will value.

...in the long-haul, books which leave out what's not essential are more important than books which tell you everything because you don't want to know everything.

Somewhere around every seven years make a significant, if not complete, shift in your field.

About

I am Paul Kulchenko.
I live in Kirkland, WA with my wife and three kids.
I work for Six Apart as a software developer.
I study machine learning and artificial intelligence.
I write books and open-source software.

Recommended

Close