jump to navigation

Faces everywhere September 21, 2007

Posted by Johan in Face Perception, Links.
add a comment

 

It’s amazing how faces seem to appear everywhere – whether this is a result of a face-specific innate mechanism, or just tons of practice with this particular class of stimulus. I recently discovered the wonderful Faces in Places blog, which is filled with examples of faces appearing in the most unlikely places. The picture above is perhaps a poor example of the human readiness to perceive anything as a face, since this particular item was likely designed with faces in mind.

 

 

 

 

That is a less plausible explanation for why these motorcycle gas tanks look like faces – they weren’t designed to be viewed from this angle.

We seem to be inclined to interpret almost any object with three enclosed dots as a face. This is quite different from other objects with which we have a lot of experience: you probably spend considerable amounts of time looking at your hands in order to guide your actions, but if you’re anything like me you  don’t see hands everywhere.

Detecting genetic disorders with 3d face scans September 16, 2007

Posted by Johan in Abnormal Psychology, AI, Applied, Behavioural Genetics, Developmental Psychology, Face Perception.
add a comment

Following on from last week’s post on smile measuring software, The Scotsman (via Gizmodo) reports on the work by Hammond and colleagues at UCL, who are developing 3d face scans as a quick, inexpensive alternative to genetic testing. This is not as crazy as it sounds at first since it is known that in a number of congenital conditions, the hallmark behavioural, physiological or cognitive deficits are also (conveniently) accompanied by characteristic appearances. The classic example of this is Down syndrome, which you need no software to recognise. More examples appear in the figure above, where you can compare the characteristic appearances of various conditions to the unaffected face in the middle.

Hammond’s software can be used to identify 30 congenital conditions, ranging from Williams syndrome (a sure topic of a future post) to Autism, according to the Scotsman. I know of no facial characteristics of autism, so I would take that part of the story with a grain of salt. The system claims an accuracy rate of over 90 percent, which is not conclusive, but certainly good enough to inform a decision to carry out genetic tests that are. The UCL press release gives some more information about how the software works:

The new method compares a child’s face to similarly aged groups of individuals with known conditions and selects which condition looks the most similar. In order to do this, collections of 3D face images of children and adults with the same genetic condition had to be gathered, as well as controls or individuals with no known genetic condition.

It really is too bad that the software uses 3d images – those cameras are neither cheap nor ubiquitous, which somewhat defeats the point of using this software as an affordable alternative to (or initial screening for) genetic testing. I can’t help but wonder if it wouldn’t be possible to achieve similar accuracy using normal portraits. If you can tell how much someone is smiling in a photo, you should be able to pick up on that extra chromosome…

AI detection of facial expressions September 7, 2007

Posted by Johan in AI, Applied, Emotion, Face Perception.
1 comment so far

I’ve written previously about how algorithms that detect faces in images are appearing everywhere, including Google Images and many recent digital cameras, where they are used to ensure that focus is on the face (presumably, no one who buys a Cybershot is interested in the aesthetic effects of not having the face in focus).

This technology is being expanded into the realm of specific facial expressions by OMRON (among others), a company that just released software that promises to measure the smile factor of faces in a picture. The smile factor as OMRON conceives of it goes from 0 to 100 %, and will presumably serve to shift the blame nicely when you want people to smile more in a picture (“look, I think the picture is fine, but the camera thinks you should be smiling more”). It is only a matter of time before this makes it into digital cameras, soon followed by a spinach-on-the-teeth detector.

Other proposed applications for OMRON’s software include human-computer interactions, and as an objective measure of liking in food tasting studies. I imagine the software would also be useful for more theoretical investigations into emotional expressivity. As it stands, scoring the magnitude or kind of expression manually is quite tricky.

It never ceases to amaze me how object recognition software is steadily advancing along the ventral visual stream.

N170 face controversy continues June 27, 2007

Posted by Johan in Face Perception, Neuroscience, Sensation and Perception, Social Neuroscience.
1 comment so far

A while back I blogged a paper by Thierry et al (2007 – see also the Neurocritic’s post). Some controversy is brewing about the paper now, so I thought I’d offer an update.

To recap: the Thierry et al (2007a) paper is interesting because it challenges the notion that a specific component of the EEG waveform called the N170 (since it is negative, and occurs at 170 ms) is specific to faces. Thierry et al found that the N170 disappeared when they controlled for inter-stimulus perceptual variance (ISPV), that is, the fact that faces tend to be presented in portraits, while other stimuli are often shown at various angles and sizes.

As the comments on the Neurocritic’s post suggest, some investigators were not entirely convinced by Thierry et al’s (2007a) demonstration. Now one of the critical comments has made its way into the latest issue of Nature Neuroscience, along with a reply from Thierry et al (2007b).

Bentin et al (2007) point to previous research that shows how controlling for ISPV does not in fact explain the N170 specificity to faces. They’ve packed a bit too much information into this figure than necessary, but it’s worth a look:

The black line indicates the mean pixel-wise correlations within each stimulus group, that is, which is an indicator of how much ISPV there was. The grey bars show the N170 amplitude. If Thierry et al (2007) is correct in their argument that the N170 reflects ISPV rather than face specificity, you would expect the black line (indicating ISPV) and the grey bars (indicating N170 amplitude) to match up reasonably well. This isn’t really happening. To make this point even clearer, they show in the supplements that Thierry et al’s (2007) own data seems to indicate the same thing. Bentin et al (2007) also take Thierry et al (2007a) to task for failing to note how their conceptualisation of the N170 contradicts a bulk of well-known effects in previous literature.

So how could Thierry et al (2007a) get such contradictory results? Bentin et al (2007) suggest that one reason may be Thierry et al’s (2007) choice of EEG recording sites, which differ from those generally used by other investigators.

In their reply, Thierry et al (2007b) argue that, among other things, the recording site explanation doesn’t hold, since the same pattern of results appeared across all electrodes. They also contend Bentin et al’s (2007) notion that there is a generally agreed standard for electrode selection.

As for the lack of similarity between the pixel-wise correlations and the N170, this is explained (if you can call it that) by arguing that pixel-wise correlations may not be a perfect measure of ISPV – individual pixels may have very different effects depending on their location, which is not taken into account with such an analysis.

Confused yet? I don’t think there are any clear answers at this point. At the most basic level, the findings of Thierry et al (2007a) contradict previous findings, something that they appear to have failed to mention themselves. There is also some disagreement over what type of analysis is appropriate for this kind of research. Personally, I would like to see an independent replication of the results before I make any further attempts to understand what’s going on.

References
Bentin, S., Taylor, M.J., Rousselet, G.A., Itier, R.J., Caldara, R., Schyns, P.G., Jacques, C., and Rossion, B. (2007). Controlling interstimulus perceptual variance does not abolish N170 face sensitivity. Nature Neuroscience, 10, 801-802.

Thierry, G., Martin, C.D., Downing, P., & Pegna, A.J. (2007a). Controlling for Interstimulus Perceptual Variance Abolishes N170 Face Selectivity. Nature Neuroscience, 10, 505-511.

Thierry, G., Martin, C.D., Downing, P., & Pegna, A.J. (2007b). Is the N170 sensitive to the human face or to several intertwined perceptual and conceptual factors? Nature Neuroscience, 10, 802-803.

Google images now detects faces (poorly) June 17, 2007

Posted by Johan in AI, Face Perception, Off Topic, Sensation and Perception.
2 comments

Google has somewhat secretly rolled out a special filter for its image searches, that enables you to restrict your searches to faces. Geeks in the know say that this technology comes from their purchase of Neven Vision, a company I’ve never heard of before.

To use this technology, simply add &imgtype=face to the search URL – adding it in the search box won’t work (although Quicksilver users will find that it does work if you use the Google search plugin – I have no idea why).

Face detection algorithms are all the rage in computer science, and there are lots of implementations out there. I thought I’d compare Google to the competition.

An initial test suggests that Google does rather well. Compare a search for “house” with the image filter and without it. There are only faces (including a certain british-comedian turned american-doctor) in the former, and only houses in the latter.

However, it turns out that it’s quite easy to trip Google up once you throw faces of non-human animals at it. According to Google, this is a face:

But it’s not. So how about the competition? There are many online demos of face detection algorithms, but I quite like PittPatt by the Face Group at Carnegie Mellon. You can just plug in a URL rather than having to upload anything, and it’s quite fast. Pittpatt puts rectangles over each part of the image where it thinks a face is. Nothing lights up for the poor dog above.

Let’s make things more difficult for Pittpatt. Gorillas look more like people, right? Google produces mostly faces, but also this:

Ah, Pittpatt seems confused:

Apparently there are a few facial features in there, but I think the blue colour of the frame indicates uncertainty – from the gallery of uploaded pictures on Pittpatt, it seems the most obvious faces are framed in green or at times yellow.

How about chimps, then? Google fails this one spectacularly – all the faces on the first page are non-human, except for a bearded guy and a few pictures of George W. Bush. Go figure. To be fair, this probably has less to do with Dubya’s chimp-like features and more to do with Google’s page rank system, where a picture that is linked a number of times with the word “chimp” nearby ends up on page one of the results for that term. So Bush doesn’t necessarily look like a chimp, it’s just that the Internet thinks he does.

Google thinks this is a face. And you might be inclined to agree, but for the sake of argument I’m assuming that Google wants their algorithm to pick out human faces, not general primate or mammal faces. Pittpatt produces only another weak blue cube, this time restricted to the mouth of the chimp.

So what can we conclude here? This test is of course terribly unfair – I’ve only picked faces that Google failed so at best, Google could only have done as badly as Pittpatt. But it seems like Google’s face detection algorithm isn’t all that great compared to the alternatives. Another possibility is that they’re just trying not to be specist, but I somewhat doubt that. Also, we have learned that a fake statue of King Kong apparently looks more human than a fat dog or a young chimp, according to Pittpatt. I’m not sure what to do with that information.

In any case, it is pretty fascinating that computers can get this good at face detection. While primate faces can look like a human face to us, there is no way you would confuse the two. I guess you might interpret the ambiguous output of Pittpatt as something similar – there is something face-like there, but it’s not quite it.

Update: Other geeks inform me that the face detection algorithm that Google uses is probably trained by either of their two online games Peekaboom or Image Labeler. I couldn’t figure out how Peekaboom works without registering, but the point of the image labeler game is basically for both players to give the same name to a picture. With a training procedure like that, you suddenly start to see how the chimp above might appear under a “face” search – while “chimp” might be the first word you try there, then possibly “ape,” “face” isn’t going to be far behind. This actually points to a problem of human-trained computers – they acquire all our idiosyncrasies and imperfections, for better and for worse.

Follow

Get every new post delivered to your Inbox.