Anil K. Jain
University Distinguished Professor
Department of Computer Science
Michigan State University
Over the past 40 years, Anil Jain has worked on the design and applications of pattern recognition systems. Currently, Jain and his students are devoting their efforts to three challenging problems: automatic fingerprint recognition, automatic face recognition, and large-scale data clustering. Specific projects are summarized below:
Latent Fingerprint Identification
Latent fingerprints are partial impressions of fingers found at crime scenes, which serve as crucial evidence to apprehend and convict a suspect. Hence, automatic and accurate comparison of latent prints to rolled fingerprints (exemplars) in law enforcement databases is critical in forensics. In this research, they incorporate feedback from the exemplar to refine the features extracted from a latent fingerprint and so improve identification accuracy. Experiments for this research involved comparing 700 latent prints to 100,000 rolled prints. The HPCC resources allowed them to run their matcher in parallel on 144 single-core machines, thereby reducing the comparison time from about 250 days to about 20 days, a speedup of ~12x.
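To make the scale of this experiment concrete, the following is a minimal sketch of how such a workload could be divided among workers. The helper `split_evenly` and the choice to partition by latent print are assumptions made for illustration; the article does not describe how the matcher jobs were actually scheduled.

```python
def split_evenly(n_items, n_workers):
    """Return (start, stop) index ranges dividing n_items across n_workers."""
    base, extra = divmod(n_items, n_workers)
    ranges = []
    start = 0
    for worker in range(n_workers):
        # The first `extra` workers take one additional item each.
        size = base + (1 if worker < extra else 0)
        ranges.append((start, start + size))
        start += size
    return ranges

# Figures taken from the article.
n_latents = 700
n_rolled = 100_000
n_workers = 144

# Every latent print is compared against every rolled print.
total_comparisons = n_latents * n_rolled

# Reported wall-clock times: about 250 days serial, about 20 days on HPCC.
speedup = 250 / 20

# Hypothetical partition: each worker matches its share of the latent
# prints against the full rolled-print database.
chunks = split_evenly(n_latents, n_workers)
largest_share = max(stop - start for start, stop in chunks)

print(total_comparisons, speedup, largest_share)
```

Under this assumed partition, no worker handles more than five latent prints, so each job covers at most 500,000 of the 70 million comparisons.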
Longitudinal Study of Face Recognition
Determining the persistence of face recognition over time is an important yet challenging problem. The shape and texture of a human face naturally change with aging, which leads to recognition errors. In Jain’s research, they are conducting a large-scale longitudinal study of how facial aging affects the performance of state-of-the-art recognition systems. Their study uses statistical models to analyze the variation in face comparison scores with respect to covariates such as elapsed time, age, gender, and race. The goal is to determine the trend in face recognition accuracy over time. To obtain reliable parameter estimates for the models, they rely on bootstrapping. Because of the large size of the study (~148K face images of 18K subjects), bootstrapping involves fitting a statistical model to 1,000 random samples (with replacement) of 18K subjects. Fitting each model can take more than 1 hour, so being able to run the 1,000 bootstraps in parallel on HPCC is extremely helpful.
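The bootstrap procedure described above can be sketched as follows. This is an illustration under stated assumptions, not the study's code: the data are synthetic, the "model" is a simple least-squares slope of comparison score against elapsed years (a stand-in for the statistical models actually used, which the article does not specify), and the names `fit_slope` and `bootstrap_replicate` are hypothetical.

```python
import random

def fit_slope(records):
    """Least-squares slope of score against elapsed years (stand-in model)."""
    n = len(records)
    mean_t = sum(t for t, _ in records) / n
    mean_s = sum(s for _, s in records) / n
    num = sum((t - mean_t) * (s - mean_s) for t, s in records)
    den = sum((t - mean_t) ** 2 for t, _ in records)
    return num / den

def bootstrap_replicate(data_by_subject, seed):
    """Resample subjects with replacement, then refit the model once."""
    rng = random.Random(seed)
    subjects = list(data_by_subject)
    sample = rng.choices(subjects, k=len(subjects))
    # A subject drawn twice contributes all of its records twice.
    records = [rec for sid in sample for rec in data_by_subject[sid]]
    return fit_slope(records)

# Synthetic toy data (an assumption): 100 subjects, 8 yearly scores each,
# with a true decline of 0.01 per year plus subject and measurement noise.
rng = random.Random(0)
data_by_subject = {}
for sid in range(100):
    offset = rng.gauss(0.0, 0.05)
    data_by_subject[sid] = [
        (t, 0.9 + offset - 0.01 * t + rng.gauss(0.0, 0.02)) for t in range(8)
    ]

# Each replicate depends only on its seed, so on a cluster every call
# could be submitted as a separate job instead of running in a loop.
estimates = sorted(bootstrap_replicate(data_by_subject, s) for s in range(1000))

# Approximate 95% percentile interval from the 1,000 sorted estimates.
ci_low, ci_high = estimates[24], estimates[974]
print(ci_low, ci_high)
```

Resampling whole subjects, rather than individual images, keeps each subject's repeated measurements together, which matches the article's description of drawing samples of 18K subjects.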
Face Image Clustering
Investigations that require the exploitation of large volumes of face imagery are increasingly common in current forensic scenarios due to the prevalence of surveillance video, as well as the video and image recording capabilities of cell phones. Effective solutions for triaging such imagery (i.e., sorting it into low importance, moderate importance, and critical interest) are not available in the literature. Investigators in these scenarios generally face a lack of systems that can scale to large volumes of images, say 100M, and a lack of established methods for clustering face images into an unknown number of identities. As such, they investigate the problem of clustering large databases of face images, attempting to group images by identity. The computational requirements for handling large databases are substantial; simply extracting descriptive features from 1 million face images could take ~20 hours on a single machine. Aside from feature extraction, computing lists of the most similar individuals for every image in a large database (a prerequisite for some clustering methods) is costly. Typically, a single machine may take on the order of a week to process a single database, but by leveraging HPCC resources this task can be accomplished in less than a day……
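The cost of computing similarity lists can be seen in a minimal brute-force sketch. Cosine similarity, the toy three-dimensional feature vectors, and the function name `top_k_neighbors` are all assumptions for illustration; the article does not say which features, similarity measure, or search method the group uses.

```python
import heapq
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def top_k_neighbors(features, k):
    """For each vector, return the indices of its k most similar others."""
    neighbors = []
    for i, query in enumerate(features):
        # Score the query against every other vector: O(n) work per image,
        # so O(n^2) comparisons over the whole database.
        scored = (
            (cosine_similarity(query, other), j)
            for j, other in enumerate(features)
            if j != i
        )
        best = heapq.nlargest(k, scored)
        neighbors.append([j for _, j in best])
    return neighbors

# Toy features: images 0 and 1 resemble each other, as do images 2 and 3.
features = [
    [1.0, 0.1, 0.0],
    [0.9, 0.2, 0.0],
    [0.0, 1.0, 0.1],
    [0.1, 0.9, 0.0],
]

neighbors = top_k_neighbors(features, k=1)
print(neighbors)  # → [[1], [0], [3], [2]]
```

Because each image's neighbor list is computed independently of the others, the outer loop can be split across many machines, which is one way a week-long single-machine job could shrink to under a day.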