Combining image descriptors to effectively retrieve events from visual lifelogs

TitleCombining image descriptors to effectively retrieve events from visual lifelogs
Publication TypeConference Paper
Year of Publication2008
AuthorsDoherty, Aiden R., O'Conaire CiarĂ¡n, Blighe Michael, Smeaton Alan F., and O'Connor Noel E.
Conference NameMIR 2008 - ACM International Conference on Multimedia Information Retrieval
Conference Date30-31 Oct 2008
PublisherAssociation for Computing Machinery
Conference LocationVancouver, Canada
KeywordsRP4; RP5
Abstract

The SenseCam is a wearable camera that passively captures approximately 3,000 images per day, which equates to almost one million images per year. It is used to create a personal visual recording of the wearer's life and generates information which can be helpful as a human memory aid. For such a large amount of visual information to be of any use, it is accepted that it should be structured into "events", of which there are about 8,000 in a wearer's average year. In automatically segmenting SenseCam images into events, it will then be useful for users to locate other events similar to a given event e.g. "what other times was I walking in the park?", "show me other events when I was in a restaurant". On two datasets of 240k and 1.8M images containing topics with a variety of information needs, we evaluate the fusion of MPEG-7, SIFT, and SURF content-based retrieval techniques to address the event search issue. We have found that our proposed fusion approach of MPEG-7 and SURF offers an improvement on using either of those sources or SIFT individually, and we have also shown how a lifelog event is modeled has a large effect on the retrieval performance.

URLhttp://doras.dcu.ie/2081/