Using Flickr images for 3D reconstruction

At the Electronic Imaging symposium in 2013, Steve Seitz of the University of Washington and Google gave a very interesting keynote entitled “A Trillion Photos”.

The principle is to exploit the millions of images available in databases such as Flickr. The Building Rome in a Day project harvests images from Flickr simply by searching for keywords such as “Rome” or “Venice”. Many images are unusable because they cannot be matched with other images, for example pictures of a restaurant or of a family. On the other hand, the most popular tourist sites, such as “San Marco”, are photographed from many different angles. Using a standard processing pipeline such as SIFT (local feature extraction) + RANSAC (robust rejection of wrong matches) + FLANN (fast approximate nearest-neighbor search), it is possible to match the images and then perform the 3D reconstruction.
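The matching step can be illustrated with a minimal, standard-library-only sketch. It is not the code used by the Building Rome in a Day project: the descriptors are toy vectors standing in for SIFT descriptors, the brute-force search stands in for FLANN, and RANSAC fits a simple 2D translation instead of the epipolar geometry a real reconstruction would estimate. The function names and parameters (`match_descriptors`, `ransac_translation`, `ratio`, `threshold`) are hypothetical.

```python
import math
import random

def match_descriptors(desc_a, desc_b, ratio=0.75):
    """Nearest-neighbour matching with a ratio test.

    Brute force here; FLANN would perform the same search approximately
    and much faster on real SIFT descriptors.
    """
    matches = []
    for i, da in enumerate(desc_a):
        dists = sorted((math.dist(da, db), j) for j, db in enumerate(desc_b))
        # Keep the match only if the best candidate is clearly better
        # than the second best.
        if len(dists) >= 2 and dists[0][0] < ratio * dists[1][0]:
            matches.append((i, dists[0][1]))
    return matches

def ransac_translation(pts_a, pts_b, matches, threshold=2.0, iterations=100, seed=0):
    """RANSAC with a 2D translation model (assumption: toy model only).

    One match is a minimal sample; the hypothesis with the most inliers wins.
    """
    if not matches:
        return []
    rng = random.Random(seed)
    best_inliers = []
    for _ in range(iterations):
        i, j = rng.choice(matches)
        dx = pts_b[j][0] - pts_a[i][0]
        dy = pts_b[j][1] - pts_a[i][1]
        inliers = [
            (a, b) for a, b in matches
            if math.hypot(pts_b[b][0] - pts_a[a][0] - dx,
                          pts_b[b][1] - pts_a[a][1] - dy) < threshold
        ]
        if len(inliers) > len(best_inliers):
            best_inliers = inliers
    return best_inliers

# Toy data: five keypoints per image; the last one is a wrong correspondence.
desc = [(1, 0, 0, 0), (0, 1, 0, 0), (0, 0, 1, 0), (0, 0, 0, 1), (1, 1, 0, 0)]
pts_a = [(0, 0), (10, 0), (0, 10), (10, 10), (5, 5)]
pts_b = [(3, 4), (13, 4), (3, 14), (13, 14), (50, 50)]

matches = match_descriptors(desc, desc)
inliers = ransac_translation(pts_a, pts_b, matches)
print(len(matches), "putative matches,", len(inliers), "kept by RANSAC")
```

In this toy setting every descriptor finds its counterpart, and RANSAC then discards the one correspondence whose geometry disagrees with the others. Photos that share no consistent set of inliers with any other image are the "unusable" ones mentioned above.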

In this video, the pyramids represent the estimated camera positions. This reconstruction was made using 14,079 pictures. The reconstruction of Venice was made using 250,000 images and 496 computing cores: 27 hours were needed for matching and 38 hours for reconstruction.

CIFED 2012: Best Paper Award

From 21 to 23 March 2012, the Symposium on Writing and Document (CIFED) was held in Bordeaux. Every two years since 1992, CIFED has gathered the French-speaking scientific community to present and exchange ideas on topics related to written documents.

International researchers presented their work on document image analysis and processing. I had the opportunity to take part in this conference and also to present part of my thesis work.

At the end of the conference, I had the honor of receiving the Best Paper Award for my work on the recognition of semi-structured documents.

My oral presentation (in French) is available here.

CORIA and CIFED are the meeting points of the French-speaking communities working on information retrieval and on the analysis of written and scanned documents. While preserving the specificities of each conference, this edition will be an opportunity for both communities to gather around topics such as multimedia document retrieval, models of interaction with the user, information retrieval systems, and performance evaluation tools for information retrieval. More than 120 participants will exchange knowledge during this event.

Date: March 21-23, 2012

Location: LABRI, University of Bordeaux 1

For more information:

ICDAR 2011

The 11th International Conference on Document Analysis and Recognition (ICDAR) took place from the 19th to the 21st of September 2011 in Beijing. The program can be found here.

I presented a poster on my work on document image classification in an industrial context where thousands of documents are scanned each day. My paper presents a new method for fast indexing of document images. One of the difficulties is that the number of classes and the nature of the documents are completely unknown. Many descriptors are extracted, such as the number of words, the number of images, the number of tables, statistics on the height and width of connected components and their bounding boxes, the local density values of the components, etc. Then the number of classes is estimated and a clustering is built based on that number. We provide an “assisted” classification tool based on CBIR (content-based image retrieval) techniques and relevance feedback.
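To make the "estimate the number of classes, then cluster" idea concrete, here is a small standard-library sketch. It is not the method from the paper: it assumes plain k-means with a deterministic farthest-point initialization, and it picks the smallest number of clusters whose within-cluster error falls below a fraction (`keep`) of the single-cluster error. The three descriptors per document (words, images, tables) and all names and values are hypothetical; real features would also need to be normalized so that no single one dominates the distance.

```python
import math

def init_centers(points, k):
    """Farthest-point initialisation: deterministic, no random seed needed."""
    centers = [points[0]]
    while len(centers) < k:
        centers.append(max(points, key=lambda p: min(math.dist(p, c) for c in centers)))
    return centers

def assign(points, centers):
    """Label each point with the index of its nearest center."""
    return [min(range(len(centers)), key=lambda c: math.dist(p, centers[c]))
            for p in points]

def kmeans(points, k, iterations=20):
    """Plain k-means; returns the labels and the within-cluster squared error."""
    centers = init_centers(points, k)
    for _ in range(iterations):
        labels = assign(points, centers)
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centers[c] = tuple(sum(x) / len(members) for x in zip(*members))
    labels = assign(points, centers)
    inertia = sum(math.dist(p, centers[lab]) ** 2 for p, lab in zip(points, labels))
    return labels, inertia

def estimate_k(points, max_k=6, keep=0.01):
    """Smallest k whose error is at most `keep` times the k=1 error (assumed rule)."""
    _, base = kmeans(points, 1)
    labels = [0] * len(points)
    for k in range(1, max_k + 1):
        labels, inertia = kmeans(points, k)
        if inertia <= keep * base:
            return k, labels
    return max_k, labels

# Hypothetical descriptors: (number of words, number of images, number of tables).
documents = [
    (120, 1, 2), (130, 1, 2), (125, 1, 3),   # invoice-like pages
    (400, 0, 0), (420, 0, 0), (410, 0, 0),   # letter-like pages
    (40, 3, 1), (45, 3, 1), (50, 3, 1),      # form-like pages
]

k, labels = estimate_k(documents)
print("estimated number of classes:", k)
print("cluster labels:", labels)
```

In an assisted tool, clusters like these would only be a starting point: the operator's relevance feedback would then correct or merge them.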