Archive for the ‘Computer Vision’ Category

The kooaba API is here! Yeeehhhh …

March 3rd, 2010 by Till

Now the moment you’ve all been waiting for: the kooaba API is open to the public. We announced it this Monday at ARDevcamp Zurich and already got loads of positive feedback.

The quick facts

  • You will get access to the recognition of media covers (CDs, DVDs, Books, Movie Posters). That’s close to ten million items (!)
  • We offer a service in the cloud, i.e. you don’t install anything locally
  • You access the API via REST (essentially standard HTTP)
  • FREE with up to 50 requests per day
  • Sample code for Java, Ruby, PHP, Objective C available
  • We have a new developers section on our web-site with more details

We can’t wait to see what people create, here are some things we would like to see ourselves:

  • clients for mobile platforms other than iPhone and Android
  • something with Twitter
  • something with Facebook
  • use the webcam
  • a (mobile) movie guide (snap pictures of movie posters…)
  • recommended reading (for book covers)

But more than these ideas we would love to see stuff that we didn’t even dream of.

Essentially, you can build your own Google Goggles or even cooler apps! And we take care of the geeky image search algorithms for you.

So, check it out!

Come and see kooaba’s Augmented Reality prototype live

February 23rd, 2010 by Till

Next week, on March 1st, we will be co-organizing the AR Devcamp Zurich. AR Devcamp’s mission can be summarized best with a quote from their website:

After nearly 20 years in the research labs, Augmented Reality is taking shape as one of the next major waves of Internet innovation, overlaying and infusing the physical world with digital media, information and experiences. We believe AR must be fundamentally open, interoperable, extensible, and accessible to all, so that it can create the kinds of opportunities for expressiveness, communication, business and social good that we enjoy on the web and Internet today. As one step toward this goal of an Open AR web, we are organizing AR DevCamps technical sessions and hacking opportunities in an open format, BarCamp style.

Together with ETH Zurich, we will lead a session on the use of visual (natural) features for Augmented Reality. We will also have a special announcement for developers.
The registration link for AR Devcamp can be found here.

Furthermore, co-located there will be a special Mobile Monday (“Hands on AR”) with an Augmented Reality tour through Zurich (starting at 4 pm).
For the first time, we will show an augmented reality prototype to the general public. It will be an application running on Android phones, with features similar to the ones we announced in this video a little while ago:

So, come and join us to see a glimpse of the (augmented) future. (Registration link for Mobile Monday is here.)

A glimpse of the future: augmented reality with kooaba

September 7th, 2009 by Till

Here at kooaba we are always keen on testing the boundaries of the current state-of-the art in image recognition. In cooperation with ETH Zurich we thus create prototypes of novel applications of our image recognition services.

One field of such applications is Augmented Reality. In a nutshell, augmented reality overlays digital information on the real world, e.g. as seen “through” the screen of a mobile phone. Recently several applications have appeared on the market, which rely on GPS and compass information to achieve the desired effect. However, these solutions have several shortcomings:

  • it is difficult to select relevant data for display. Often elements that are not actually visible are labeled on the screen
  • only stationary objects can be shown, i.e. objects which are bound to a fixed location, since the system relies on GPS in the first place

The combination with image recognition allows for overcoming these challenges. Thus, we asked Aleksander Slater, a Master Student at ETH, to implement an iPhone Augmented Reality system. It connects to our kooaba recognition system for media covers, as well as our landmark recognition engine developed recently at ETH. The result is shown in the video below:

Note how both (stationary) landmarks and (non stationary) media covers can be recognized and displayed on the screen. A click on the label leads to a page about related information. The recognized objects are stored in a history. (We have currently over 7 million media cover items in the kooaba database, and hundreds of thousands of landmarks in our landmark recognition prototype at ETH).
The next video shows the same concept (objects are replaced by pictures, since we didn’t have the Golden Gate Bridge in front of our house) and even combines it with OCR and translation capabilities.

This is all running on an iPhone already today, pretty much in real time (the video was recorded on the iPhone Emulator, just to get better quality). Let us know, if you would to have it on your device ;)

How we use amazon ec2 at kooaba

August 27th, 2009 by Till

As you are certainly aware of, at the core of all our offerings lies an image recognition system. Such an image recognition system functions very similar to a text search engine – with some added complexity and pitfalls here and there. In fact, image recognition is much less advanced in its capabilities than text search. One of the challenges is posed by the current recognition capabilities. While significantly more advanced than just 5 years ago, many image search tasks still suffer from the so-called “semantic gap”: many high-level queries (e.g. “give me a picture with a car in the rain”) are just impossible to solve with current state-of-the art technology.

At kooaba we offer currently services based on technology that works today: recognition of specific items (Media covers, posters, landmark buildings, magazine pages) works pretty well already nowadays. (Obviously, we will continuously expand our services as technology improves).
However, even or in particular for this type of visual object retrieval one huge challenge remains, which is scalability. Only very recently, vision researchers were able to demonstrate real-time search in collections in the order of millions of items. Obviously, when rolling out such a system as a product, you need significant hardware resources. The database/index is even early on too large-to fit into the RAM on a standard server, and CPU demands are very high also. Problems obviously get worse, when you have to handle many users, since you have to replicated the whole structure to handle the load in parallel. This used to mean a lot of upfront investment in hardware – which is a problem for any startup.
Thus, we at kooaba we know from the moment cloud computing offers started appearing a few years ago, that it would be a crucial success factor for us. In particular amazon’s ec2 caught our eye. We made it a central part of our technology strategy.
Today, we have deployed a system which indexes and recognizes around 7 million items. The architecture is shown below.

ec2_sketch.png

We run our own local servers, which handle incoming traffic from our Mobile applications (iPhone and Android) and also serve dynamic web-pages for search results, the kooaba library, content and data management etc.
However, the complete image recognition is deployed on a set of servers on ec2. Due to the demanding nature of our search engine we rely on large instance types. (Also, we were very glad when reserved instances were announced, to cut our costs further). The instances run our custom images with our proprietary search software written in C++ on Ubuntu Linux.
Requests are distributed to the instances in the amazon cloud, and collected and aggregated at our local servers.

In addition to the image search we also run Apache SOLR for fulltext search functions in the kooaba library. The text index is stored on elastic block storage and backuped to s3 using snapshots triggered by cron jobs.

So far we have never regretted our move to cloud computing. The benefit of avoiding up-front hardware investments is enormous, and the ec2 service has been very reliable so far – in fact, even more reliable than our local server housing partners.

We are currently considering moving also our web applications (written in Ruby on Rails), relational databases (mysql), and potentially even other parts of the infrastructure to ec2. Overall, we got very hesitant in buying our own hardware and always first compare offers to cloud computing – not only for computation, but also for storage.

kooaba Library released

May 25th, 2009 by Marc

We are proud to present our new product, the kooaba Library. It allows you to catalog and share your media collection such as CDs and DVDs on the web. You can scan your Items using our mobile image recognition application for iPhone and Android. Sign up now.

If you are already a user of kooaba for iPhone, you probably noticed that we added a new button called Settings to the application.

On the Settings panel you can enter your account information, which is needed if you want to use the kooaba Library. Now, all the items you snap using the iPhone app will be added to your personal library. Check out the following video to find out all about it:

Sign up now and start exploring!

The Augmented Reality Hype Cycle

April 28th, 2009 by Marc

Mobile service architects from SPRXMobile recently posted a great overview of the levels of Augmented Reality. Definitely worth a read!

The Augmented Reality Hype Cycle

Pixels that hyperlink the world.

October 17th, 2008 by Till

kooaba was featured very prominently on German TV channel 3sat. The title of the report: “pixels that hyperlink the world”, featuring kooaba, Microsoft, and Google. Nice!

Here is a handy video clip of the show (in German):

News from the kooaba labs: real-time server-side object recognition

May 5th, 2008 by Till

People ask us often what mobile visual search will look like in a few months or years. The kooaba labs intend to show such visions, prototypes etc.

Today, we open the kooaba labs part of our web-site with a demonstrator on real-time object recognition. The video below shows how objects are recognized and labeled directly in a video stream from a mobile device.

(The video shows the screen of the mobile device)

For the geeks among our readers: the video is streamed to kooaba’s recognition service (via WiFi or Bluetooth), where it is matched to a database of object. The information about the recognized object is sent back to the client and displayed on the screen.

The demonstrator shown in this video was created with the help of Raphael and Benjamin, both undergrads at ETH Zurich. Thanks guys.

For further details please refer to the corresponding kooaba labs page.

Our Technology in Smart Meeting Rooms

February 13th, 2007 by Till

At the winter school of the research project IM2 we demonstrated how our technology can be used in smart meeting rooms. Details to follow soon.

Museum Guide Demo

May 1st, 2005 by Herbert

Demo of the mobile image-recognition framework as museum guide to the public in the context of the 150th anniversary
of the ETH Zurich
, Switzerland

Here is the video we made during one of the numerous visits. The searching time is now multiple orders of magnitude faster