By Aaron Raizen | 11/12/2017 | General |Beginners

Image Recognition and Tagging Technology in 2018

Image Recognition and Tagging Technology in 2018

In this article we’ll be checking out some of the top image recognition software currently available—specifically concerning image tagging APIs. Most of these companies have demos on their sites, so I’ll be putting them to test with two pictures of my own—one easy, one a bit tougher. Here they are.

 

First up will be this guy, because the internet loves cats wearing hats .

image tagging cat

Then we’ll give this mosaic a go.

image tagging mosaic

Clarifai

clarifai

First on the list is Clarifai, one of the top image recognition APIs available today. It gives companies and developers an easy yet powerful way to integrate visual search into their apps. This includes image tagging and similar image search. They also offer “custom training so that you can train your own recognition models (on anything) and also search through them.” Oh, and this also goes for videos too.

 

So let’s try it out—first with our cat, then the mosaic.

 

Results:

image tagging clarifai 1

There were actually more but you get the gist of it.

 

You can also check for faces, colors, and NSFW content. This pic came in SWF, no face, and gave five pretty accurate colors. Not too bad, but you’ll have to decide for yourself if these results are accurate enough for your needs.

 

When I tried the next pic I got the following: Oops, something went wrong. Try reloading the model.

 

wah wah wah :(

 

Turns out the file was too big, though I had to figure that out on my own. I uploaded a smaller version and presto:

image tagging clarifai 2

Not really the best results, but depending on your needs Clarifai could be a good fit. They offer free, pay as you go, and enterprise plans, so pick which is right for you.

 

CloudSight

cloudsight

Let’s try another. With CloudSight you can keep it simple—send an HTTP request with your image and you'll receive a description of its contents. Sounds good to me. So let’s dive right into the results.

 

I uploaded the image and got back some JSON:

{
   "token": "G_h3bhJITmdtBMOrsHcCrQ",
   "url": "http://assets.cloudsight.ai/uploads/image_request/image/353/353008/353008767/IMG_20140726_001419.jpg",
   "ttl": 54,
   "status": "completed",
   "name": "white cat"
}

White cat—yeah that pretty much sums it up. Now the second pic:

{
   "token": "TblM2W9AaORdhmW0O3kA-A",
   "url": "http://assets.cloudsight.ai/uploads/image_request/image/353/353009/353009932/IMG_0345.jpg",
   "ttl": 54,
   "status": "completed",
   "name": "brown and yellow mosaic bird decor"
}

Brown and yellow mosaic bird decor. Impressive! But these results do come at a cost, namely latency. While other APIs return results in measurements of milliseconds, CloudSight is more in the seconds range—up to 15 actually.

 

Plans range from free for up to 500 requests and then paid plans from $99 per month.

 

If you need super high accuracy and are willing to wait a bit longer, CloudSight is the way to go.

 

Cloud Vision API

 

google image api

Google’s Cloud Vision API gives you powerful machine learning in an easy to use REST API. You’ll get image labeling, face and landmark detection, optical character recognition (OCR), and tagging of explicit content. No surprise, being Google it’s pretty slick.

 

It returns JSON if you like, or just get the labels, NSFW, and color results. For our cat pic we got:

image tagging google

The JSON returns all the info.

 

On the tougher mosaic?

image tagging

Not bad I guess, but not as good as CloudSight. It also has logo detection to “detect popular product logos within an image.”

 

Pricing ranges from free for the first thousand units and then $1.50 per 1000 units for most features. A unit is one application of a feature to an image. So if one image gets face detection and let’s say logo detection, that’s two units.



Microsoft’s Computer Vision API

image tagging MS

Not to be outdone by Google, Microsoft has their own Computer Vision API. By now I think we all get how these things work so let’s just dive in.

 

We get back a description:

{ "tags": [ "cat", "indoor", "sitting", "white", "table", "top", "computer", "black", "bed", "keyboard", "desk", "laptop", "standing", "room", "man", "umbrella", "laying" ], "captions": [ { "text": "a cat sitting on a desk", "confidence": 0.8545866 } ] }

and tags:

 

[ { "name": "cat", "confidence": 0.9951088 }, { "name": "indoor", "confidence": 0.9361982 } ]

We also get an adult/NSFW result of false, a racy score of false, and foreground, background, and accent colors. If you’re wondering, turns out a girl in a bikini will score false for adult but true for racy, so there’s that.

 

Pretty good results for our furry friend, but let’s see how it handles the mosaic.

 

Description:

{ "tags": [ "building", "standing", "cat", "group" ], "captions": [ { "text": "a close up of a cat", "confidence": 0.0968242 } ] }

Tags:

[]

Second wah wah wah :( of the article. Close up of a cat? Yeah, not so good Microsoft.

 

This API also has OCR features and can even read handwriting in an image. It also has celebrity and landmark recognition, and real-time video analysis. If you think it’ll be accurate enough for you and you like Microsoft stuff, give it a try.

 

It’s free for the first 5000 transactions per month and then, well it gets complicated but it’s on par with Google’s pricing.

Imagga

imagga

The last of our image tagging APIs is Imagga, another easy to implement, easy to use API. Their angle is their auto-tagging technology which can be trained. You can define tags for your specific use case and Imagga will learn to recognize and suggest them.

 

Alright, last one. Here we go:

image tagging imagga

No surprises here. As expected you get some JSON back with all the details. But did you notice that “show me more tags?” Well they mean it; the full list was about 85 tags!

 

Now the mosaic:

image tags imagga 2

Some of the best results we’ve seen for my ancient mosaic. Way to go Imagga! Notice how the confidence percentages are lower.

 

Pretty good stuff from Imagga. They don’t have a free plan but the cheapest starts at $14 per month, and then they go up from there.

 

Try it yourself

Here are links to all the different demos so you can play around with them yourself. Just try and keep it SFW. Or don’t, either way. Have fun!

Looking for more? Check out our Image Recognition Category page for lots more cool stuff. 

By Aaron Raizen | 11/12/2017 | General

{{CommentsModel.TotalCount}} Comments

Your Comment

{{CommentsModel.Message}}

Recent Stories

Top DiscoverSDK Experts

User photo
3355
Ashton Torrence
Web and Windows developer
GUI | Web and 11 more
View Profile
User photo
3220
Mendy Bennett
Experienced with Ad network & Ad servers.
Mobile | Ad Networks and 1 more
View Profile
User photo
3060
Karen Fitzgerald
7 years in Cross-Platform development.
Mobile | Cross Platform Frameworks
View Profile
Show All
X

Compare Products

Select up to three two products to compare by clicking on the compare icon () of each product.

{{compareToolModel.Error}}

Now comparing:

{{product.ProductName | createSubstring:25}} X
Compare Now