G2 Crowd Acquires Siftery to Create a New Way to Buy and Manage Software Spend 🚀

Best Image Recognition Software

Image recognition, also known as computer vision, allows applications using specific deep learning algorithms to understand images or videos. In these scenarios, images are data in the sense that they are inputted into an algorithm, the algorithm performs a requested task, and the algorithm outputs a solution provided by the image. One common execution for computer vision applications includes facial recognition—whether for tagging friends on Facebook or a police department identifying a potential suspect—solely based on an image. Another use for image recognition is in the medical field, where artificial intelligence, using image recognition, can observe an x-ray and decipher the diagnosis solely based on the image. Some other aspects of image recognition include image restoration, object recognition, and scene reconstruction. These capabilities may be embedded inside intelligent applications or offered as deep learning algorithms inAI platforms.

To qualify for inclusion in the Image Recognition category, a product must:

  • Provide a deep learning algorithm specifically for image recognition
  • Connect with image data pools to learn a specific solution or function
  • Consume the image data as an input and provide an outputted solution
G2 Crowd Grid® for Image Recognition
High Performers
Momentum Leaders
Momentum Score
Market Presence
Star Rating

Image Recognition reviews by real, verified users. Find unbiased ratings on user satisfaction, features, and price based on the most reviews available anywhere.

Compare Image Recognition Software
Results: 63
    G2 Crowd takes pride in showing unbiased ratings on user satisfaction. G2 Crowd does not allow for paid placement in any of our ratings.
    Sort By:

    Microsoft Computer Vision API is a cloud-based API tool that provides developers with access to advanced algorithms for processing images and returning informatio, by uploading an image or specifying an image URL, it analyze visual content in different ways based on inputs and user choices.

    Amazon Rekognition makes it easy to add image and video analysis to your applications. It can identify the objects, people, text, scenes, and activities, or any inappropriate content from an image or video.

    OpenCV is a tool that has has C++, C, Python and Java interfaces and supports Windows, Linux, Mac OS, iOS and Android for computational efficiency and with a strong focus on real-time applications, written in optimized C/C++, the library can take advantage of multi-core processing and enabled to take advantage of the hardware acceleration of the underlying heterogeneous compute platform

    Derive insight from images with our powerful Cloud Vision API

    Azure Face API uses state-of-the-art cloud-based face algorithms to detect and recognize human faces in images.Its capabilities include features like face detection, face verification, and face grouping to organize faces into groups based on their visual similarity.

    SimpleCV is an open source framework for building computer vision applications, user can get access to several high-powered computer vision libraries such as OpenCV without having to learn about bit depths, file formats, color spaces, buffer management, eigenvalues, or matrix versus bitmap storage.

    Microsoft Emotion API is a tool that analyze faces to detect a range of feelings and personalize your app's responses.

    scikit-image is a collection of algorithms for image processing.

    And this is where Google's deep dream ideas originate. With simple words you give to an AI program a couple of images and let it know what those images contain ( what objects - dogs, cats, mountains, bicycles, ... ) and give it a random image and ask it what objects it can find in this image.

    Clarifai offers a suite of tools that make it easy for anyone to quickly and accurately train, customize, and use machine learning-powered image and video recognition in their products.

    DeepPy is a MIT licensed deep learning framework that tries to add a touch of zen to deep learning as it allows for Pythonic programming based on NumPy's ndarray,has a small and easily extensible codebase, runs on CPU or Nvidia GPUs and implements the following network architectures feedforward networks, convnets, siamese networks and autoencoders.

    Azure Custom Vision Service is a tool for building custom image classifiers, and for making them better over time. This service enables you to identify your own objects and things in images.

    Vize.ai AI is custom image recognition and classification API, designed to allow developers and businesses to analyze image data.

    Caffe is a deep learning framework made with expression, speed, and modularity in mind.

    Hive is a complete deeplearning toolkit that applies visual intelligence to difficult problems, changing the way businesses analyze unstructured visual data.

    IBM Intelligent Video Analytics helps security and public safety organizations develop comprehensive security, intelligence and investigative capabilities using video.

    IBM Watson Visual Recognition is a tool that allow users to automatically identify subjects and objects contained within the image and organize and classify these images into logical categories.

    Microsoft Video API is a cloud-based API that provides advanced algorithms for tracking faces, detecting motion, stabilizing and creating thumbnails from video, it allows user to build more personalized and intelligent apps by understanding and automatically transforming video content.

    PCV is an open source Python module for computer vision

    Alibaba Cloud Image Search is an intelligent image search service that helps users find similar or identical images. Based on machine learning and deep learning, the product enables end-users to take a screenshot or upload an image to search and find desired products and fulfill other search requests

    Azure Video Indexer enables customers with digital video and audio content to automatically extract metadata and use it to build intelligent innovative applications.

    Gesture Recognition Toolkit (GRT) is a cross-platform, open-source, C++ machine learning library designed for real-time gesture recognition.

    MobileEngine makes it easy for you to add image recognition to your app. You provide a reference database of images (e.g. artwork, consumer packaged goods, book covers, catalog pages, etc.) and when your users photograph that object, MobileEngine finds your matching reference image.

    Allows you to implement image recognition technology within your web or mobile applications.

    VIGRA is a computer vision library that puts its main emphasis on flexible algorithms.

    VLFeat is an open source library that implements popular computer vision algorithms specializing in image understanding and local features extraction and matching, it include Fisher Vector, VLAD, SIFT, MSER, k-means, hierarchical k-means, agglomerative information bottleneck, SLIC superpixels, quick shift superpixels, large scale SVM training, and many others. It is written in C for efficiency and compatibility, with interfaces in MATLAB for ease of use, and detailed documentation throughout. It supports Windows, Mac OS X, and Linux.

    WineEngine is powered by TinEye's unparalleled image recognition technology and has been engineered and optimized to work with photographs captured by users' smart devices.

    3VR is a video technology and data company that solves the challenges associated with video searchability, allowing customers to rapidly gather real-time intelligence from the unstructured video data that is produced by a single camera or a global network of cameras.

    ACTi video analytics are designed to help you transform your video surveillance network into a smart detection system and a valuable resource for business management.

    AForge.Vision is a vision library that contains some vision classes - set of motion detection algorithms.

    Anyline is a SDK that allows to build real-time mobile OCR apps with highest recognition rates, not requiring any server infrastructure.

    Aurora Image Item Processing is a suite of check processing designed to provide superior performance and accuracy for your check processing operations.

    Blitline provides a platform for processing (cropping, rotating, compositing, filtering) images in a massively parallel environment.

    CraftAR by Catchoom is an image recognition and augmented reality platform for mobile and web applications.

    CCV is a open source/cross-platform solution for blob tracking with computer vision. that can interface with various web cameras and video devices as well as connect to various TUIO/OSC/XML enabled applications and supports many multi-touch lighting techniques including: FTIR, DI, DSI, and LLP with expansion planned for the future vision applications (custom modules/filters).

    Cloudsight is an image recognition API providing true understanding for your digital media.

    DeepAI allows users to learn about data science and play with the latest research.

    DeepLearningKit is an open source Deep Learning Framework for Apple's iOS, OS X & tvOS. Developed in Swift & Metal (GPU acceleration).

    Density offers a people counter & API.

    Image Processing is a tool that deals with array2d objects that contain various kinds of pixels or user defined generic image objects.

    Eblearn is an object-oriented C++ library that implements various machine learning models, including energy-based learning, gradient-based learning for machine composed of multiple heterogeneous modules.

    ENVI SARscape allows you to easily process and analyze SAR data acquired from all existing spaceborne and selected airborne platforms. Generate products, and integrate information with other geospatial products.

    Face++ provides APIs and SDKs for facial recognition.

    GraphicsMagick is a collection of tools and libraries to read, write, and manipulate an image in any of the more popular image formats.

    Imagga is an image recognition platform-as-a-service that provides image tagging APIs for developers and businesses to build scalable, image and video intensive apps.

    ImageVision provides image and video recognition solutions.

    Lambda is a free, open source face API which offers both face detection and face recognition.

    LTU Engine offers cutting-edge pixel based image functionalities to search and make sense of visuals.

    Harris Geospatial has developed a suite of deep learning-based tools called MEGA™ that are designed specifically to work with imagery to solve geospatial problems. This technology is currently being used to solve real-world problems in industries that include agriculture, utilities, transportation, and defense

    MetaEyes is a reporting service that with the help of image recognition technology analyzes Instagram (plus other services) photos, revealing a wealth of actionable information. MetaEyes can detect: • Faces & Demographics • Explicit/Racy Content • Sentiment • Scenes & Objects • Logos • Locations • Celebrities • Other info Powerful reporting tools to analyze and filter data in myriads of ways. • Summary and granular reports • Filter detailed reports by all MetaEyes attributes • View by date range and different sorting methods • Export as PDF or CSV files for further analysis Photos provide a far more refined insight allowing for novel ways of engagement. •Discover potential fans by attributes previously undetectable • Surface and engage with previously ghost influencers • Find user-generated brand photos and directly contact to use in marketing campaigns • Monitor for brand related crisis situations and engage directly to avoid negative virality