Best Image Recognition Software

Image recognition, also known as computer vision, allows applications to use deep learning algorithms to understand images or videos. In these scenarios, images are data: they are fed into an algorithm, the algorithm performs the requested task, and it outputs a result derived from the image. One common application is facial recognition, whether for tagging friends on Facebook or for a police department identifying a potential suspect, based solely on an image. Another use is in the medical field, where artificial intelligence can examine an X-ray and suggest a diagnosis based solely on the image. Other aspects of image recognition include image restoration, object recognition, and scene reconstruction. These capabilities may be embedded inside intelligent applications or offered as deep learning algorithms in AI platforms.

To qualify for inclusion in the Image Recognition category, a product must:

  • Provide a deep learning algorithm specifically for image recognition
  • Connect with image data pools to learn a specific solution or function
  • Consume the image data as an input and provide a solution as output

Image Recognition reviews by real, verified users. Find unbiased ratings on user satisfaction, features, and price based on the most reviews available anywhere.

Compare Image Recognition Software
Results: 68
    G2 Crowd takes pride in showing unbiased ratings on user satisfaction. G2 Crowd does not allow for paid placement in any of our ratings.

    Microsoft Computer Vision API is a cloud-based API tool that provides developers with access to advanced algorithms for processing images and returning information. By uploading an image or specifying an image URL, developers can have it analyze visual content in different ways based on inputs and user choices.

    Amazon Rekognition makes it easy to add image and video analysis to your applications. It can identify the objects, people, text, scenes, and activities, or any inappropriate content from an image or video.

    OpenCV is a computer vision library with C++, C, Python, and Java interfaces that supports Windows, Linux, Mac OS, iOS, and Android. It is designed for computational efficiency with a strong focus on real-time applications. Written in optimized C/C++, the library can take advantage of multi-core processing and of the hardware acceleration of the underlying heterogeneous compute platform.

    Derive insight from images with our powerful Cloud Vision API

    Microsoft Video API is a cloud-based API that provides advanced algorithms for tracking faces, detecting motion, stabilizing video, and creating thumbnails from video. It allows users to build more personalized and intelligent apps by understanding and automatically transforming video content.

    SimpleCV is an open source framework for building computer vision applications. Users get access to several high-powered computer vision libraries, such as OpenCV, without having to learn about bit depths, file formats, color spaces, buffer management, eigenvalues, or matrix versus bitmap storage.

    Azure Face API uses state-of-the-art cloud-based face algorithms to detect and recognize human faces in images. Its capabilities include features like face detection, face verification, and face grouping to organize faces into groups based on their visual similarity.

    Microsoft Emotion API is a tool that analyzes faces to detect a range of feelings and personalize your app's responses.

    scikit-image is a collection of algorithms for image processing.

    And this is where Google's Deep Dream ideas originate. In simple terms, you give an AI program a few images, tell it what those images contain (which objects: dogs, cats, mountains, bicycles, ...), and then give it a random image and ask which objects it can find in that image.
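The train-then-ask workflow described above can be illustrated with a deliberately tiny sketch. It is an assumption-laden toy, not how Deep Dream or any listed product works: it swaps the deep network for a nearest-centroid classifier, and the 2x2 grayscale "images" and the labels `bright` and `dark` are hypothetical.

```python
from math import dist

def flatten(image):
    """Turn a 2D grid of pixel intensities into a flat feature vector."""
    return [pixel for row in image for pixel in row]

def train(labeled_images):
    """Average the feature vectors of each label into one centroid per label."""
    sums, counts = {}, {}
    for label, image in labeled_images:
        vec = flatten(image)
        if label not in sums:
            sums[label] = [0.0] * len(vec)
            counts[label] = 0
        sums[label] = [s + v for s, v in zip(sums[label], vec)]
        counts[label] += 1
    return {label: [s / counts[label] for s in sums[label]] for label in sums}

def classify(centroids, image):
    """Return the label whose centroid is closest (Euclidean) to the image."""
    vec = flatten(image)
    return min(centroids, key=lambda label: dist(centroids[label], vec))

# Hypothetical labeled training images (2x2 grayscale grids).
training_set = [
    ("bright", [[250, 240], [245, 255]]),
    ("bright", [[230, 235], [240, 250]]),
    ("dark", [[10, 20], [5, 15]]),
    ("dark", [[30, 25], [20, 10]]),
]
centroids = train(training_set)

# An unseen image: ask the model which label it resembles most.
prediction = classify(centroids, [[200, 210], [190, 220]])
print(prediction)
```

Production systems typically replace the centroid step with a trained convolutional network, but the shape of the workflow (labeled examples in, a label for a new image out) is the same.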

    Clarifai offers a suite of tools that make it easy for anyone to quickly and accurately train, customize, and use machine learning-powered image and video recognition in their products.


    DeepPy is an MIT-licensed deep learning framework that tries to add a touch of zen to deep learning. It allows for Pythonic programming based on NumPy's ndarray, has a small and easily extensible codebase, runs on CPU or Nvidia GPUs, and implements the following network architectures: feedforward networks, convnets, siamese networks, and autoencoders.

    Azure Custom Vision Service is a tool for building custom image classifiers, and for making them better over time. This service enables you to identify your own objects and things in images.

    Founded in 2013, Hive is a full-stack deep learning company focused on solving visual intelligence challenges for enterprises and governments through three main pillars of the business: Hive Data, Hive Predict, and Hive Media. Hive Data is the world's largest two-sided marketplace for machine learning data labeling. We have over 800,000 distributed workers helping research labs and enterprise clients generate high quality truth data sets in fields like autonomous driving and robotics. Hive Predict is our set of proprietary visual intelligence models that solve problems like the identification of celebrities and logos. We sell these APIs to companies looking to incorporate our models into their own workflows. Our flagship product is Hive Media, through which we are selling the world's largest television analytics dataset.

    Vize.ai is a custom image recognition and classification API, designed to allow developers and businesses to analyze image data.

    Alibaba Cloud Image Search is an intelligent image search service that helps users find similar or identical images. Based on machine learning and deep learning, the product enables end users to take a screenshot or upload an image to find desired products and fulfill other search requests.

    Caffe is a deep learning framework made with expression, speed, and modularity in mind.

    IBM Intelligent Video Analytics helps security and public safety organizations develop comprehensive security, intelligence and investigative capabilities using video.

    IBM Watson Visual Recognition is a tool that allows users to automatically identify subjects and objects contained within an image and to organize and classify images into logical categories.

    PCV is an open source Python module for computer vision.

    VIGRA is a computer vision library that puts its main emphasis on flexible algorithms.

    Azure Video Indexer enables customers with digital video and audio content to automatically extract metadata and use it to build intelligent innovative applications.

    Gesture Recognition Toolkit (GRT) is a cross-platform, open-source, C++ machine learning library designed for real-time gesture recognition.

    MobileEngine makes it easy for you to add image recognition to your app. You provide a reference database of images (e.g. artwork, consumer packaged goods, book covers, catalog pages, etc.) and when your users photograph that object, MobileEngine finds your matching reference image.
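As a rough illustration of matching a user photo against a reference database, the sketch below uses a simple average hash compared by Hamming distance. This is an assumed stand-in technique, not MobileEngine's actual method or API; the function names, the 3x3 grayscale grids, and the database keys are all hypothetical.

```python
def average_hash(image):
    """Return a tuple of bits: 1 where a pixel is at or above the image's mean intensity."""
    pixels = [p for row in image for p in row]
    mean = sum(pixels) / len(pixels)
    return tuple(1 if p >= mean else 0 for p in pixels)

def hamming(a, b):
    """Count the positions where two equal-length bit tuples differ."""
    return sum(x != y for x, y in zip(a, b))

def best_match(reference_db, photo):
    """Return the name of the reference image whose hash is closest to the photo's."""
    photo_hash = average_hash(photo)
    return min(
        reference_db,
        key=lambda name: hamming(average_hash(reference_db[name]), photo_hash),
    )

# Hypothetical reference database of 3x3 grayscale images.
reference_db = {
    "book_cover": [[200, 200, 10], [200, 10, 10], [10, 10, 10]],
    "wine_label": [[10, 10, 10], [10, 200, 200], [10, 200, 200]],
}

# A user photo of the book cover, slightly darker and noisier than the reference.
photo = [[180, 170, 30], [175, 25, 20], [15, 30, 10]]
match = best_match(reference_db, photo)
print(match)
```

A hash like this tolerates small brightness changes but not rotation or perspective shifts, which is one reason real matching services tend to rely on local feature descriptors instead.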

    Allows you to implement image recognition technology within your web or mobile applications.

    VLFeat is an open source library that implements popular computer vision algorithms, specializing in image understanding and local feature extraction and matching. Algorithms include Fisher Vector, VLAD, SIFT, MSER, k-means, hierarchical k-means, agglomerative information bottleneck, SLIC superpixels, quick shift superpixels, large scale SVM training, and many others. It is written in C for efficiency and compatibility, with interfaces in MATLAB for ease of use, and detailed documentation throughout. It supports Windows, Mac OS X, and Linux.

    WineEngine is powered by TinEye's unparalleled image recognition technology and has been engineered and optimized to work with photographs captured by users' smart devices.

    3VR is a video technology and data company that solves the challenges associated with video searchability, allowing customers to rapidly gather real-time intelligence from the unstructured video data that is produced by a single camera or a global network of cameras.

    ACTi video analytics are designed to help you transform your video surveillance network into a smart detection system and a valuable resource for business management.

    AForge.Vision is a vision library that contains vision classes, namely a set of motion detection algorithms.

    Anyline is an SDK that lets developers build real-time mobile OCR apps with high recognition rates, without requiring any server infrastructure.

    Aurora Image Item Processing is a check processing suite designed to provide superior performance and accuracy for your check processing operations.

    Blitline provides a platform for processing (cropping, rotating, compositing, filtering) images in a massively parallel environment.
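The operations named above (cropping, rotating) applied to a batch of images at once can be sketched in a few lines. This is a minimal illustration on nested lists using Python's standard thread pool, not Blitline's API; `crop`, `rotate_cw`, `process`, and the sample data are hypothetical names and values chosen for the example.

```python
from concurrent.futures import ThreadPoolExecutor

def crop(image, top, left, height, width):
    """Return the height x width sub-grid whose top-left corner is (top, left)."""
    return [row[left:left + width] for row in image[top:top + height]]

def rotate_cw(image):
    """Rotate a grid of pixels 90 degrees clockwise."""
    return [list(row) for row in zip(*image[::-1])]

def process(image):
    """Example job: crop the right-hand 2x2 region, then rotate it clockwise."""
    return rotate_cw(crop(image, 0, 1, 2, 2))

# Hypothetical batch of 2x3 grayscale images.
batch = [
    [[1, 2, 3], [4, 5, 6]],
    [[7, 8, 9], [10, 11, 12]],
]

# Each image is an independent job, so the batch can be mapped across workers.
with ThreadPoolExecutor() as pool:
    results = list(pool.map(process, batch))

print(results)
```

Because every image is processed independently, the same pattern scales from a local thread pool to a fleet of workers without changing the per-image functions.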

    CraftAR by Catchoom is an image recognition and augmented reality platform for mobile and web applications.

    CCV is an open source, cross-platform solution for blob tracking with computer vision. It can interface with various web cameras and video devices, connect to various TUIO/OSC/XML-enabled applications, and supports many multi-touch lighting techniques, including FTIR, DI, DSI, and LLP, with expansion planned for future vision applications (custom modules/filters).
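Blob tracking generally starts by finding blobs in each frame. The sketch below shows one common way to do that, connected-component labeling with a breadth-first flood fill on a binary mask. It is an assumed illustration and not CCV's implementation; the function names and the sample mask are hypothetical.

```python
from collections import deque

def find_blobs(mask):
    """Group 4-connected cells equal to 1 into blobs; each blob is a list of (row, col)."""
    rows, cols = len(mask), len(mask[0])
    seen = [[False] * cols for _ in range(rows)]
    blobs = []
    for r in range(rows):
        for c in range(cols):
            if mask[r][c] and not seen[r][c]:
                # Flood-fill outward from this unvisited foreground cell.
                queue = deque([(r, c)])
                seen[r][c] = True
                blob = []
                while queue:
                    y, x = queue.popleft()
                    blob.append((y, x))
                    for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                        if (0 <= ny < rows and 0 <= nx < cols
                                and mask[ny][nx] and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
                blobs.append(blob)
    return blobs

def centroid(blob):
    """Return the mean (row, col) position of a blob."""
    n = len(blob)
    return (sum(y for y, _ in blob) / n, sum(x for _, x in blob) / n)

# Hypothetical thresholded camera frame: 1 marks a touch, 0 is background.
mask = [
    [1, 1, 0, 0, 0],
    [1, 1, 0, 0, 0],
    [0, 0, 0, 1, 1],
    [0, 0, 0, 1, 1],
]
blobs = find_blobs(mask)
centers = [centroid(b) for b in blobs]
print(centers)
```

Tracking would then pair each centroid with the nearest centroid from the previous frame, a step this sketch leaves out.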

    CEREBRO applies cutting-edge technology to gather real-time data from various data sources, including high-end sensors with demographic recognition capabilities, to make sure you reach the right audience and spend your budget wisely.

    Cloudsight is an image recognition API providing true understanding for your digital media.

    DeepAI allows users to learn about data science and play with the latest research.

    DeepLearningKit is an open source deep learning framework for Apple's iOS, OS X, and tvOS. It is developed in Swift and Metal (for GPU acceleration).

    Density offers a people counter & API.

    Image Processing is a tool that deals with array2d objects that contain various kinds of pixels or user defined generic image objects.

    Eblearn is an object-oriented C++ library that implements various machine learning models, including energy-based learning and gradient-based learning for machines composed of multiple heterogeneous modules.

    ENVI SARscape allows you to easily process and analyze SAR data acquired from all existing spaceborne and selected airborne platforms. Generate products, and integrate information with other geospatial products.

    Face++ provides APIs and SDKs for facial recognition.

    FaceX provides a face detection and face recognition web service that can be integrated into your apps with just a few lines of code.

    GraphicsMagick is a collection of tools and libraries to read, write, and manipulate an image in any of the more popular image formats.

    Imagga is an image recognition platform-as-a-service that provides image tagging APIs for developers and businesses to build scalable, image- and video-intensive apps.

    ImageVision provides image and video recognition solutions.

    Intellisplit is software designed to identify and crop required features, such as photos, signatures, or biometric impressions, and rearrange them in the database or the desired destination folder.

    Lambda is a free, open source face API which offers both face detection and face recognition.