Computer vision
Definition
Computer vision is the field of AI that enables machines to interpret and understand visual information from images and video. It encompasses tasks such as image classification (identifying what is in an image), object detection (locating specific objects within a scene), segmentation (delineating object boundaries), facial recognition, optical character recognition (OCR), and pose estimation. Modern computer vision is predominantly powered by convolutional neural networks and, increasingly, transformer-based vision models trained on large labeled datasets.
In commerce, computer vision has diverse and high-value applications. Visual search allows customers to find products by uploading a photo rather than typing a description. In logistics and fulfillment, vision systems automate package inspection, barcode reading, and damage detection on conveyor lines. Planogram compliance tools use store cameras to verify that shelves match intended product layouts. In fashion and beauty retail, computer vision underpins virtual try-on and style recommendation engines. As camera hardware becomes ubiquitous in stores, warehouses, and devices, computer vision is becoming a foundational capability for operational efficiency and customer experience alike.
Related Terms
Source
Last updated: May 12, 2026