Computer Vision | Topics

Articles tagged with Computer Vision

192-dimensional photonic chip unlocks ultra-parallel optical computing with reconfigurable large kernels

The new photonic architecture harnesses three fundamental degrees of freedom: wavelength, mode, and polarization, achieving 192 parallel computing channels. The chip supports large, reconfigurable convolution kernels up to 13x13, capturing global structural contours while preserving fine details.

UJI students Pau Montagut and Mario García win first place in the international competition at the ICRA 2026 robotics conference

University of Jaume I students Pau Montagut and Mario García won first place at the ICRA 2026 robotics conference with an AI model that taught a Toyota HSR robot to perform household tasks. The team's achievement is notable given their undergraduate status, competing against teams of research personnel.

Garmin GPSMAP 67i with inReach

Garmin GPSMAP 67i with inReach provides rugged GNSS navigation, satellite messaging, and SOS for backcountry geology and climate field teams.

Artificial eyes could bring human-like sight to self-driving cars, robots

Researchers at Penn State developed photomemristors that adjust sensitivity based on light levels, like the human eye. These devices can process light data faster and more accurately than traditional systems in mixed lighting environments.

Multinex: An ultra lightweight AI model advancing low light image enhancement

A new ultra-lightweight AI model, Multinex, advances low-light image enhancement by leveraging classical colour vision theory and Retinex principles. The model outperforms comparable compact systems, recovering detail and clarity from previously unusable images.

Jumping spiders inspire ultra-efficient 3D camera

Researchers developed SpiderCam, a highly energy-efficient 3D camera inspired by jumping spiders. It captures two images with different focus settings and analyzes the differences in sharpness to produce real-time 3D maps while consuming less than a watt of power.

MIT researchers teach AI models to interpret charts

Researchers from MIT and IBM create a state-of-the-art dataset called ChartNet, which includes over a million varied charts. The dataset is designed to teach vision-language models how to effectively interpret charts, enabling them to outperform commercial models on tasks like data extraction and chart summarization.

Apple iPhone 17 Pro

Apple iPhone 17 Pro delivers top performance and advanced cameras for field documentation, data collection, and secure research communications.

Getting an exercise form coaching assist from AI

Researchers from Drexel University developed BioCoach, a program using AI and computer vision to analyze video and provide form coaching in real time. The system analyzes visual appearance and motion patterns, as well as 3D skeletal movements and body shape, to deliver detailed biomechanics-based feedback.

Brown University computer scientists clear a path to stream 3D ‘volumetric’ video

Brown University computer scientists introduce PackUV, a compression method that enables everyday video formats to stream volumetric video. The technique improves capture of 3D action and makes final products compatible with existing video codecs.

Disco lasers improve the safety of snow groomers

Researchers developed a disco laser system to enhance data visualization for snow groomers, improving operator comfort and reducing nausea caused by VR headsets. The system also enables better tracking and orientation aids, leading to more efficient and safe operation in challenging conditions.

SAMSUNG T9 Portable SSD 2TB

SAMSUNG T9 Portable SSD 2TB transfers large imagery and model outputs quickly between field laptops, lab workstations, and secure archives.

Turning surroundings into a 'virtual screen' could help machines to better see in 3D

A new approach enables computers and machines to capture images at higher resolution and faster speed, making it impervious to reflective surfaces. The technology uses a virtual screen created by repurposing the surroundings of specular objects.

Brain Network Disorders article reviews the adoption of AI in brain cancer segmentation

A systematic review of AI models for meningioma segmentation reveals that better model architecture is the key driver of improved performance. The top models achieved high accuracy and efficiency, while future research focuses on making them more generalizable and efficient for real-world clinical settings.

Machines with the ability to ‘feel’ currently in development as we enter next frontier of AI

GoPro HERO13 Black

GoPro HERO13 Black records stabilized 5.3K video for instrument deployments, field notes, and outreach, even in harsh weather and underwater conditions.

Is that solar panel pointing in the right direction?

A new technique uses a single image to forecast solar panel energy production and maximize output. The method estimates the amount of energy that will be produced based on the angle of the sun, shadows, reflections, and weather patterns, allowing for more accurate placement and optimization of solar panels in urban areas.

NTU Singapore scientists invent AI-powered biochip that detects genetic markers in 20 minutes

A team of scientists from NTU Singapore has developed a new biochip that, when paired with Artificial Intelligence (AI), can detect quickly and accurately extremely small amounts of microRNAs. The device can cut detection time from hours to 20 minutes.

HEAPGrasp: A faster, smarter way for robots to handle tricky objects using only RGB camera

HeapGrasp uses RGB images to analyze object silhouettes and estimate its 3D shape, reducing the need for depth information. The approach achieves high accuracy while minimizing camera movement and execution time.

KTU researchers develop a model that improves machine understanding of the real world

A new model combines multiple ways of analysing 3D data, integrating local and global perspectives to interpret complex environments more reliably. The system improves detection of small or partially visible objects in real-world situations, enhancing safety in autonomous systems.

Apple Watch Series 11 (GPS, 46mm)

Apple Watch Series 11 (GPS, 46mm) tracks health metrics and safety alerts during long observing sessions, fieldwork, and remote expeditions.

Improving AI models’ ability to explain their predictions

Researchers at MIT developed a new method that coaxes AI models to achieve better accuracy and clearer explanations in safety-critical applications. The approach extracts concepts the model has learned while training for a specific task and forces it to use those, producing better explanations than standard concept bottleneck models.

Small models, big insights into vision

Researchers used machine learning techniques to compress a large model of the visual cortex, creating smaller versions that predict neural responses with high accuracy. The compact models revealed specific computational patterns in how neurons detect important features, offering insights into how visual information is processed.

Philadelphia communities help AI computer vision get better at spotting gentrification

Drexel researchers create machine learning program that integrates qualitative and quantitative data to identify gentrification in Philadelphia neighborhoods. The program, trained with data from thousands of images and focus groups, accurately identifies new-build gentrification with 84% accuracy.

AmScope B120C-5M Compound Microscope

AmScope B120C-5M Compound Microscope supports teaching labs and QA checks with LED illumination, mechanical stage, and included 5MP camera.

InstaDrive: Street view generation based on the unified instance segmentation input of vehicles and map elements

InstaDrive generates precise editing of vehicles and map elements, enabling efficient labeled data generation. It outperforms baselines in FID and mAP, preserving accurate map structures and maintaining multi-view consistency.

Deep-learning model predicts how fruit flies form, cell by cell

A team of MIT engineers developed a deep-learning model that predicts how individual cells will fold, divide, and rearrange during a fruit fly's earliest stage of growth. The model achieved 90% accuracy in predicting the movement of 5,000 cells over the first hour of development.

Can AI read humans’ minds? A new model shows it’s shockingly good at it

A breakthrough AI system called OmniPredict can predict human pedestrian behaviors with unprecedented accuracy, revolutionizing self-driving cars and urban mobility. The model combines visual cues with contextual information to anticipate pedestrians' next moves, reducing the risk of accidents and improving traffic safety.

Purdue innovation to be evaluated in international study for earlier identification of preeclampsia risk

Researchers at Purdue University are testing a computer-vision method to analyze smartphone photos of pregnant women's eyes to predict preeclampsia risk. The two-year study aims to reduce maternal mortality in Africa and could potentially save thousands of lives.

AI-powered vision gives meaning to wildfire chaos

A UBC Okanagan team harnesses computer modeling to study wildfire movement, finding that fires often behave randomly due to factors like fuel type, wind, and terrain. This randomness can lead to significant variations in fire spread, highlighting the need for more probabilistic models.

Sony Alpha a7 IV (Body Only)

Sony Alpha a7 IV (Body Only) delivers reliable low-light performance and rugged build for astrophotography, lab documentation, and field expeditions.

AI at the Eyelid: Glasses that track health through your blinks

Researchers developed AI-powered BlinkWise glasses that track blinking patterns to assess fatigue, mental workload, and eye-related health issues. The device uses radio signals to detect minute eyelid movements with unprecedented detail, preserving privacy and using minimal power.

‘More than just an image’: Purdue tech extracts hyperspectral info from conventional photos

Researchers at Purdue University have developed an algorithm that recovers detailed spectral information from photographs taken by conventional cameras. The method uses computer vision, color science, and optical spectroscopy to achieve high spectral resolution comparable to scientific spectrometers.

AI turns printer into a partner in tissue engineering

Researchers at UMC Utrecht developed a new AI-powered printer called GRACE that can print implantable tissues with improved cell survival and functionality. The printer uses computer vision and laser-based imaging to design and print complex structures, including blood vessels and cartilage layers.

Shaky cameras can make for sharper shots, new research shows

Researchers at Brown University developed an image processing technique that harnesses camera motion to increase resolution, producing super-resolution images with details sharper than the original pixel array allows. The technique has potential applications in archival photography and photography from moving aircraft.

An efficient and memory-friendly unsupervised industrial anomaly detection model

A research team developed an innovative unsupervised model for industrial anomaly detection using paired well-lit and low-light images. The model leverages feature maps, Low-pass Feature Enhancement, and Illumination-aware Feature Enhancement to detect anomalies while remaining lightweight and memory-efficient.

Sky-Watcher EQ6-R Pro Equatorial Mount

Sky-Watcher EQ6-R Pro Equatorial Mount provides precise tracking capacity for deep-sky imaging rigs during long astrophotography sessions.

Pedestrians now walk faster and linger less, researchers find

A new study reveals that pedestrians are now walking faster and spending less time in public spaces. Researchers analyzed 40 years of video footage to find a 14% decline in people lingering in these areas.

AI vision, reinvented: The power of synthetic data

Researchers developed CoSyn, a new approach to train open-source models using AI-generated scientific figures and charts. The resulting dataset, CoSyn-400K, includes over 400,000 synthetic images and 2.7 million sets of corresponding instructions. CoSyn-trained models match or outperform proprietary peers in various benchmark tests.

New tool gives anyone the ability to train a robot

MIT engineers developed a versatile demonstration interface that allows users to teach robots new skills in three intuitive ways: remote control, physical manipulation, or demonstration. This innovation expands the type of users and 'teachers' who interact with robots, enabling robots to learn a wider set of skills.

New attack can make AI ‘see’ whatever you want

Researchers have demonstrated a new technique, RisingAttacK, to manipulate all widely used AI computer vision systems, allowing them to control what the AI 'sees'. The attack is effective at influencing the AI's ability to detect top targets, such as cars, pedestrians, or stop signs.

Pervasive surveillance of people is being used to access, monetize, coerce, and control

A new study reveals a five-fold increase in computer vision papers linked to surveillance patents, highlighting the rise of obfuscating language that normalises surveillance. The top institutions producing surveillance are Microsoft, Carnegie Mellon University, and MIT.

Rigol DP832 Triple-Output Bench Power Supply

Rigol DP832 Triple-Output Bench Power Supply powers sensors, microcontrollers, and test circuits with programmable rails and stable outputs.

New all-silicon computer vision hardware by UMass researchers advances in-sensor visual processing technology

Researchers at UMass Amherst created integrated arrays of gate-tunable silicon photodetectors that can capture dynamic visual information and classify static images with high accuracy. The technology has the potential to reduce latency in computer vision tasks, enabling applications like self-driving vehicles and bioimaging.

Animation technique simulates the motion of squishy objects

Researchers at MIT developed a simulation method that allows for accurate and stable simulations of elastic materials, enabling the creation of realistic bouncy characters in movies and video games. The approach preserves physical properties and avoids instability, making it a promising tool for engineers to design flexible products.

Imaging technique removes the effect of water in underwater scenes

Researchers have developed an image-analysis tool called SeaSplat that cuts through the ocean's optical effects and generates images of underwater environments with accurate colors. The team paired SeaSplat with a computational model to convert images into three-dimensional underwater worlds, allowing for virtual exploration.

New AI model dramatically improves subgraph matching accuracy by eliminating noise

A new deep learning model, ENDNet, significantly enhances subgraph matching accuracy by identifying and neutralizing extra nodes that interfere with the matching process. This improves performance in pattern recognition tasks across various fields, including drug discovery and natural language processing.

Smarter skies: A new AI model turns street cameras into rainfall sensors

Researchers developed an innovative deep-learning-based framework that uses common surveillance cameras to estimate rainfall in real time. The approach achieved high predictive accuracy across various environmental conditions and lighting scenarios, outperforming traditional methods while maintaining low computational costs.

Davis Instruments Vantage Pro2 Weather Station

Davis Instruments Vantage Pro2 Weather Station offers research-grade local weather data for networked stations, campuses, and community observatories.

Making AI models more trustworthy for high-stakes settings

Researchers at MIT developed a technique to improve the reliability of conformal classification, which can produce impractably large prediction sets. By combining test-time augmentation with conformal prediction, they reduced prediction set sizes by up to 30 percent while maintaining probability guarantees.

Helping computers perceive and interact with the visual world

Schmid's contributions have helped computers recognize complex objects, understand video analysis, and process realistic settings. Her leadership has built active research communities, mentoring and supervising peers across the field of computer vision.

UF professor develops AI tool to better assess Parkinson’s disease, other movement disorders

A University of Florida researcher has developed a groundbreaking AI tool called VisionMD that analyzes videos of patients with Parkinson's disease and other movement disorders. The tool provides valuable information about how the disease is progressing and responding to medications, improving patient care and advancing clinical research.

Transforming doors into gateways to the virtual world: the future of mixed reality!

A collaborative research team has developed a novel mixed reality (MR) technology that uses real-world doors as natural transition points. The system allows users to select a door within their MR interface and seamlessly transition into a virtual space, creating an unprecedented sense of immersion.

New 3D technology paves way for next-generation eye-tracking

Researchers at the University of Arizona have developed a new 3D imaging technique, deflectometry, paired with advanced computation to improve eye-tracking accuracy. The method can capture gaze direction information from more than 40,000 surface points, theoretically millions, increasing accuracy by a factor of over 3,000 compared to c...

Sky & Telescope Pocket Sky Atlas, 2nd Edition

Sky & Telescope Pocket Sky Atlas, 2nd Edition is a durable star atlas for planning sessions, identifying targets, and teaching celestial navigation.

Beyond ambiguous reflections: Bridging optical 3D metrology and computer vision

Researchers develop a new approach combining Phase Measuring Deflectometry and Shape from Polarization to accurately image specular surfaces without prior knowledge or assumptions. The single-shot method enables motion-robust measurements, pushing the limits for next-generation 3D sensors.

New AI tool generates high-quality images faster than state-of-the-art approaches

Researchers have developed a hybrid image-generation tool called HART that combines the strengths of autoregressive and diffusion models. It achieves high reconstruction quality with significantly reduced computational resources, enabling local execution on laptops or smartphones.

2025 IEEE 2nd International Conference on Deep Learning and Computer Vision (IEEE DLCV 2025)

The conference aims to bridge theoretical advancements with practical applications in AI and visual computing. Researchers can submit original research papers and attend keynote sessions, offering opportunities to network with pioneers in intelligent technologies.

From handicap to asset: AI approach leverages optics phenomenon to produce better images

Scientists developed a method that harnesses chromatic aberration to produce high-quality images using a single exposure. The AI approach uses generative models to retrieve phase information from limited data input.

GQ GMC-500Plus Geiger Counter

GQ GMC-500Plus Geiger Counter logs beta, gamma, and X-ray levels for environmental monitoring, training labs, and safety demonstrations.

Two Michigan Engineering researchers named 2025 Sloan Research Fellows

Thatchaphol Saranurak and Andrew Owens have been awarded Sloan Research Fellowships for their innovative work on graph networks and machine perception systems. Their research aims to create more efficient algorithms for computing dynamic systems, such as social networks and traffic patterns.

Engineers enable a drone to determine its position in the dark and indoors

MIT researchers have introduced a new system called MiFly that enables drones to self-localize in indoor, dark, and low-visibility environments. The system uses radio frequency waves reflected by a single tag placed in the environment, allowing the drone to estimate its trajectory with high accuracy.

How neighborhood perception affects housing rents: A novel analytical approach

A new method developed by Osaka Metropolitan University accurately predicts housing prices in Osaka City, with neighborhood perception being a key factor. The approach achieves nearly 75% accuracy by combining existing property data with machine-learning-processed street view images.

UCF helps develop AI tool that may assist underserved hospitals

The open-source AI model analyzes medical images, generates detailed reports, and answers clinical questions to streamline diagnostics and improve accuracy. BiomedGPT aims to democratize healthcare and reduce disparities amongst patients by providing easily accessible data to bolster underserved hospitals.

A review of camouflaged object detection research and the promise of deep learning

A comprehensive review of camouflaged object detection research highlights the potential of deep learning in recognizing objects in complex scenarios. The review analyzes traditional and deep learning approaches, emphasizing practical contributions and theoretical frameworks.

Kestrel 3000 Pocket Weather Meter

Kestrel 3000 Pocket Weather Meter measures wind, temperature, and humidity in real time for site assessments, aviation checks, and safety briefings.

Smart food drying techniques with AI enhance product quality and efficiency

Researchers develop precision techniques using optical sensors and AI to facilitate efficient and accurate food drying. The study discusses three emerging smart drying techniques, providing practical information for the food industry.

Placenta assessment tool aims to improve neonatal, maternal care

A new tool developed by Penn State researchers uses computer vision and artificial intelligence to analyze placenta images, detecting abnormalities and risks such as neonatal sepsis. The PlacentaCLIP+ model has the potential to transform neonatal and maternal care in low- and high-resource settings.

The best AI strategy to recognize multiple objects in one image

Researchers from Bar-Ilan University discover that classifying objects together through Multi-Label Classification can yield better results than detecting individual objects. This new method allows networks to learn correlations between object combinations, making them more recognizable in real-life applications such as autonomous vehi...

UTIA team wins grant to advance AI education and career preparation

The University of Tennessee Institute of Agriculture has won a four-year grant to create hands-on curriculum about AI-related technologies for future farmers and leaders. Selected students will test the curriculum in drones, robotics, and other smart agriculture technologies, gaining skills in coding, drone-work, and robotics.

Anker Laptop Power Bank 25,000mAh (Triple 100W USB-C)

Anker Laptop Power Bank 25,000mAh (Triple 100W USB-C) keeps Macs, tablets, and meters powered during extended observing runs and remote surveys.

A new way to create realistic 3D shapes using generative AI

Researchers develop a simple fix to an existing technique, enabling the generation of sharp, high-quality 3D shapes that rival top model-generated 2D images. The new approach improves upon previous methods by avoiding costly retraining and complex postprocessing.

Researchers develop markerless motion capture system to push biomechanics “into the wild”

Researchers at CAMERA have developed an open-source markerless motion capture system using computer vision and deep learning methods. The system estimates joint positions from regular 2D image data, providing unobtrusive analysis of body movements.

Reality check: making indoor smartphone-based augmented reality work

A study by Osaka University researchers found that visual landmarks can be difficult to find in certain environments, leading to motion sickness. They propose using radio-frequency localization, such as ultra-wideband sensing, to overcome these challenges and improve indoor augmented reality applications.

1 2 3