A new ultra-lightweight AI model, Multinex, advances low-light image enhancement by leveraging classical colour vision theory and Retinex principles. The model outperforms comparable compact systems, recovering detail and clarity from previously unusable images.
Researchers from MIT and IBM create a state-of-the-art dataset called ChartNet, which includes over a million varied charts. The dataset is designed to teach vision-language models how to effectively interpret charts, enabling them to outperform commercial models on tasks like data extraction and chart summarization.
Garmin GPSMAP 67i with inReach
Garmin GPSMAP 67i with inReach provides rugged GNSS navigation, satellite messaging, and SOS for backcountry geology and climate field teams.
Researchers from Drexel University developed BioCoach, a program using AI and computer vision to analyze video and provide form coaching in real time. The system analyzes visual appearance and motion patterns, as well as 3D skeletal movements and body shape, to deliver detailed biomechanics-based feedback.
Researchers developed a disco laser system to enhance data visualization for snow groomers, improving operator comfort and reducing nausea caused by VR headsets. The system also enables better tracking and orientation aids, leading to more efficient and safe operation in challenging conditions.
The Association for Computing Machinery announced three technical awards for innovations in global wireless standards, machine learning, and 3D generative AI. Erdal Arikan received the Paris Kanellakis Theory and Practice Award for his discovery of channel polarization and polar codes.
A new model combines multiple ways of analysing 3D data, integrating local and global perspectives to interpret complex environments more reliably. The system improves detection of small or partially visible objects in real-world situations, enhancing safety in autonomous systems.
SAMSUNG T9 Portable SSD 2TB
SAMSUNG T9 Portable SSD 2TB transfers large imagery and model outputs quickly between field laptops, lab workstations, and secure archives.
Researchers developed an AI-based system that accurately detects whip sounds in horse racing, achieving detection rates of up to 70% in audio data. The system's ability to process audio in real-time and its reliance on high-frequency components make it a promising tool for improving animal welfare and fair competition.
A new AI tool, CattleFever, uses artificial intelligence and thermal cameras to estimate cattle body temperature from a photo. The system can automatically determine an animal's body temperature within 1 degree of the reading from a thermometer.
A breakthrough AI system called OmniPredict can predict human pedestrian behaviors with unprecedented accuracy, revolutionizing self-driving cars and urban mobility. The model combines visual cues with contextual information to anticipate pedestrians' next moves, reducing the risk of accidents and improving traffic safety.
Researchers at Purdue University are testing a computer-vision method to analyze smartphone photos of pregnant women's eyes to predict preeclampsia risk. The two-year study aims to reduce maternal mortality in Africa and could potentially save thousands of lives.
Sky-Watcher EQ6-R Pro Equatorial Mount
Sky-Watcher EQ6-R Pro Equatorial Mount provides precise tracking capacity for deep-sky imaging rigs during long astrophotography sessions.
Researchers developed AI-powered BlinkWise glasses that track blinking patterns to assess fatigue, mental workload, and eye-related health issues. The device uses radio signals to detect minute eyelid movements with unprecedented detail, preserving privacy and using minimal power.
Researchers developed a novel approach called R3DG that analyzes representations at varying granularities to capture nuanced emotional fluctuations and reduce computational complexity. This framework demonstrates superior performance in multiple multimodal tasks, including sentiment analysis, emotion recognition, and humor detection.
Researchers developed CoSyn, a new approach to train open-source models using AI-generated scientific figures and charts. The resulting dataset, CoSyn-400K, includes over 400,000 synthetic images and 2.7 million sets of corresponding instructions. CoSyn-trained models match or outperform proprietary peers in various benchmark tests.
Apple iPhone 17 Pro
Apple iPhone 17 Pro delivers top performance and advanced cameras for field documentation, data collection, and secure research communications.
MIT engineers developed a versatile demonstration interface that allows users to teach robots new skills in three intuitive ways: remote control, physical manipulation, or demonstration. This innovation expands the type of users and 'teachers' who interact with robots, enabling robots to learn a wider set of skills.
Researchers have demonstrated a new technique, RisingAttacK, to manipulate all widely used AI computer vision systems, allowing them to control what the AI 'sees'. The attack is effective at influencing the AI's ability to detect top targets, such as cars, pedestrians, or stop signs.
Researchers from The University of Osaka have demonstrated that vision transformers can spontaneously develop human-like visual attention patterns without specific training. This breakthrough showcases the potential of self-supervised learning for advancing AI applications and modeling biological vision.
University of Missouri researchers create digital sentiment map using AI to analyze public Instagram posts, linking emotional tone to real-life features. The tool aims to improve city services, identify areas of concern, and inform emergency response decisions.
Sony Alpha a7 IV (Body Only)
Sony Alpha a7 IV (Body Only) delivers reliable low-light performance and rugged build for astrophotography, lab documentation, and field expeditions.
A new crowdsourcing system, FireLoc, uses a network of low-cost mobile phones to detect wildfires minutes—even seconds—after they ignite. The system prioritizes privacy and accurately maps wilderness fires to within 180 feet of their origin.
WorldScribe, a new software, uses generative AI to provide real-time text and audio descriptions of surroundings for people who are blind or have low vision. The tool can adjust the level of detail based on user commands or camera frame time.
A study from the University of Arkansas System Division of Agriculture has improved food quality computer predictions by using human perception data. The researchers trained a computer model to mimic human adaptation to environmental conditions, resulting in more consistent predictions under different lighting conditions.
Davis Instruments Vantage Pro2 Weather Station
Davis Instruments Vantage Pro2 Weather Station offers research-grade local weather data for networked stations, campuses, and community observatories.
Researchers have made significant strides in multimodal sentiment recognition, leveraging self-supervised learning and large models to capture correlations between modalities and emotional information. The study emphasizes the importance of addressing data scarcity and exploring transfer learning methods to develop robust models.
Researchers created a system called Holodeck to generate interactive 3D environments, leveraging language models like ChatGPT to control it. The system outperformed earlier tools in evaluating realism and accuracy, with human evaluators preferring its outputs across various indoor environments.
A new depth from focus/defocus approach, DDFS, combines model-based and learning-based strategies to achieve notable improvements in performance and applicability. The proposed method outperformed state-of-the-art methods in various metrics for several image datasets.
MethaneMapper is an artificial intelligence-powered hyperspectral imaging tool that can detect real-time methane emissions and trace them to their sources. With a performance accuracy of 91%, it has the potential to revolutionize the way we monitor oil and gas operations and curb climate change.
Researchers at North Carolina State University have developed a new methodology called Patch-to-Cluster attention (PaCa) that addresses the challenges of vision transformers. PaCa improves ViT's ability to identify, classify, and segment objects in images while reducing computational demands and enhancing model interpretability.
DJI Air 3 (RC-N2)
DJI Air 3 (RC-N2) captures 4K mapping passes and environmental surveys with dual cameras, long flight time, and omnidirectional obstacle sensing.
MIT researchers develop teaching phase that guides humans in understanding AI strengths and weaknesses, enabling more accurate decisions and faster conclusions. The technique helps humans build a mental model of the AI agent, reducing reliance on biased assumptions.
Researchers at Beijing Institute of Technology created a robot that can track fast-moving rats for extended periods using real-time localization and movement analysis. The robotic rat's built-in stereo vision system enables it to characterize typical behaviors of actual rats, promoting autonomy and reproducibility in behavior research.
Researchers at MIT develop RFusion, a robotic system that uses data from a camera and radio frequency antenna to locate and retrieve lost items. The system relies on RFID tags and machine learning algorithms to optimize the robot's trajectory and grasp the object.
GQ GMC-500Plus Geiger Counter
GQ GMC-500Plus Geiger Counter logs beta, gamma, and X-ray levels for environmental monitoring, training labs, and safety demonstrations.
A team of scientists from Osaka University developed a machine learning method for classifying the type of building and its primary façade color using deep learning models applied to street-level images. This work may assist in fostering neighborhood cohesion and support urban renewal by providing tailored street-view datasets.