Real-Time Reconstructions

Modern archeologists have been able to survey larger structures more precisely using remote sensing and photogrammetry. More recently, researchers have demonstrated applications of multi-view stereo with networked embedded cameras to track geological disturbances. In scenarios where visibility comes with high cost or safety risk, the ability to quickly render high-fidelity reconstructions for offline analysis and review can be a powerful tool. Advances in techniques like multi-view stereo and structure from motion have reduced the cost by alleviating dependence on more expensive sensors like lidar....

 · 4 min · Terry Rodriguez & Salma Mayorquin

Detect-Track-Localize

In our latest experiment with DepthAI’s cameras, we consider visual localization. This relates to the simultaneous localization and mapping (SLAM) problem, which robots solve to localize consistently in a known environment. However, instead of feature matching with algorithms like ORB, we can try to directly regress the pose of a known object. This approach uses object detection, which is more robust to changes in illumination and perspective than classical techniques. And without the need to generate a textured map of a new environment, this technique can be quickly adapted to new scenes....

 · 3 min · Terry Rodriguez & Salma Mayorquin

Pointcloud Video

Lately, we’ve come to enjoy using the DepthAI OAK-D, which features an RGB camera with stereo depth, IMU, and Intel’s MyriadX VPU. Along with this powerful hardware combination, DepthAI provides a rich SDK to build your own embedded vision pipelines. Many projects are included to get you started. These specs could help bring spatial AI to the SpecMirror where we can test representing human activities with pointcloud video. The Data: First, we will generate training samples for activity recognition models like P4Transformer....

 · 2 min · Terry Rodriguez & Salma Mayorquin

Protecting Privacy With Computer Vision

Check out and contribute to our collection of data privacy resources! AI researchers developed models to identify image pixels featuring people. We apply this to promote privacy by helping you redact personally identifiable information in images. This demo is powered by TensorFlow.js! Drop an image and retrieve the redacted output without ever sending data over the internet. Click on your redacted image when it’s done to save. Consider another use case of delivery robots roaming the streets....

 · 1 min · Terry Rodriguez & Salma Mayorquin

Alexa, where are my keys?

Alexa works well for information retrieval tasks and controlling devices on your wireless home network. We want to use the home network to track our valuables or keys. We’ll hack cheap Bluetooth Low Energy beacons for their network range and battery longevity and build a smart application so that Alexa knows where we left the keys. Hacking Bluetooth beacons: We’ll start by exploring what we can do with cheap Bluetooth beacons....

 · 7 min · Terry Rodriguez & Salma Mayorquin