Bitrate Optimization using Spark and FFmpeg

Check out the notebook that accompanies this post! Streaming video is a major part of how users consume information across a variety of applications. As more users turn to mobile devices, the screen sizes are also increasing. At the same time, consumers expect high quality video without lag or distortion. This frames an engineering challenge to optimize the way video is streamed for consumers using a wide variety of hardware....

 · 3 min · Terry Rodriguez & Salma Mayorquin

Scalable Image Deduplication With Spark

Make sure to check out the databricks notebook for this post! Modern internet companies maintain many image/video assets rendered at various resolutions to optimize content delivery. This demand gives rise to very interesting optimization problems. Groups like Netflix have even taken steps to personalize the images presented to each user, but as they describe, this involves subproblems in organizing the collection of images. In particular, Netflix researchers described extracting image metadata to help cluster near duplicate images so they could more efficiently apply techniques like contextual bandits for image personalization....

 · 2 min · Terry Rodriguez & Salma Mayorquin