Image Inpainting for Content Localization

In a prior post, we explored training StyleGAN2 on a corpus of many theatrical posters we scraped from places like IMDb. Then we considered applications of StyleGAN2 for image retrieval after extracting embeddings by projecting our corpus to the latent factor space. These image retrieval techniques can form the basis of personalized image recommendations. Netflix engineering posted on how they partner with their content creation team to test artwork for driving user engagement....

 · 3 min · Terry Rodriguez & Salma Mayorquin

Applying GAN Latent Factors for Image Retrieval

GANs represent the state of the art in learning an image distribution for an image corpus. These models often use explicit mechanisms for learning factored representations of images over a continuous embedding space. These are desirable features for image embeddings in the context of image retrieval. In this post, we explore applications of StyleGAN2 variants to the image retrieval task. StyleGAN2: Using an unsupervised approach, we train a StyleGAN2 model to generate theatrical posters from our image corpus....

 · 3 min · Terry Rodriguez & Salma Mayorquin

Deepfake Detection With NVIDIA TLT 3.0 and DeepStream SDK

Last year, over 2 thousand teams participated in Kaggle’s Deepfake detection video classification challenge. For this task, contestants were provided 470 GB of high-resolution video and required to submit a notebook which, within a 9-hour run-time limit, predicts whether each sample video file has been deepfaked. Since most deepfake technology performs a face swap, contestants concentrated on face detection and analysis. Beginning with face detection, contestants could develop an image classifier using the provided labels....

 · 5 min · Terry Rodriguez & Salma Mayorquin

Deepfake Detection: Challenge Accepted

Advances in methods to generate photorealistic but synthetic images have prompted concerns about abusing the technology to spread misinformation. In response, major tech companies like Facebook, Amazon, and Microsoft partnered to sponsor a contest hosted by Kaggle to mobilize machine learning talent to tackle the challenge. With $1 million in prizes and nearly half a terabyte of samples to train on, this contest requires the development of models that can be deployed to combat deepfakes....

 · 2 min · Terry Rodriguez & Salma Mayorquin

Everybody Dance Faster

Check out the repo and the video! “Everybody Dance Now” offers a sensational demonstration of combining image-to-image translation with pose estimation to produce photo-realistic ‘do-as-I-do’ motion transfer. Researchers used roughly 20 minutes of video shot at 120 fps of a subject moving through a normal range of body motion. It is also important for source and target videos to be taken from similar perspectives. Generally, this is a fixed camera angle at a third-person perspective with the subject’s body occupying most of the image....

 · 7 min · Terry Rodriguez & Salma Mayorquin