Wayne State University
Objective:
Face Verification System
Same or Not?
Video-based Face Verification
feed a video as a
sequence of frames
feed video directly into the ConvNets as an input
Video-based Face Verification
feed a video as a
sequence of frames
feed video directly into the ConvNets as an input
Combine information from a sequence of frames using a CNN architecture with stream pooling and fully connected layers.
Our approach:
[1] Zhang, Kaipeng, Zhanpeng Zhang, Zhifeng Li, and Yu Qiao. "Joint Face Detection and Alignment Using Multitask Cascaded Convolutional Networks." IEEE Signal Processing Letters 23, no. 10 (2016): 1499-1503.
[1]
original
Positive
Negative
Original
Positive
Negative
Original
Positive
Negative
Video Face DataSet
Network Architecture
Triplet Loss:
The Triplet Loss minimizes the distance between an anchor and a positive, both of which have the same identity, and maximizes the distance between the anchor and a negative of a different identity
Works to be done