NUST Institutions Library Catalogue catalog › Details for: Audio Visual Person Recognition /

Normal view MARC view ISBD view

Audio Visual Person Recognition / Ahmad Ali

By: Ali, Ahmad Contributor(s): Supervisor : Dr. Hasan Sajid Material type: Text

TextIslamabad : SMME- NUST; 2022Description: 46p. Soft Copy 30cmSubject(s): MS Robotics and Intelligent Machine EngineeringDDC classification: 629.8 Online resources: Click here to access online

Tags from this library: No tags from this library for this title. Log in to add tags.

Holdings ( 1 )
Title notes ( 1 )
Comments ( 0 )

Item type	Current location	Home library	Shelving location	Call number	Status	Date due	Barcode	Item holds
Thesis	School of Mechanical & Manufacturing Engineering (SMME)	School of Mechanical & Manufacturing Engineering (SMME)	E-Books	629.8 (Browse shelf)	Available		SMME-TH-712

Total holds: 0

Person authentication is a primary element to consider wherever privacy is necessary. Deep learning based authentication algorithms have a number of applications in the said field. Adding multiple modalities makes the system more robust. In this research a joint multi-modal audio-visual deep learning based method has been devised to authenticate a person based on their voice as well as face. This two-step verification process works by learning face-feature based embeddings as well as voice-feature based embeddings to serve two purposes: 1) if the face presented matches with an identity in a reference database and 2) if the voice matches any voice in the reference database. This strategy can help prevent important systems from impostor attempts using modalities that are commonly present and available in consumer devices.

There are no comments on this title.

to post a comment.