Publications

Found 35 results
Author Title [ Type(Asc)] Year
Filters: Author is Howard Lei  [Clear All Filters]
Conference Paper
Lei, H., & Mirghafori N. (2007).  Word-Conditioned Phone N-Grams for Speaker Recognition. Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007). 253-256.
Lei, H., & Mirghafori N. (2007).  Word-Conditioned HMM Supervectors for Speaker Recognition. Proceedings of the 8th Annual Conference of the International Speech Communication Association (Interspeech 2007). 746-749.
Lei, H., Choi J., Janin A., & Friedland G. (2011).  User Verification: Matching the Uploaders of Videos Across Accounts. 2404-2407.
Lei, H. (2009).  Towards Structured Approaches to Arbitrary Data Selection and Performance Prediction for Speaker Recognition.
Elizalde, B. Martinez, Friedland G., Lei H., & Divakaran A. (2012).  There is No Data Like Less Data: Percepts for Video Concept Detection on Consumer-Produced Media. 27-32.
Lei, H., Meyer B. T., & Mirghafori N. (2012).  Spectro-Temporal Gabor Features for Speaker Recognition. 4241-4244.
Lei, H., Choi J., Janin A., & Friedland G. (2011).  Persona Linking: Matching Uploaders of Videos Across Accounts.
Lei, H., Choi J., & Friedland G. (2013).  Nowhere to Hide: Exploring User-Verification Across Flickr Accounts.
Peters, N., Lei H., & Friedland G. (2012).  Name That Room: Room Identification Using Acoustic Features in a Recording. 841-844.
Friedland, G., Choi J., Lei H., & Janin A. (2011).  Multimodal Location Estimation on Flickr Videos.
Lei, H., Choi J., & Friedland G. (2012).  Multimodal City-Verification on Flickr Videos Using Acoustic and Textual Features. 2273-2276.
Lei, H., & Lopez-Gonzalo E. (2009).  Mel, Linear, and Antimel Frequency Cepstral Coefficients in Broad Phonetic Regions for Telephone Speaker Recognition. 2323-2326.
Peters, N., Lei H., & Choi J. (2012).  Matching Artificial Reverb Settings to Unknown Room Recordings: A Recommendation System for Reverb Plugins.
Elizalde, B. Martinez, Lei H., & Friedland G. (2013).  An I-Vector Representation of Acoustic Environments for Audio-Based Video Event Detection on User Generated Content. 114-117.

Pages