Publications

Filters: Author is Gerald Friedland
Conference Paper
Vinyals, O., Friedland G., & Morgan N. (2010).  Discriminative Training for Hierarchical Clustering in Speaker Diarization. 2326-2329.
Friedland, G., Hürst W., & Knipping L. (2007).  Educational Multimedia Systems: The Past, the Present, and a Glimpse into the Future. 1-4.
Hung, H., Huang Y., Friedland G., & Gatica-Perez D. (2008).  Estimating the Dominant Person in Multi-Party Conversations Using Speaker Diarization Strategies. 2197-2200.
Goga, O., Lei H., Parthasarathi S. Hari Krish, Friedland G., Sommer R., & Teixeira R. (2013).  Exploiting Innocuous Activity for Correlating Users Across Sites.
Knox, M. Tai, Mirghafori N., & Friedland G. (2013).  Exploring Methods of Improving Speaker Accuracy for Speaker Diarization.
Gonina, E., Friedland G., Cook H., & Keutzer K. (2011).  Fast Speaker Diarization Using a High-Level Scripting Language.
Huang, Y., Vinyals O., Friedland G., Müller C., Mirghafori N., & Wooters C. (2007).  A Fast-Match Approach for Robust, Faster than Real-Time Speaker Diarization. Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding. 693-698.
Choi, J., Larson M., Friedland G., & Hanjalic A. (2019).  From Intra-Modal to Inter-Modal Space: Multi-Task Learning of Shared Representations for Cross-Modal Retrieval. Proceedings of 2019 IEEE Fifth International Conference on Multimedia Big Data (BigMM). 1-10.
Friedland, G., Vinyals O., Huang Y., & Müller C. (2009).  Fusing Short Term and Long Term Features for Improved Speaker Diarization. 4077-4080.
Cao, L., Friedland G., & Xie L. (2014).  GeoMM 2014: The Third ACM Multimedia Workshop on Geotagging and its Applications in Multimedia. 1251-1252.
Choi, J., Larson M., Li X., Li K., Friedland G., & Hanjalic A. (2017).  The Geo-Privacy Bonus of Popular Photo Enhancements. Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval. 84-92.
Vinyals, O., & Friedland G. (2008).  A Hardware-Independent Fast Logarithm Approximation with Adjustable Accuracy. 61-65.
Huang, P-S., Mertens R., Divakaran A., Friedland G., & Hasegawa-Johnson M. (2012).  How to Put It Into Words - Using Random Forests to Extract Symbol Level Descriptions from Audio Content for Concept Detection. 505-508.
Choi, J., Lei H., Ekambaram V., Kelm P., Gottlieb L., Sikora T., et al. (2013).  Human Vs Machine: Establishing a Human Baseline for Multimodal Location Estimation.
Vaquero, C., Vinyals O., & Friedland G. (2010).  A Hybrid Approach to Online Speaker Diarization. 2642-2645.
Boakye, K., Vinyals O., & Friedland G. (2011).  Improved Overlapped Speech Handling for Speaker Diarization. 941-944.
Tsai, T. J., Friedland G., & Anguera X. (2015).  An Information-Theoretic Metric of Fingerprint Effectiveness.
Ravanelli, M., Elizalde B. Martinez, Bernd J., & Friedland G. (2015).  Insights into Audio-Based Multimedia Event Classification with Neural Networks. 19-23.
Elizalde, B. Martinez, Lei H., & Friedland G. (2013).  An I-Vector Representation of Acoustic Environments for Audio-Based Video Event Detection on User Generated Content. 114-117.
Friedland, G., Gottlieb L., & Janin A. (2009).  Joke-o-Mat: Browsing Sitcoms Punchline by Punchline. 1115-1116.
Janin, A., Gottlieb L., & Friedland G. (2010).  Joke-O-Mat HD: Browsing Sitcoms with Human Derived Transcripts. 1591-1594.
Bernd, J., Borth D., Carrano C., Choi J., Elizalde B. Martinez, Friedland G., et al. (2015).  Kickstarting the Commons: The YFCC100M and the YLI Corpora. 1-6.
Stolcke, A., Friedland G., & Imseng D. (2010).  Leveraging Speaker Diarization for Meeting Recognition from Distant Microphones. 4390-4393.
Friedland, G., & Vinyals O. (2008).  Live Speaker Identification in Conversations. 1017-1018.
Elizalde, B. Martinez, & Friedland G. (2013).  Lost in Segmentation: Three Approaches for Speech/Non-Speech Detection in Consumer-Produced Videos.
