Publications

Found 170 results
Author Title [ Type(Asc)] Year
Filters: Author is Gerald Friedland  [Clear All Filters]
Conference Paper
Friedland, G., Gottlieb L., & Janin A. (2009).  Joke-o-Mat: Browsing Sitcoms Punchline by Punchline. 1115-1116.
Elizalde, B. Martinez, Lei H., & Friedland G. (2013).  An I-Vector Representation of Acoustic Environments for Audio-Based Video Event Detection on User Generated Content. 114-117.
Ravanelli, M., Elizalde B. Martinez, Bernd J., & Friedland G. (2015).  Insights into Audio-Based Multimedia Event Classification with Neural Networks. 19-23.
Tsai, T.. J., Friedland G., & Anguera X. (2015).  An Information-Theoretic Metric of Fingerprint Effectiveness.
Boakye, K., Vinyals O., & Friedland G. (2011).  Improved Overlapped Speech Handling for Speaker Diarization. 941-944.
Vaquero, C.., Vinyals O., & Friedland G. (2010).  A Hybrid Approach to Online Speaker Diarization. 2642-2645.
Choi, J., Lei H., Ekambaram V., Kelm P., Gottlieb L., Sikora T., et al. (2013).  Human Vs Machine: Establishing a Human Baseline for Multimodal Location Estimation.
Huang, P-S., Mertens R., Divakaran A., Friedland G., & Hasegawa-Johnson M. (2012).  How to Put It Into Words - Using Random Forests to Extract Symbol Level Descriptions from Audio Content for Concept Detection. 505-508.
Vinyals, O., & Friedland G. (2008).  A Hardware-Independent Fast Logarithm Approximation with Adjustable Accuracy. 61-65.
Choi, J., Larson M., Li X., Li K., Friedland G., & Hanjalic A. (2017).  The Geo-Privacy Bonus of Popular Photo Enhancements. Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval. 84-92.
Cao, L., Friedland G., & Xie L. (2014).  GeoMM 2014: The Third ACM Multimedia Workshop on Geotagging and its Applications in Multimedia. 1251-1252.
Friedland, G., Vinyals O., Huang Y., & Müller C. (2009).  Fusing Short Term and Long Term Features for Improved Speaker Diarization. 4077-4080.
Choi, J., Larson M., Friedland G., & Hanjalic A. (2019).  From Intra-Modal to Inter-Modal Space: Multi-Task Learning of Shared Representations for Cross-Modal Retrieval. Proceedings of 2019 IEEE Fifth International Conference on Multimedia Big Data (BigMM). 1-10.
Huang, Y., Vinyals O., Friedland G., Müller C., Mirghafori N., & Wooters C. (2007).  A Fast-Match Approach for Robust, Faster than Real-Time Speaker Diarization. Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding. 693-698.
Gonina, E., Friedland G., Cook H., & Keutzer K. (2011).  Fast Speaker Diarization Using a High-Level Scripting Language.
Knox, M. Tai, Mirghafori N., & Friedland G. (2013).  Exploring Methods of Improving Speaker Accuracy for Speaker Diarization.
Goga, O., Lei H., Parthasarathi S. Hari Krish, Friedland G., Sommer R., & Teixeira R. (2013).  Exploiting Innocuous Activity for Correlating Users Across Sites.
Hung, H., Huang Y., Friedland G., & Gatica-Perez D. (2008).  Estimating the Dominant Person in Multi-Party Conversations Using Speaker Diarization Strategies. 2197-2200.
Friedland, G., Hürst W., & Knipping L. (2007).  Educational Multimedia Systems: The Past, the Present, and a Glimpse into the Future. 1-4.
Vinyals, O., Friedland G., & Morgan N. (2010).  Discriminative Training for Hierarchical Clustering in Speaker Diarization. 2326-2329.
Jing, L., Liu B., Choi J., Janin A., Bernd J., Mahoney M., et al. (2016).  A discriminative and compact audio representation for event detection. Proceedings of the 2016 ACM Conference on Multimedia (MM '16). 57-61.
Zhao, T., Choi J., & Friedland G. (2020).  DIME: An Online Tool for the Visual Comparison of Cross-modal Retrieval Models. International Conference on Multimedia Modeling. 729-733.
Choi, J., & Friedland G. (2011).  Data-Driven vs. Semantic-Technology-Driven Tag-Based Video Location Estimation. 243-246.
Choi, J., & Friedland G. (2011).  Data-Driven vs. Semantic-Technology-Driven Tag-Based Video Location Estimation. 243-246.
Friedland, G., & Sommer R. (2010).  Cybercasing the Joint: On the Privacy Implications of Geotagging.

Pages