Publications
A Scalable Web Cache Consistency Architecture.
Proceedings of the ACM Conference on Applications, Technologies, Architectures, and Protocols for Computer Communication (SIGCOMM '99). 163-174.
(1999).
(2013). Almost an expert: The effects of rubrics and expertise on perceived value of crowdsourced design critiques.
Proceedings of the 19th ACM Conference on Computer-Supported Cooperative Work & Social Computing. 1005-1017.
(2016).
(2012).
(2012).
Discretized Streams: An Efficient and Fault-Tolerant Model for Stream Processing on Large Clusters.
1-6.
(2012). Apache Spark: a unified engine for big data processing.
Communications of the ACM. 59(11), 56-65.
(2016). Fast and Interactive Analytics Over Hadoop Data with Spark.
USENIX ;login:. 34(4), 45-51.
(2012).
(2010).
(2011).
(2012). Delay Scheduling: A Simple Technique for Achieving Locality and Fairness in Cluster Scheduling.
265-278.
(2010).
(2011).
(2012).
(2009).
(2011).
(2010). Inference and Analysis of Haplotypes from Combined Genotyping Studies Deposited in dbSNP.
Genome Research. 15(11), 1594-1600.
(2005). Leveraging Genetic Variability Across Populations for the Identification of Causal Variants.
The American Journal of Human Genetics. 86(1), 23-33.
(2010). Leveraging the HapMap Correlation Structure in Association Studies.
American Journal of Human Genetics. 80, 683-691.
(2007). Fast and Intuitive Clustering of Web Documents.
Proceedings of the Third International Conference on Knowledge Discovery and Data Mining. 287-290.
(1997). Axiomatizing Congestion Control.
Proceedings of the ACM on Measurement and Analysis of Computing Systems. 3(2),
(2019). Semantic categories of artifacts and animals reflect efficient coding.
Proceedings of the 41st Annual Meeting of the Cognitive Science Society.
(2019). Evolution and efficiency in color naming: The case of Nafaanra.
Proceedings of the 41st Annual Meeting of the Cognitive Science Society.
(2019). Communicative need in colour naming.
Cognitive Neuropsychology.
(2019).