#ID  Title  Author  Bibiographic  Year  Month  Day  GroupID  Picture  PDF  Abstract 000001Automatic Dialog Act Segmentation and Classification in Multiparty MeetingsJ. Ang, Y. Liu and E. Shriberg2005 IEEE International Conference on Acoustics, Speech, and Signal
Processing, Philadelphia, PA2005____000001http://www.icsi.berkeley.edu/ftp/global/pub/speech/papers/icassp2005-da-seg-class.pdfWe explore the two related tasks of dialog act (DA) segmentation and DA classification for speech from the ICSI Meeting Corpus. We employ simple lexical and prosodic knowledge sources, and compare results for human-transcribed versus automatically recognized words. Since there is little previous work on DA segmentation and classification in the meeting domain, our study provides baseline performance rates for both tasks. We introduce a range of metrics for use in evaluation, each of which measures different aspects of interest. Results show that both tasks are difficult, particularly for a fully automatic system. We find that a very simple prosodic model aids performance over lexical information alone, especially for segmentation. Both tasks, but particularly word-based segmentation, are degraded by word recognition errors. Finally, while classification results for meeting data show some similarities to previous results for telephone conversations, findings also suggest a potential difference with respect to the effect of modeling DA context. 000002Tonotopic Multi-Layered Perceptron: A Neural Network for LearningB. Y. Chen, Q. Zhu, N. Morgan2005 IEEE International Conference on Acoustics, Speech, and Signal20050303000001http://www.icsi.berkeley.edu/ftp/global/pub/speech/papers/icassp2005-chen.pdf 000003Improved Phonetic Speaker Recognition Using Lattice DecodingA. O. Hatch, B. Peskin, A. Stolcke2005 IEEE International Conference on Acoustics, Speech, and Signal20050303000001http://www.icsi.berkeley.edu/ftp/global/pub/speech/papers/icassp2005-spkr-phonelats.pdf 000004Multi-rate and variable-rate modeling of speech at phone and syllable time scalesOzgur Cetin and Mari Ostendorf2005 IEEE International Conference on Acoustics, Speech, and SignalProcessing, Philadelphia, PA20050103000001 000005Speaker Detection Without ModelsD. Gillick, S. Stafford, B. Peskin2005 IEEE International Conference on Acoustics, Speech, and Signal
Processing, Philadelphia, PA20050303000001http://www.icsi.berkeley.edu/ftp/global/pub/speech/papers/icassp2005-gillick.pdfIn order to capture sequential information and to take advantage of extendedtraining data conditions, we developed an algorithm for speaker detectionthat scores a test segment by comparing it directly to similar instances ofthat speech in the training data. This non-parametric technique, though atan early stage in its development, achieves error rates close to 1% on theNIST 2001 Extended Data task and performs extremely well in combination witha standard Gaussian Mixture Model system. We also present a new scoringmethod that significantly improves performance by capturing only positiveevidence. 000006Modeling Prosodic Feature Sequences for SpeakerRecognitionE. Shriberg, L. Ferrer, S. Kajarekar, A. Venkataraman, A. StolckeTo appear in Speech Communication20050103000001We describe a novel approach to modeling idiosyncraticprosodic behavior for automatic speaker recognition. The approachcomputes various duration, pitch, and energy features for eachestimated syllable in speech recognition output, quantizes thefeatures, forms N-grams of the quantized values, and models normalizedcounts for each feature N-gram using support vector machines(SVMs). We refer to these features as "SNERF-grams" (N-grams ofSyllable-based Nonuniform Extraction Region Features). Evaluation ofSNERF-gram performance is conducted on two-party spontaneous Englishconversational telephone data from the Fisher corpus, using oneconversation side in both training and testing. Results show thatSNERF-grams provide significant performance gains when combined with astate-of-the-art baseline system, as well as with two highlysuccessful long-range feature systems that capture word usage andlexically constrained duration patterns. Further experiments examinethe relative contributions of features by quantization resolution,N-gram length, and feature type. We find that longer N-grams arebetter than shorter ones, and that pitch features are most useful,followed by duration and energy features. The most important pitchfeatures are those capturing pitch level, whereas the most importantenergy features reflect patterns of rising and falling. For durationfeatures, nucleus duration is more important for speaker recognitionthan are durations from the onset or coda of a syllable. Overall, wefind that SVM modeling of prosodic feature sequences yields valuableinformation for automatic speaker recognition. It also offers rich newopportunities for exploring how speakers differ from each other involuntary but habitual ways. 000007Structural Metadata Research in the EARS ProgramY. Liu, E. Shriberg, A. Stolcke, B. Peskin, J. Ang, D. Hillard, M. Ostendort, M. Tomalin, P. Woodland, and M. Harper2005 IEEE International Conference on Acoustics, Speech, and Signal
Processing, Philadelphia, PA200503__000001http://www.icsi.berkeley.edu/ftp/global/pub/speech/papers/icassp2005-mde.pdfBoth human and automatic processing of speech require recognition of more than just words. In this paper we provide a brief overview of research on structural metadata extraction in the DARPA EARS rich transcription program. Tasks include detection of sentence boundaries, filler words, and disfluencies. Modeling approaches combine lexical, prosodic, and syntactic information, using various modeling techniques for knowledge source integration. The performance of these methods is evaluated by task, by data source (broadcast news versus spontaneous telephone conversations) and by whether transcriptions come from humans or from an (errorful) automatic speech recognizer. A representative sample of results shows that combining multiple knowledge sources (words, prosody, syntactic information) is helpful, that prosody is more helpful for news speech than for conversational speech, that word errors significantly impact performance, and that discriminative models generally provide benefit over maximum likelihood models. Important remaining issues, both technical and programmatic, are also discussed. 000008The ICSI/SRI/UW RT04 Structural Metadata Extraction SystemY. Liu, E. Shriberg, A. Stolcke, B. Peskin, M. Harper.RT-04 EARS Workshop20040103000001Both human and automatic processing of speech require recognizing more than just the words. We describe the ICSI-SRI-UW metadata detection system in both broadcast news and spontaneous telephone conversations, developed as part of the DARPA EARS Rich Transcription program. System tasks include sentence boundary detection, filler word detection, and detection/correction of disfluencies. To achieve best performance, we combine information from different types of textual knowledge sources (based on words, part-of-speech classes, and automatically induced classes) with information from a prosodic classifier. The prosodic classifier employs bagging and ensemble approaches to better estimate posterior probabilities. In addition to our previous HMM approach, we investigate using a maximum entropy (Maxent) and a conditional random field (CRF) approach for various tasks. Results using these techniques are presented for the 2004 NIST Rich Transcription metadata tasks. 000009Structural Event Detection for Rich Transcription of SpeechY. LiuPh.D Thesis, Purdue University20041219000001http://www.icsi.berkeley.edu/~yangl/thesis.pdfAlthough speech recognition technology has significantly improved during the past few decades, current speech recognition systems output only a stream of words without providing other useful structural information that could aid a human reader and downstream language processing modules.This thesis research focuses on the automatic detection of several helpful structural events in speech, including sentence boundaries, type of utterance, filled pauses, discourse markers, and edit disfluencies. The systems evaluated combine prosodic cues and textual information sources in a variety of ways to support automatic detection of these structural events. Experiments were conducted across corpora (conversational speech and broadcast news speech) and with different transcription quality (human transcriptions versus recognition output).The imbalanced data problem is investigated for training thedecision tree prosody model component of our system because structuralevents are much less frequent than non-events. A variety of sampling approaches and bagging are used to address this imbalance. Significant performance improvements are obtained via bagging. Some of the sampling methods are useful depending on the performance metrics used. Sentence boundary detection and disfluency detection tasks are impacted differently by sampling, bagging, and boosting, suggesting the inherent differences between the two tasks.A variety of methods for combining knowledge sources are examined:a hidden Markov model (HMM), the maximum entropy (Maxent) model, and theconditional random field (CRF). The Maxent and CRF approaches arediscriminatively trained to model the posterior probabilities and thuscorrelate with the performance measures. They also support the use of more correlated features and so enable the combination of a variety of textual information sources. The HMM and CRF both model sequence information, unlike the Maxent which explicitly models local information. A model that combines these three approaches is superior to any method alone.Interactions with other research efforts suggest that the methods developed in this thesis generalize well to other corpora (e.g., a multimodal corpus, a multiparty meeting corpus) and to similar tasks (e.g., a gestural model, dialog act segmentation and classification). 000010Computing the Similarity of Two Sequences with Nested Arc AnnotationsJ. Alber, J. Gramm, J. Guo and R. NiedermeierTheoretical Computer Science, 312(2-3):377-35820040105000003 000011Approximation and Collusion in Multicast Cost SharingA. Archer, J. Feigenbaum, A. Krishnamurthy, R. Sami and S. ShenkerEconomic Behavior, 47, pp. 367120040105000002 000012A Layered Naming Architecture for the InternetH. Balakrishnan, K. Lakshminaryanan, S. Ratnasamy, S. Shenker, I. Stoica and M. WalfishProceedings of SIGCOMMM20040105000002 000013Towards Optimally Multiplexed Applications of Universal ArraysA. Ben-Dor, T. Hartman, R.M. Karp, B. Schwikowski, R. Sharan and Z. YakhiniJournal of Computational Biology20040105000003 000014Using Heriarchical Location Names for Scalable Routing and Rendevous in Wireless Sensor NetworksF. Bian, R. Govindan and S. ShenkerSubmitted to Sensys20040105000002 000015Using TCP DSACKs and SCTP Duplicate TSNs to Detect Spurious RetransmissionsE. Blanton and M. AllmanRFC 370820040205000002 000016Text-Constrained Speaker Recognition on a Text-Independent TaskK. Boakye and B. PeskinOdyssey 2004 - The Speaker and Language Recognition Workshop, Toledo, Spain20040605000001http://www.icsi.berkeley.edu/ftp/global/pub/speech/papers/spkrodyssey04-kofi.pdf 000017Scalable Construction-Based Parsing and Semantic AnalysisJ. BryantIn Proceedings of ScaNaLU-2004, Boston, MA20040105000004 000018Tight Lower Bounds for the Asymetric k-Center ProblemJ. Chuzhoy, S. Guha, E. Halperin, S. Khanna, G. Kortsarz and S. NaorIn Proceedings of the 36th Annual ACM Symposium on Theory of Computing (STOC04)20040105000003 000019Global Synchronization in SensornetsJ. Elson, R.M. Karp, C.H. Papadimitriou and S. ShenkerIn Proceedings of LATIN, 609-62420040105000002 000020Optimally Phasing Long Genomic Regions using Local Haplotype PredictionsE. Eskin, E. Halperin and R. SharanIn Proceedings of the Second RECOMB Satellite Workshop on Computational Methods for SNPs and Haplotypes, pp. 13-1620040105000003 000021Embodied Meaning in a Neural Theory of LanguageJ. Feldman and S. NarayananBrain and Language 89, pp 385-392, Elsevier Press Incorporated2004____000004 000022Limited Slow-Start for TCP with Large Congestion WindowsS. FloydRFC 3742, Experimental20040305000002 000023The NewReno Modification to TCP's Fast Recovery AlgorithmS. Floyd, T. Henderson and A. GurtovRFC 3782, Proposed Standard20040405000002 000024IAB Concerns Regarding Congestion Control for Voice Traffic in the InternetS. Floyd and J. Kempf, EditorsRFC 3714, Informational20040305000002 000025Speech Recognition TechnologyH. Franco, F. Beaufays, N. Morgan and H. BourlardChapter in Handbook of Brain Theory and Neural Networks, 2nd edition, M. Arbib ed. MIT Press20040105000001 000026Identifying Agreement and Disagreement in Conversational Speech: Use of Bayesian Networks to Model Pragmatic DependenciesM. Galley, K. McKeown, J. Hirschberg and E. ShribergProceedings of 42nd Meeting of the ACL, July 21-26, Barcelona200407__000001http://www.icsi.berkeley.edu/ftp/global/pub/speech/papers/acl2004-agreement-bayesian.pdf 000027Reconstructing Chain Functions in Genetic NetworksI. Gat-Viks, R. Shamir, R.M. Karp and R. SharanIn Proceedings of the Ninth Pacific Symposium on Biocomputing (PSB'04), pp. 498-50920040105000003 000028Load Balancing in Dynamic Structured P2P SystemsB. Godfrey, R.M. Karp, K. Lakshminarayanan, S. Surana and I. StoicaProceedings of INFOCOMM20040105000003 000029A polynomial-time algorithm for the matching of crossing contact-map patternsJ. GrammIn Proceedings of the 4th International Workshop on Algorithms in Bioinformatics (WABI 2004), Bergen, Norway20040905000003 000030Experimental Design for Multiplexed SNP GenotypingJ. Gramm, A. Ben-Dor and R. SharanManuscript20040105000003 000031Graph-modeled data clustering: fixed-parameter algorithms for clique generationJ. Gramm, J. Guo, F. Hüffner and R. NiedermeierTheory of Computing Systems20040105000003 000032Automated generation of search tree algorithms for hard graph-modification problemsJ. Gramm, J. Guo, F. Hüffner and R. NiedermeierAlgorithmica, 39(4):321-34720040105000003 000033Parameterized Intractability of Distinguishing Substring SelectionJ. Gramm, J. Guo and R. NiedermeierTheory of Computing Systems20040105000003 000034Pattern matching for arc-annotated sequencesJ. Gramm, J. Guo and R. NiedermeierJournal of Algorithms20040105000003 000035On the Complexity of Haplotyping via Perfect PhylogenyJ. Gramm, T. Nierhoff, R. Sharan and T. TantauIn Proceedings of the Second RECOMB Satellite Workshop on Computational Methods for SNPs and Haplotypes, pp. 35-4620040105000003 000036Perfect Path Phylogeny Haplotyping with Missing Data is Fixed-Parameter TractableJ. Gramm, T. Nierhoff and T. TantauIn Proceedings of the First International Workshop on Parameterized and Exact Computation (IWPEC 2004), Bergen, Norway20040905000003 000037Haplotype Reconstruction from Genotype Data Using Imperfect PhylogenyE. Halperin and E. EskinBioinformatics20040105000003 000038Perfect Phylogeny and Haplotype AssignmentE. Halperin and R.M. KarpIn Proceedings of RECOMB20040105000003 000039The Minimum-Entropy Set Cover ProblemE. Halperin and R.M. KarpIn Proceedings of ICALP20040105000003 000040Color naming lens aging and grue: What the optics of the aging eye can teach us about color languageJ. Hardy, C. Frederick, P. Kay and J. WernerPsychological Science20040105000004 000041A 1.5-Approximation Algorithm for Sorting by Transpositions and TransversalsT. Hartman and R. SharanIn Proceedings of WABI20040905000003 000042Desperately Seeking Impostors: Data-Mining for Competitive Impostor Testing in a Text-Dependent Speaker Verification SystemM. Hebert and N. MirghaforiIn Proceedings of IEEE ICASSP, Montreal20040505000001http://www.icsi.berkeley.edu/~nikki/papers/lev_icassp04.pdf 000043Show what you know: musings on the reporting of negative results in speech recognition researchH. Hermansky and N. MorganJournal of Negative Results in Speech and Audio Sciences20040105000001http://journal.speech.cs.cmu.edu/ 000044Improving Automatic Sentence Boundary Detection with Confusion NetworksD. Hillard, M. Ostendorf, A. Stolcke, Y. Liu and E. ShribergIn Proceedings of HLT-NAACL Conference, Boston20040405000001http://www.icsi.berkeley.edu/ftp/global/pub/speech/papers/hlt2004-su-nets.pdf 000045The ICSI Meeting Project: Resources and ResearchA. Janin, J. Ang, S. Bhagat, R. Dhillon, J. Edwards, J. Macias-Guarasa, N. Morgan, B. Peskin, E. Shriberg, A. Stolcke, C. Wooters and B. WredeNIST ICASSP 2004 Meeting Recognition Workshop, Montreal20040505000001http://www.icsi.berkeley.edu/ftp/global/pub/speech/papers/nist2004-meeting-janin.pdf 000046Fast Portscan Detection Using Sequention Hypothesis TestingJ. Jung, V. Paxson, A. Berger and H. BalakrishnanProceedings of IEEE Symposium on Security and Privacy20040505000002 000047Spurring the Adoption of DHTs with OpenHashB. Karp, S. Ratnasamy, S. Rhea and S. ShenkerIn Third International Workshop on Peer-to-Peer Systems (IPTPS)20040105000002 000048Finite-Length Analysis of LT-codesR.M. Karp, M. Luby and A. ShokrollahiIn Proceedings of International Symposium on Information Theory (ISIT)20040105000003 000049Color categories are not arbitraryP. KayJournal of Cross-Cultural Research20040105000004 000050Pragmatic Aspects of Grammatical ConstructionsP. KayIn L. Horn and G. Ward, eds., Hanbook of Pragmatics, Oxford and London: Blackwell20040105000004 000051Argument Constructions and the Argument-Adjunct DistinctionP. KayH. Boas and M. Fried, eds, Construction Grammar: Back to the Roots. Amsterdam: John Benjamins20050105000004 000052PathBLAST: A tool for alignment of protein interaction networksB.P. Kelley, B. Yuan, F. Lewitter, R. Sharan, B.R. Stockwell and T. IdekerNucleic Acids Research 32, pp. W83-W8820040105000003 000053Practical and Robust Geographic Routing in Wireless NetworksY. Kim, R. Govindan, B. Karp and S. ShenkerSubmitted to Mobicom20040105000002 000054Computational Problems in Noisy SNP and Haplotype Analysis: Block Scores, Block Identification and Population StratificationG. Kimmel, R. Sharan and R. ShamirINFORMS Journal on Computing, in press20040105000003 000055Explicit Transport Error Notification (ETEN) for Error-Prone Wireless and Satellite NetworksR. Krishnan, J. Sterbenz, W. Eddy, C. Partridge and M. AllmanComputer Networks, 46(3), October 2004.200410__000002Wireless and satellite networks often have non-negligible packet corruption rates that can significantly degrade TCP performance. This is due to TCP's assumption that every packet loss is an indication of network congestion (causing TCP to reduce the transmission rate). This problem has received much attention in the literature. In this paper, we take a broad look at the problem of enhancing TCP performance under corruption losses, and include a discussion of the key issues. The main contributions of this paper are: (i) a confirmation of previous studies that show the reduction of TCP performance in the face of corruption loss, and in addition a plausible upper bound achievable with perfect knowledge of the cause of loss, (ii) a classification of the potential mitigation space, and (iii) the introduction of a promising new mitigation that employs rich cumulative information from intermediate nodes in a path to form a better congestion response. We first illustrate the performance implications of corruption-based loss for a variety of networks via simulation. In addition, we show a rough upper bound on the performance gains a TCP could get if it could perfectly determine the cause of each segment loss -- independent of any specific mechanism for TCP to learn the root cause of packet loss. Next, we provide a taxonomy of potential practical classes of mitigations that TCP end-points and intermediate network elements can cooperatively use to decrease the performance impact of corruption-based loss. Finally, we briefly consider a potential mitigation, called cumulative explicit transport error notification (CETEN), which covers a portion of the solution space previously unexplored. CETEN is shown to be a promising mitigation strategy, but a strategy with numerous formidable practical hurdles still to overcome.Wesley Eddy, Shawn Ostermann, Mark Allman. New Techniques for MakingTransport Protocols Robust to Corruption-Based Loss. ACM ComputerCommunication Review, 34(5), October 2004.Abstract: Current congestion control algorithms treat packet loss as an indication of network congestion, under the assumption that most losses are caused by router queues overflowing. In response to losses (congestion), a sender reduces its sending rate in an effort to reduce contention for shared network resources. In network paths where a non-negligible portion of loss is caused by packet corruption, performance can suffer due to needless reductions of the sending rate (in response to ``perceived congestion'' that is not really happening). This paper explores a technique, called Cumulative Explicit Transport Error Notification (CETEN), that uses information provided by the network to bring the transport's long-term average sending rate closer to that dictated by only congestion-based losses. We discuss several ways that information about the cumulative rates of packet loss due to congestion and corruption might be obtained from the network or through fairly generic transport layer instrumentation. We then explore two ways to use this information to develop a more appropriate congestion control response (CETEN). The work in this paper is done in terms of TCP. Since numerous transport protocols use TCP-like congestion control schemes, the CETEN techniques we present are applicable to other transports as well. In this paper, we present early simulation results that show CETEN to be a promising technique. In addition, this paper discusses a number of practical and thorny implementation issues associated with CETEN.Alberto Medina, Mark Allman, Sally Floyd. Measuring Interactions BetweenTransport Protocols and Middleboxes. ACM SIGCOMM/USENIX InternetMeasurement Conference, Taormina, Sicily, Italy, October 2004.Abstract: In this paper we explore the current network environment with respect to how the network's evolution ultimately impacts end-to-end protocols. The traditional end-to-end assumptions about the Internet are increasingly challenged by the introduction of intermediary network elements (middleboxes) that intentionally or unintentionally prevent or alter the behavior of end-to-end communications. This paper provides measurement results showing the impact of the current network environment on a number of traditional and proposed protocol mechanisms (e.g., Path MTU Discovery, Explicit Congestion Notification, etc.). We present results of measurements taken using an active measurement framework to study web servers. We analyze our results to gain further understanding of the differences between the behavior of the Internet in theory versus the behavior we observed through measurements. In addition, these measurements can be used to guide the definition of more realistic Internet modeling scenarios. 000056Routing as a ServiceK. Lakshminarayanan, I. Stoica and S. ShenkerPreprint 200420040105000002 000057Trickle: A Self-Regulation Algorithm for Code Propogation and Maintenance in Wireless Sensor NetworksP. Levis, N. Patel, D. Culler and S. ShenkerNSDI20040105000002 000058Comparing and Combining Generative and Posterior Probability Models: Some Advances in Sentence Boundary Detection in SpeechY. Liu, A. Stolcke, E. Shriberg and M. HarperIn Proceedings of Conference on Empirical Methods in Natural Language Processing, Barcelona200407__000001http://www.icsi.berkeley.edu/ftp/global/pub/speech/papers/emnlp2004-maxent.pdf 000059Parameterization of the Score Threshold for a Text-Dependent Adaptive Speaker Verification SystemN. Mirghafori and M. HebertIn Proceedings of IEEE ICASSP, Montreal20040505000001http://www.icsi.berkeley.edu/~nikki/papers/fcdt_icassp04.pdf 000060Scaling Understanding up to Mental SpacesE. Mok, J. Bryant and J.A. FeldmanIn Proceedings of ScaNaLU-2004, Boston, MA20040105000004 000061Stochastic direct reinforcement: Representations, recurrence and stochastic gamesJ. Moody, Y. Liu, M. Saffell and K. YounSubmitted for publication, 200420040105000003 000062Speech Recognition and the Auditory PerspectiveN. Morgan, H. Bourlard and H. HermanskyChapter in Speech Processing in the Auditory System, S. Greenberg and W. Ainsworth, eds, Springer20040105000001 000064TRAPping Conversational Speech: Extending TRAP/Tandem approaches to conversational telephone speech recognitionN. Morgan, B. Y. Chen, Q. Zhu, and A. StolckeIn Proceedings of IEEE ICASSP, Montreal20040505000001http://www.icsi.berkeley.edu/ftp/global/pub/speech/papers/icassp04-cts.pdf 000065CPRM: An expressive probabilistic framework for reasoning about event structureS. Narayanan and J. FeldmanSubmitted for publication, 20042004____000004 000066On the Reducibility of Sets Inside with Low Information ContentM. Ogihara and T. Tantauin JCSS20040105000003 000067Incomplete Directed Perfect PhylogenyI. Pe'er, T. Pupko, R. Shamir and R. SharanSIAM Journal on Computing 33(3), pp. 590-60720040105000003 000068Time delay based failure-robust direction of arrival estimationT. Pirinen and J. Yli-HietanenProceedings of IEEE SAM 2004, Sitges, Barcelona, Spain20040705000001http://www.cs.tut.fi/sgn/arg/tuomop/sam2004.pdf 000069Detection and compensation of sensor malfunction in time delay based direction of arrival estimationT. Pirinen, J. Yli-Hietanen, P. Pertilä and A. VisaIn Proceedings of IEEE ISCAS, Vancouver20040505000001http://www.cs.tut.fi/sgn/arg/tuomop/iscas04.pdf 000070A Comparison of Application-Level and Router-Assisted Hierarchical Schemes for Reliable MulticastP. Radoslavov, C. Papadopoulos, R. Govindan and D. EstrinIEEE/ACM Transactions on Networking (TON), Volume 12, Issue 3, pp. 469-48220040605000002 000071Prefix Hash Tree: An Indexing Data Structure over Distributed Hash TablesS. Ramabhadran, S. Ratnasamy, J.M. Hellerstein and S. ShenkerSubmitted to PODC 200420040105000002 000072Color naming and sunlightT. Regier and P. KayPsychological Science 15, 288-28920040105000004 000073A Discriminative Model for Identifying Spatial Cis-Regulatory ModulesE. Segal and R. SharanIn Proceedings of RECOMB 2004, pp. 141-149; also submitted to Journal of Computational Biology20040105000003 000074Optimal Fully Dynamic Algorithms for Cograph RecognitionR. Shamir and R. SharanDiscrete Applied Mathematics 136, pp. 329-34020040105000003 000075Cluster Graph Modification ProblemsR. Shamir, R. Sharan and D. TsurDiscrete Applied Mathematics20040105000003 000076CREME: Cis-Regulatory Module Explorer for the Human GenomeR. Sharan, A. Ben-Hur, G.G. Loots and I. OvcharenkoIn Nucleic Acids Research 32, pp. W253-W25620040105000003 000077Multiplexing Schemes for Generic SNP Genotyping AssaysR. Sharan, A. Ben-Dor and Z. YakhiniIn Proceedings of PSB'04, pp. 140-15120040105000003 000078Identification of Protein Complexes by Comparative Analysis of Yeast and Bacterial Protein Interaction DataR. Sharan, T. Ideker, B.P. Kelley, R. Shamir and R.M. KarpIn Proceedings of RECOMB 2004, pp. 282-28920040105000002 000079From Transient Patterns to Persistent Structures: A model of episodic memory formation via cortic-hippocampal interactionsL. ShastriBehavioral and Brain Science, in revision20040105000004 000080The ICSI Meeting Recorder Dialog Act (MRDA) CorpusE. Shriberg, R. Dhillon, S. Bhagat, J. Ang, and H. CarveyIn Proceedings of HLT-NAACL SIGDIAL Workshop, April-May 2004, Boston20040405000001http://www.icsi.berkeley.edu/ftp/global/pub/speech/papers/hlt04_sigdial_dacorpus.pdf 000081Progress in Meeting Recognition: The ICSI-SRI-UW Spring 2004 Evaluation SystemA. Stolcke, C. Wooters, N. Mirghafori, T. Pirinen, I. Bulyko, D. Gelbart, M. Graciarena, S. Otterson, B. Peskin and M. OstendorfIn NIST ICASSP 2004 Meeting Recognition Workshop, Montreal20040505000001http://www.icsi.berkeley.edu/ftp/global/pub/speech/papers/nist2004-meeting-system.pdf 000082Towards a Next Generation Inter-domain Routing ProtocolL. Subramanian, M. Caesar, M. Handley, M Mao, S. Shenker and I. StoicaPreprint 200420040105000002 000083Listen and Whisper: Security Mechanisms for BGPL. Subramanian, V. Roth, I. Stoica, S. Shenker and R. KatzNSDI20040105000002 000084Revealing Modularity and Organization in the Yeast Molecular Network by Integrated Analysis of Highly Heterogeneous Genome-Wide DataA. Tanay, R. Sharan, M. Kupiec and R. ShamirIn Proceedings of the National Academy of Sciences USA 101, pp. 2981-298620040105000003 000085Comparing Verboseness for Finite Automata and Turing MachinesT. TantauTheory of Computing Systems, Vol. 31, No. 1, pp. 95-10920040105000003 000086A Logspace Approximation Scheme for the Shortest Path Problem for Graphs with Bounded Independence NumberT. TantauIn Proceedings of STACS 2004, Lecture Notes on Computer Science, Springer-Verlag20040105000003 000087Untangling the Web from DNSM. Walfish, H. Balakrishnan and S. ShenkerNSDI20040105000002 000088Reflections on Witty: Analyzing the AttackerN. Weaver and D. Ellis;login: pp. 34-3720040605000002 000089Worms vs. Perimeters: The Case for Hard-LANsN. Weaver, D. Ellis, S. Staniford and V. PaxsonHot Interconnects 1220040805000002 000090A Worst-Case WormN. Weaver and V. PaxsonIn Proceedings of Third Annual Workshop on Economics and Information Security (WEIS04)20040505000002 000091Very Fast Containment of Scanning WormsN. Weaver, S. Staniford and V. PaxsonIn Proceedings of USENIX Security Symposium20040105000002 000092The Success of Open SourceS. WeberCambridge, MA, Harvard University Press2004____000005 000093Connectionist Mechanisms for Cognitive ControlC. Wendelken and L. ShastriSubmitted for publication, 200420040105000004 000094Avoiding Forbidden Submatrices by Row DeletionsS. Wernicke, J. Alber, J. Gramm, J. Guo and R. NiedermeierIn Proceedings of SOFSEM 04, LNCS, Vol. 2832, pp. 349-360, Springer20040105000003 000095The 2004 ICSI-SRI-UW Meeting Recognition SystemC. Wooters, N. Mirghafori, A. Stolcke, T. Pirinen, I Bulyko, D. Gelbart, M. Graciarena, S. Otterson, B. Peskin and M. OstendorfIn Proceedings of the Joint AMI/Pascal/IM2/M4 Workshop on Meeting Recognition. Also published in Lecture Notes in Computer Science, Volume 3361 / 2005.200501__000001http://www.icsi.berkeley.edu/ftp/global/pub/speech/papers/meeteval04_amispringer.pdf 000096MotifPrototyper: A Bayesian Profile Model for Motif FamiliesE.P. Xing and R.M. KarpIn Proceedings of the National Academy of Sciences20040105000003 000097Bayesian Haplotype Inference via the Dirichlet ProcessE.P. Xing, R. Sharan and M.I. JordanIn Proceedings of the Second RECOMB Satellite Workshop on Computational Methods for SNP and Haplotypes, pp. 99-112; Also In Proceedings of ICML 200420040105000003 000098From Molecule to Metaphor: A Neural Theory of LanguageJ. FeldmanUnpublished. Available at ftp://ftp.icsi.berkeley.edu/pub/ai/jfeldman/tech/2004____000004ftp://ftp.icsi.berkeley.edu/pub/ai/jfeldman/tech/ 000099On the Impact of Bursting on TCPE. Blanton and M. AllmanTo appear in Proceedings of the Passive and Active Measurement Workshop, Boston, MA, March 2005200503__000002Periodically in the transport protocol research community, the idea of introducing a burst mitigation strategy is voiced. In this paper we assess the prevalence and implications of bursts in the context of real TCP traffic in order to better inform a decision on whether TCP's congestion control algorithms need to incorporate some form of burst suppression. After analyzing traffic from three networks, we find that bursts are fairly rare and only large bursts (of hundreds of segments) cause loss in practice. 000100Towards Robust Speaker Segmentation: The ICSI-SRI Fall 2004 Diarization SystemC. Wooters, J. Fung, B. Peskin and X. AngueraRT-04F Workshop, Nov. 2004200411__000001http://www.icsi.berkeley.edu/ftp/global/pub/speech/papers/EARS-RT04f-spkr.pdf 000101The Contribution of Framenet to Practical LexicographyS. Atkins, M. Rundell and H. SatoInternational Journal of Lexicography, Volume 16.3: 333-357. 2003.2003____000004 000102Lexicographic Relevance: Selecting Information From Corpus EvidenceS. Atkins, C.J. Fillmore and C.R. JohnsonInternational Journal of Lexicography, Volume 16.3: 251-280. 2003.2003____000004 000103The Structure of the Framenet DatabaseC.F. Baker, C. J. Fillmore and B. CroninInternational Journal of Lexicography, Volume 16.3: 281-296. 2003.2003____000004 000104The FrameNet Data and SoftwareC.F. Baker and H. SatoPoster and Demonstration at Association for Computational Linguistics, Sapporo, Japan. 2003.2003____000004 000105Embodied verbal semantics: evidence from an image-verb matching taskJ. Feldman, B. Bergen and S. NarayananProc. Cognitive Science Conference. Boston, August 2003.200308__000004 000106ConnectionismJ. Feldman and L. ShastriIn Encyclopedia of Cognitive Science, Nature Publishing Group, MacMillan, London, 2003.2003____000004 000107Background to FramenetC.J. Fillmore, C.R. Johnson and M.R.L. PetruckInternational Journal of Lexicography, Vol 16.3: 235-250. 2003.2003____000004 000108FrameNet GlossaryC.J. Fillmore and M.R.L. PetruckInternational Journal of Lexicography, Vol. 16.3: 359-3612003____000004 000109FrameNet in Action: The Case of AttachingC.J. Fillmore, M.R.L. Petruck, J. Ruppenhofer and A. WrightInternational Journal of Lexicography, Vol. 16.3: 297-3322003____000004 000110Semantic Extraction with Wide-Coverage Lexical ResourcesB. Mohit and S. NarayananIn Marti Hearst and Mari Ostendorf, (eds.) HLT-NAACL 2003: Short Papers. Edmonton, Alberta, Canada, pp. 64-662003____000004 000111FrameNet Meets the Semantic Web: Lexical Semantics for the WebS. Narayanan, C. Baker, C. Fillmore and M. PetruckIn Fensel, Dieter, Katia Sycara and John Mylopoulos (Eds.) The Semantic Web - ISWC 2003: 771-787. Springer-Verlag, Berlin2003____000004 000112Analysis and Simulation of Web ServicesS. Narayanan and S. McIlraithComputer Networks (accepted for publication).2003____000004 000113Putting FrameNet Data into the ISO Linguistic Annotation FrameWorkS. Narayanan, M.R.L. Petruck, C.F. Baker and C.J. FillmoreIn Proceedings of the ACL 2003 Workshop on Linguistic Annotation: Getting the Model Right, Sapporo, Japan2003____000004 000114All in a Day's WeekM.R.L. Petruck and H.C. BoasPresentation at Workshop on Frame Semantics. Prague, Czech Republic20030729000004 000115Surprise: Spanish FrameNet!C. Subirats-Rüggeberg and M.R.L. PetruckPresentation at Workshop on Frame Semantics, International Congress of Linguists. Prague, Czech Republic.20030729000004 000116FrameNet's Frames vs. Levin's Verb ClassesC.F. Baker and J. RuppenhoferIn J. Larson and M. Paster (Eds.) In Proceedings of the 28th Annual Meeting of the Berkeley Linguistics Society. 27-382002____000004 000117Bilingual FrameNet Dictionaries for Machine TranslationH.C. BoasIn M. González Rodríguez and C. Paz Suárez Araujo (eds.), Proceedings of the Third International Conference on Language Resources and Evaluation. Las Palmas, Spain. Vol. IV: 1364-13712002____000004 000118Scaling Cognitive Linguistics: Formalisms for Language UnderstandingN. Chang, J. Feldman, R. Porzel and K. SandersWorkshop on Scalable Natural Language Understanding. European Media Laboratory. Heidelberg2002____000004 000119Putting Frames in PerspectiveN. Chang, S. Narayanan and M.R.L. PetruckProceedings COLING 2002. Nineteenth International Conference on Computational Linguistics, Taipei, Taiwan2002____000004 000120From Frames to InferenceN. Chang, S. Narayanan and M.R.L. PetruckIn Proceedings of the First International Workshop on Scalable Natural Language Understanding, Heidelberg, Germany2002____000004 000121Herds of Wildebeest, Flasks of Vodka, Heaps of Trouble: An Embodied Construction Grammar Approach to English Measure PhrasesE. Dodge and A.C. WrightIn J. Larson and M. Paster (Eds.) Proceedings of the 28th Annual Meeting of the Berkeley Linguistics Society: 75-86.2002____000004 000122The FrameNet Database and Software ToolsC.J. Fillmore, C.F. Baker and H. SatoIn Proceedings of the Third International Conference on Language Resources and Evaluation (LREC). Las Palmas. 1157-1160.2002____000004 000123Seeing Arguments through Transparent StructuresC.J. Fillmore, C.F. Baker and H. SatoIn Proceedings of the Third International Conference on Language Resources and Evaluation (LREC). Las Palmas. 787-912002____000004 000124Transparency and Building Lexical Dependency GraphsC.J. Fillmore and H. SatoIn J. Larson and M. Paster (eds.) Proceedings of the 28th Annual Meeting of the Berkeley Linguistics Society. 87-992002____000004 000125Automatic Labeling of Semantic RolesD. Gildea and D. JurafskyComputational Linguistics, Vol 28.3: 245-2882002____000004 000126FrameNet Meets the Semantic Web: A DAML+OIL Frame RepresentationS. Narayanan, C.J. Fillmore, C.F. Baker and M.R.L. PetruckIn Proceedings of the The Eighteenth National Conference on Artificial Intelligence. Edmonton, Canada2002____000004 000127A new view of the medial temporal lobes and the structure of memoryC. Ranganath, L. Shastri and M. D'EspositoICSI Technical Report TR-02-001200202__000004 000128Collocational Information in the FrameNet DatabaseJ. Ruppenhofer, C.F. Baker and C.J. FillmoreIn Braasch, Anna and Claus Povlsen (eds.), Proceedings of the Tenth Euralex International Congress. Copenhagen, Denmark. Vol. I: 359-3692002____000004 000129The FrameNet Database and Software ToolsJ. Ruppenhofer, C.F. Baker and C.J. FillmoreIn Braasch, Anna and Claus Povlsen (eds.), Proceedings of the Tenth Euralex International Congress. Copenhagen, Denmark. Vol. I: 371-3752002____000004 000130A Computationally Efficient Abstraction of Long-term PotentiationL. ShastriNeurocomputing. 44-46: 33-41.2002____000004 000131Episodic memory and cortico-hippocampal interactionsL. ShastriTrends in Cognitive Sciences, 6: 162-168200204__000004 000132Combining belief and utility in a structured connectionist agent architectureC. Wendelken and L. ShastriProceedings of Cognitive Science 2002, Fairfax, VA200208__000004 000133The developmental promise of information and communications technology in IndiaJ. PalContemporary South Asia, Volume 12, Number 1, March 2003.200303__000005 000134How Much InformationJ. Pal, P. Lyman, H. Varian, P.F. Charles, N. Good, L. Jordan and K. Swearingenhttp://www.sims.berkeley.edu/research/projects/how-much-info-2003/20030125000005http://www.sims.berkeley.edu/research/projects/how-much-info-2003/ 000135The Information Society and the Welfare State: The Finnish ModelP. Himanen and M. CastellsOxford University Press, 2002.2002____000005 000136The Hacker Ethic and the Spirit of the Information AgeP. Himanen2002____000005 000137Bringing devices to the masses: a comparative study of the Brazilian Computador Popular and the Indian SimputerJ. Pal with R. Fonsecahttp://www.sims.berkeley.edu/~joyojeet/Simputer-CP.doc200312__000005http://www.sims.berkeley.edu/~joyojeet/Simputer-CP.docThe goal of empowering underserved populations to use technology has resulted in hardware solutions worldwide that have been deployed to varying levels of success. Despite the differences in the devices and their technology, their employment, and the benefits motivating the exploration of such solutions, there have been some common threads connecting such projects. The Computador Popular was a project in Brazil to bring computers to the urban poor in Brazil; the Simputer is a mobile computing device made for poor rural populations in India,. Studying these two projects generates some underlying hypotheses about top-down approaches to technology and regional development. 000138Global Synchronization in SensornetsJ. Elson, R.M. Karp, C.H. Papadimitriou and S. ShenkerProceedings of LATIN, 609-624, 2004.2004____000003 000139Identification of Protein Complexes by Comparative Analysis of Yeast and Bacterial Protein Interaction DataR. Sharan, T. Ideker, B.P. Kelley, R. Shamir and R.M. KarpIn Proceedings of the Eighth Annual International Conference on Research in Computational Molecular Biology (RECOMB'04).2004____000003 000140Coalescing Times for IID Random VariablesI. Adler, H.S. Ahn, R.M. Karp and S.M. RossRandom Structures and Algorithms2003____000003 000141A Stochastic Process on the Hypercube with Applications to Peer to Peer NetworksM. Adler, E. Halperin, R.M. Karp and V. VaziraniProceedings of the 35th Annual ACM Symposium on Theory of Computing (STOC), 575-5842003____000003 000142Discovering Local Structure in Gene Expression Data: The Order-Preserving Submatrix Problem.A. Ben-Dor, B. Chor, R.M. Karp and Z. YakhiniJournal of Computational Biology 10(3-4), 385-3982003____000003 000143Towards Optimally Multiplexed Applications of Universal DNA Tag SystemsA. Ben-Dor, T. Hartman, B. Schwikowski, R. Sharan and Z. YakhiniProceedings of the Seventh Annual International Conference on Research in Computational Molecular Biology (RECOMB'03), ACM Press, 48-562003____000003 000144The restriction Scaffold problemA. Ben-Dor, R.M. Karp, B. Schwikowski and R. ShamirJournal of Computational Biology 10(3-4), 385-3982003____000003 000145A heuristic for the stacker crane problem on trees which is almost surely exactA. Coja-Oghlan, S.O. Krumke and T. NierhoffIn Algorithms and Computation, 14th International Sympsium, ISAAC 2003, LNCS 2906, pp. 605-614, Springer2003____000003 000146Genome-wide In-silico Determination of Transcriptional Regulation Modules Controlling Cell Cycle in Human CellsR. Elkon, C. Linhart, R. Sharan, R. Shamir and Y. ShilohGenome Research, 13, pp. 773-7802003____000003 000147Large Scale Reconstruction of Haplotypes from Genotype DataE. Eskin, E. Halperin and R.M. KarpProceedings of the 7th Conference on Research in Computational Biology (RECOMB), 104-1132003____000003 000148Efficient Reconstruction of Haplotype Structure Via Perfect PhylogenyE. Eskin, E. Halperin and R.M. KarpJournal of Bioinformatics and Computational Biology (JBCB) 1(1) 1-202003____000003 000149Stock Returns: Momentum, Volatility and Interest RatesY. Fang, S. Wada and J. MoodyIn the proceedings of Computational Intelligence in Financial Engineering, IEEE Press2003____000003 000150An Improved Approximation Algorithm for Vertex Cover with Hard CapacitiesR. Gandhi, E. Halperin, S. Khuller, G. Kortsarz and A. SrinivasanProceedings of the International Colloquium on Automata, Languages and Programming (ICALP), 164-1752003____000001http://hdl.handle.net/1903/1272 000151Scoring Clustering Solutions by their Biological RelevanceI. Gat-Viks, R. Sharan and R. ShamirBioinformatics, 19,pp 2381-23892003____000003 000152Detecting Protein sequences via Metric EmbeddingsE. Halperin, J. Buhler, R.M. Karp, R. Krauthgamer and B. WestoverProceedings of the Eleventh International Conference on Intelligent Systems for Molecular Biology (ISMB), 122-1292003____000003 000153Integrality Ratio for Group Steiner Trees and Directed Steiner TreesE. Halperin, G. Kortsarz, R. Krauthgamer, A. Srinivasan and N. WangProceedings of the Twelfth Symposium on Discrete Algorithms (SODA), 275-2842003____000003 000154Polylogarithmic InapproximabilityE. Halperin and R. KrauthgamerProceedings of the 35th Annual ACM Symposium on Theory of Computing (STOC), 585-5942003____000003 000155A Gambling Game and its Application to the Analysis of Adaptive Randomized RoundingR.M. Karp and C. KenyonSpringer-Verlag Lecture Notes in Computer Science, Volume 2764 (RANDOM 03), 329-3402003____000003 000156Load Balancing in Structured P2P SystemsR.M. Karp, A. Rao, K. Lakshminarayanan, S. Surana and I. StoicaProceedings of the Second International Workshop on Peer-to-Peer Systems2003____000003 000157A Simple Algorithm for Finding Frequent Elements in Streams and BagsR.M. Karp, S. Shenker and C.H. PapadimitriouTransactions on Database Systems2003____000003 000158Conserved pathways within Bacteria and Yeast as revealed by Global Protein Network AlignmentB.P. Kelley, R. Sharan, R.M. Karp, E.T. Sittler, D.E. Root, B.R. Stockwell and T. IdekerProceedings of the National Academy of Sciences USA, 100, pp. 11394-113992003____000003 000159Idetifying Blocks and Sub-Populations in Noisy SNP DataG. Kimmel, R. Sharan and R. ShamirProceedings of the Third Workshop on Algorithms in Bioinformatics (WABI'03), pp. 303-3192003____000003 000160Regularizers and Priors for Feed-Forward NetworksJ. Moody and T. RognvaldssonSubmitted for publication 20032003____000003 000161Detecting Excess Radical Replacements in Phylogenetic TreesT. Pupko, R. Sharan, M. Hasegawa, R. Shamir and D. GraurGene, 319, pp. 127-1352003____000003 000162Load Balancing in Structured P2P SystemsA. Rao, K. Lakshminarayanan. S. Surana, R.M. Karp and I. StoicaIn 2nd International Workshop on Peer-to-Peer Systems2003____000003 000163CLICK and EXPANDER: A System for Clustering and Visualizing Gene Expression DataR. Sharan, A. Maron-Katz and R. ShamirBioinformatics, 19, pp. 1787-17992003____000003 000164CRÈME: A Framework for Identifying Cis-Regulatory Modules in Human-Mouse Conserved SegmentsR. Sharan, I. Ovcharenko, A. Ben-Hur and R.M. KarpProceedings of the Eleventh International Conference on Intelligent Systems for Molecular Biology (ISMB'03), Bioinformatics 19, Supplement 1, pp. 1283-12912003____000003 000165LOGOS: A Modular Bayesian Model for de novo Motif DetectionE.P. Xing, W. Wu, M. Jordan and R.M. KarpProceedings of IEEE Computer Society Bioinformatic Conference IPTPS2003____000003 000166The SFRA: A Corner-Turn FPGA ArchitectureN. Weaver, J. Hauser and J. Wawrzynekto appear at the 12th ACM International Symposium on Field Programmable Gate Arrays (FPGA), February 2004.20040227000002Paper: http://www.cs.berkeley.edu/~nweaver/sfra_fpga_2004.pdf. 000167Achieving Fairness through Selective Early DroppingJ. Scott and R.M. Karppreprint 2004.2004____000002 000168Finite-Length Analysis of LT-codesR.M. Karp, M. Luby and A. ShokrollahiSubmitted to International Symposium on Information Theory (ISIT '04).2004____000002 000169Load Balancing in Dynamic Structured P2P SystemsB. Godfrey, R.M. Karp, K. Lakshminarayanan, S. Surana and I. StoicaProceedings of INFOCOMM, 20042004____000002 000170New Techniques for Making Transport Protocols Robust to Corruption-Based LossW. Eddy, S. Ostermann and M. AllmanJanuary 2004. Accepted for publication.2004____000002 000171Search Strategies in Inter-Domain Traffic EngineeringK. Chandrayana, R.M. Karp, M. Roughan, S. Sen and Y. Zhangpreprint 20042004____000002 000173A Stochastic Process on the Hypercube with Applications to Peer-to-peer NetworksM. Adler, E. Halperin, R.M. Karp and V VaziraniProceedings of the Thierty-fifth Annual ACM Symposium on Theory of Computing (STOC 2003)2003____000002 000174On the Performance of MiddleboxesM. AllmanACM SIGCOMM/Usenix Internet Measurement Conference, Miami, FL, USA200310__000002 000175An Evaluation of XML-RPCM. AllmanACM Performance Evaluation Review, 30(4)200303__000002 000176TCP Congestion Control with Appropriate Byte Counting (ABC)M. AllmanRFC 3465200302__000002 000177Estimating Loss Rates With TCPM. Allman, W. Eddy and S. OstermannACM Performance Evaluation Review, 31(3)200312__000002 000178A Conservative Selective Acknowledgment (SACK)-based Loss Recovery Algorithm for TCPE. Blanton, M. Allman, K. Fall and L. WangRFC 3517200304__000002 000179Making Gnutella-like P2P Systems ScalableY. Chawathe, S. Ratnasamy, L. Breslau and S. ShenkerIn Proceedings of ACM SIGCOMM2003____000002 000180A Comparison of RED's Byte and Packet ModesW. Eddy and M. AllmanComputer Networks, 42(2)200306__000002 000181Mechanism Design for Policy RoutingJ. Feigenbaum, R. Sami and S. ShenkerYale University Technical Report YALEU/DCS/TR-1258200311__000002 000182HighSpeed TCP for Large Congestion WindowsS. FloydRFC 3649, Experimental200312__000002 000183TCP Friendly Rate Control (TFRC): Protocol SpecificationS. FloydRFC 3446200301__000002 000184Pktd: A Packet Capture and Injection DaemonJ. Gonzalez and V. PaxsonProceedigns of Passive & Active Measurement: PAM-20032003____000002 000185DIFS: A Distributed Index for Features in Sensor NetworksB. Greenstein, D. Estrin, R. Govindan, S. Ratnasamy and S. ShenkerIn Elsevier Journal of Ad Hoc Networks, 2003.(Also in the First IEEE Workshop on Sensor Networks Protocols and Applications, SNPA, 2003)2003____000002 000186The Impact of DHT Routing Geometry on Resilience and ProximityK. Gummadi, R. Gummadi, S. Gribble, S. Ratnasamy, S. Shenker and I. StoicaIn Proceedings of ACM SIGCOMM 2003, Karlruhe, Germany200308__000002 000187Modeling Wireless Links for Transport ProtocolsA. Gurtov, and S. Floydto appear in ACM CCR200311__000002 000188TCP Friendly Rate Control (TFRC): Protocol SpecificationM. Handley, S. Floyd, J. Pahdye and J. WidmerRFC 3448, Proposed Standard200301__000002 000189A Framework for Incremental Deployment Strategies for Router-Assisted ServicesX. He, C. Papadopoulos and P. RadoslavovProceedings of the IEEE Infocom 2003.San Francisco, CA, USA200304__000002 000190Querying the Internet with PIERR. Huebsch, J.M. Hellerstein, N. Lanham, B.T. Loo, S. Shenker and I. StoicaIn Proceedings of VLDB, 2003, Berlin, Germany200309__000002 000191A Simple Algorithm for Finding Frequent Elements in Streams and BagsR.M. Karp, S. Shenker and C.H. PapadimitriouTransactions on Database Systems 28: 51-552003____000002 000192Patterns of Congestion CollapseT. Kelly, S. Floyd and S. ShenkerUnder submission200306__000002 000193Designing DCCP: Congestion Control Without ReliabilityE. Kohler, M. Handley and S. Floyd?200305__000002 000194On the Convergence of Statistical Techniques for Estimating Network Traffic DemandsA. Medina, K. Salamatian, N. Taft, I. Matta, Y. Tsang and C. DiotSubmitted for publication 2003200302__000002 000195Inside the Slammer WormD. Moore, V. Paxson, S. Savage, C. Shannon, S. Staniford and N. WeaverIEEE Magazine of Security and Privacy200308__000002 000196The Spread of the Sapphire/Slammer WormD. Moore, V. Paxson, S. Savage, C. Shannon, S. Staniford and N. WeaverTechnical Report200302__000002 000197A High-level Programming Environment for Packet Trace Anonymization and TransformationR. Pang and V. PaxsonProceedings of ACM SIGCOMM 2003200308__000002 000198On Selfish Routing in Internet-Like EnvironmentsL. Qiu, Y.R. Yang, Y. Zhang and S. ShenkerIn Proceedings of ACM SIGCOMM 2003, Karlsruhe, Germany200308__000002 000199Load Balancing in Structured P2P SystemsA. Rao, K. Lakshminaryanan, S. Surana, R.M. Karp and I. StoicaProceedings of the Second International Workshop on Peer-to-Peer Systems2003____000002 000200Geographic Routing without Location InformationA. Rao, S. Ratnasamy, C. Papadimitriou, S. Shenker and I. StoicaIn Proceedings of ACM MOBICOMM 2003, San Diego, CA200309__000002 000201Range Queries over DHTsS. Ratnasamy, J.M. Hellerstein and S. ShenkerIntel Research Technical Report, IRB-TR-03-01120030625000002 000202Data-Centric Storage in Sensornets with GHT, a Geographic Hash TableS. Ratnasamy, B. Karp, S. Shenker, D. Estrin, R. Gvoindan, L. Yin and F. YuACM MONET2003____000002 000203Active Mapping: Resisting NIDS Evasion Without Altering TrafficU. Shankar and V. PaxsonProceedings of IEEE Symposium on Security and Privacy200305__000002 000204Enhancing Byte-Level Network Intrusion Detection Signatures with ContextR. Sommer and V. PaxsonProceedings of ACM CCS2003____000002 000205Core-Stateless Fair Queueing: a Scalable Architecture to Approximate Fair Bandwidth Allocations in High-speed NetworksI. Stoica, S. Shenker and H. ZhangIEEE/ACM Transactions on Networking 11(1): 33-46. (A previous version appeared in Proceedings of SIGCOMM'98, pp. 118-130, Vancouver, Canada, August 1998.)2003____000002 000206Post Placement C-slow Retiming for the Xilinx Virtex FPGAN. Weaver, Y. Markovskiy, Y. Patel and J. Wawrzynek11th ACM Symposium of Field Programmable Gate Arrays (FPGA)2003____000002http://www.cs.berkeley.edu/~nweaver/papers/Cslow_Retiming_Virtex.pdf 000207A Taxonomy of Computer WormsN. Weaver, V. Paxson, S. Staniford and R. CunninghamProceedings of the ACM CCS First Workshop on Rapid Malcode (WORM)200310__000002http://www.cs.berkeley.edu/~nweaver/papers/taxonomy.pdf 000208Large Scale Malicious Code: A Research AgendaN. Weaver, V. Paxson, S. Staniford and R. CunninghamDARPA-sponsored report2003____000002 000209RR-TCP: A Reordering-Robust TCP with DSACKM. Zhang, B. Karp, S. Floyd and L. PetersonICSI Technical Report TR-02-006, Berkeley, CA, July 2002. Also in 11th IEEE International Conference on Network Protocols (ICNP'03)200311__000002 000210Host Mobility Using and Internet Indirection InfrastructureS. Zhuang, K. Lai, I. Stoica, R. Katz and S. ShenkerIn Proceedings of MobiSys '03, pp. 129-144, San Francisco, CA200305__000002 000211A Robust Speaker Clustering AlgorithmJ. Ajmera and C. WootersProceedings of IEEE Speech Recognition and Understanding Workshop, St. Thomas, U.S. Virgin Islands200312__000001http://www.icsi.berkeley.edu/ftp/global/pub/speech/papers/asru03-ajmewoot.pdf 000212Automatically Generated Prosodic Cues to Lexically Ambiguous Dialog Acts in Multiparty MeetingsS. Bhagat, H. Carvey and E. ShribergProceedings of ICPhS 2003, Barcelona200308__000001http://www.icsi.berkeley.edu/ftp/global/pub/speech/papers/icphs03-bhagat.pdf 000213Learning Discriminative Temporal Patterns in Speech: Development of Novel TRAPS-Like ClassifiersB. Chen, S. Chang and S. SivadasProceedings of EUROSPEECH 2003, Geneva200309__000001http://www.icsi.berkeley.edu/ftp/global/pub/speech/papers/byc_eur03.pdf 000214Far-field ASR on Inexpensive MicrophonesL. Docio-Fernandez, D. Gelbart and N. MorganProceedings of EUROSPEECH 2003, Geneva200309__000001http://www.icsi.berkeley.edu/ftp/global/pub/speech/papers/ldocio_eur03.pdf 000215Automatic Speech RecognitionH. Hermansky and N. MorganIn Encyclopedia of Cognitive Science, Nature Publishing Group, London2003____000001 000216Data-Driven Speaker and Subword Unit Clustering in Speech ProcessingM. HerschEPFL Diploma Thesis, ICSI200303__000001http://www.icsi.berkeley.edu/ftp/global/pub/speech/papers/thesis03-mhersch.pdf 000217Detection Of Agreement vs. Disagreement In Meetings: Training With Unlabeled DataD. Hillard, M. Ostendorf and E. ShribergProceedings of HLT-NAACL Conference, Edmonton, Canada200305__000001http://www.icsi.berkeley.edu/ftp/global/pub/speech/papers/hlt03-agree-disagree.pdf 000218The ICSI Meeting CorpusA. Janin, D. Baron, J. Edwards, D. Ellis, D. Gelbart, N. Morgan, B. Peskin, T. Pfau, E. Shriberg, A. Stolcke and C. WootersProceedings of ICASSP-2003, Hong Kong200304__000001http://www.icsi.berkeley.edu/ftp/global/pub/speech/papers/icassp03-janin.pdf 000219Word Fragments Identification Using Acoustic-Prosodic Features in Conversational SpeechY. LiuProceedings of HLT/NAACL, Student Session, Edmonton, Alberta2003____000001 000220Automatic disfluency identification in conversational speech using multiple knowledge sourcesY. Liu, E. Shriberg and A. StolckeProceedings of EUROSPEECH 2003, Geneva200309__000001http://www.icsi.berkeley.edu/ftp/global/pub/speech/papers/eurospeech2003-dfdetect.pdf 000221Meetings about meetings: research at ICSI on speech in multiparty conversationsN. Morgan, D. Baron, S. Bhagat, H. Carvey, R. Dhillon, J. Edwards, D. Gelbart, A. Janin, A. Krupski, B. Peskin, T. Pfau, E. Shriberg, A. Stolcke and C. WootersProceedings of ICASSP-2003, Hong Kong200304__000001http://www.icsi.berkeley.edu/ftp/global/pub/speech/papers/icassp03meetings.pdf 000222Scaling Up: Learning Large-scale Recognition Methods from Small-scale Recognition TasksN. Morgan, B. Chen, Q. Zhu and A. StolckeICSI Technical Report tr-03-02. Also Special Workshop in Maui(SWIM) paper 218.2004____000001http://www.icsi.berkeley.edu/ftp/global/pub/speech/papers/swim04-scale.pdf 000223Using prosodic and conversational features for high-performance speaker recognition: Report from JHU WS'02.B. Peskin, J. Navratil, J. Abramson, D. Jones, D. Klusacek, D. Reynolds and B. XiangProceedings of ICASSP-2003, Hong Kong200304__000001http://www.icsi.berkeley.edu/ftp/global/pub/speech/papers/icassp03-peskin2.pdf 000224Audio Information Access from Meeting RoomsS. Renals and D. EllisIn Proceedings of ICASSP-2003. Hong Kong200304__000001 000225The SuperSID Project: Exploiting high-level information for high-accuracy speaker recognitionD. Reynolds, W. Andrews, J. Campbell, J. Navratil, B. Peskin, A. Adami, Q. Jin, D. Klusacek, J. Abramson, R. Mihaescu, J. Godfrey, D. Jones and B. XiangProceedings of ICASSP-2003, Hong Kong200304__000001http://www.icsi.berkeley.edu/ftp/global/pub/speech/papers/icassp03-peskin1.pdf 000226Experiments With Linear And Nonlinear Feature Transformations In HMM Based Phone RecognitionP. SomervuoProceedings of ICASSP-2003, Hong Kong200304__000001http://www.icsi.berkeley.edu/ftp/global/pub/speech/papers/icassp03-somervuo.pdf 000227Feature Transformations and Combinations for Improving ASR PerformanceP. Somervuo, B. Chen and Q. ZhuProceedings of EUROSPEECH 2003, Geneva200309__000001http://www.icsi.berkeley.edu/ftp/global/pub/speech/papers/panus_eur03.pdf 000228The Relationship Between Dialogue Acts and Hot Spots in MeetingsB. Wrede and E. ShribergProceedings of IEEE Speech Recognition and Understanding Workshop, St. Thomas, U.S. Virgin Islands200312__000001http://www.icsi.berkeley.edu/ftp/global/pub/speech/papers/asru03-hotspotsDAs.pdf 000229SpottingB. Wrede and E. ShribergProceedings of EUROSPEECH 2003, Geneva200309__000001http://www.icsi.berkeley.edu/ftp/global/pub/speech/papers/bwrede_eur03.pdf 000230Multimodal Model Integration for Sentence Unit DetectionL. Chen, Y. Liu, M. Harper and E. ShribergTo appear in 6th International Conference on Multimodal Interfaces, October 20042004____000001 000231Meeting Recorder Project: Dialog Act Labeling GuideR. Dhillon, S. Bhagat, H. Carvey and E. ShribergICSI Technical Report TR-04-0022004____000001http://www.icsi.berkeley.edu/ftp/global/pub/speech/papers/MRDA-manual.pdf 000232Using Machine Learning to Cope with Imbalanced Classes in Natural Speech: Evidence from Sentence Boundary and Disfluency DetectionY. Liu, E. Shriberg, A. Stolcke and M. HarperTo appear in Proceedings of International Conference on Spoken Language Processing, Jeju, Korea, October 2004.2004____000001http://www.icsi.berkeley.edu/ftp/global/pub/speech/papers/icslp2004-mde-ml.pdf 000233From Switchboard to Meetings: Development of the 2004 ICSI-SRI-UW Meeting Recognition SystemN. Mirghafori, A. Stolcke, C. Wooter, T. Pirinen, I. Bulyko, D. Gelbart, M. Graciarena, S. Otterson, B. Peskin and M. OstendorfTo appear in Proceedings of International Conference on Spoken Language Processing, Jeju, Korea, October 2004.200410__000001http://www.icsi.berkeley.edu/ftp/global/pub/speech/papers/icslp2004-meeting-system.pdf 000234Selfish Behavior and Stability of the Internet: A Gamer-Theoretic Analysis of TCPA. Akella, R.M. Karp, C. Papadimitriou, S. Seshan and S. ShenkerIn Proceedings ACM SIGCOMM 20022002____000002 000235A Scalable System for Sharing Internet MeasurementsM. Allman, E. Blanton and W. EddyProceedings of the Passive and Active Measurement Workshop, March 2002200203__000002 000236Increasing TCP's Initial WindowM. Allman, S. Floyd and C. PartridgeRFC 3390, October 2002. Proposed Standard. Obsoletes RFC 2414 (Experimental).200210__000002 000237Design Guidelines for Robust Internet ProtocolsT. Anderson, S. Shenker, I. Stoica and D. WetherallIn First Workshop on Hot Topics in Networks (HotNets-I), October 2002.200210__000002 000238Secure mobile agent systems using Java - where are we heading?W. Binder and V. RothIn Proceedings, 17th ACM Symposium on Applied Computing, Special Track on Agents, Interactions, Mobility, and Systems (SAC/AIMS), Madrid, Spain, March 2002.200203__000002 000239On Making TCP More Robust to Packet ReorderingE. Blanton and M. AllmanACM Computer Communication Review, 32(1), January 2002.200201__000002 000240From Protocol Stack to Protocol Heap - Role-Based ArchitectureB. Braden, T. Faber and M. HandleyFirst Workshop on Hot Topics in Networking, October 2002.200210__000002 000241Towards capturing Representative AS-level Internet TopologiesH. Chang, R. Govindan, S. Jamin, S. Shenker and W. WillingerSIGMETRICS 2002: 280-281.2002____000002 000242The Origin of Power-Laws in Internet Topologies RevisitedQ. Chen, H. Chang, R. Govindan, S. Jamin, S. Shenker and W. WillingerINFOCOM, 2002.2002____000002 000243Replication Strategies in Unstructured Peer-to-Peer NetworksE. Cohen and S. ShenkerIn Proceedings ACM SIGCOMM 2002.2002____000002 000244Multiscale Stepping-Stone Detection: Detecting Pairs of Jittered Interactive Streams by Exploiting Maximum Tolerable DelayD. Donoho, A. G. Flesia, U. Shankar, V. Paxson, J. Coit and S. StanifordProceedings of RAID, 2002.2002____000002 000245A Comparison of RED's Byte and Packet ModesW. Eddy and M. AllmanACM SIGCOMM Student Poster Session, August 2002.200208__000002 000246Hardness Results for Multicast Cost SharingJ. Feigenbaum, A. Krishnamurthy, R. Sami and S. ShenkerFoundations of Software Technology and Theoretical Computer Science: 133-144, 2002.2002____000002 000247A BGP-based Mechanism for Lowest-cost RoutingJ. Feigenbaum, C.H. Papadimitriou, R. Sami and S. ShenkerPrinciples of Distributed Computing 2002: 173-182.2002____000002 000248General Architectural and Policy ConsiderationsS. Floyd, editorRFC 3426, November 2002.200211__000002 000249Inappropriate TCP Resets Considered HarmfulS. FloydRFC 3360, August 2002.200208__000002 000250IAB Architectural and Policy Considerations for Open Pluggable Edge ServicesS. Floyd and L. Daigle, editorsRFC 3238, Informational, January 2002. Also an excerpt, the ISOC Member Briefing #5 on OPES.200201__000002 000251Internet Research Needs Better ModelsS. Floyd and E. Kohler.Hotnets-I, October 2002.200210__000002 000252Estimating Router ICMP Generation DelaysR. Govindan and V. PaxsonProceedings of Passive & Active Measurement: PAM-2002.2002____000002 000254An Analysis of The Internal Structure of Large Autonomous SystemsR. Govindan and P. RadoslavovTechnical Report 02-777, Computer Science Department, University of Southern California, November 2002.200211__000002 000255XORP: An Open Platform for Network ResearchM. Handley, O. Hodson and E. KohlerFirst Workshop on Hot Topics in Networking, October 2002.200210__000002 000256Complex Queries in DHT-based Peer-to-Peer NetworksM. Harren, J.M. Hellerstein, R. Huebsch, B.T. Loo, S. Shenker and I. StoicaThe 1st International Workshop on Peer-to-Peer Systems (IPTPS): 242-259, 2002.2002____000002 000257Internet Congestion Control for High Bandwidth-Delay Product EnvironmentsD. Katabi, M. Handley and C. RohrsProceedings of ACM SIGCOMM 2002.2002____000002 000258Observed Structure of Addresses in IP TrafficE. Kohler, J. Li, V. Paxson and S. ShenkerProceedings of ACM SIGCOMM Internet Measurement Workshop, November 2002. Also in Proceedings of the 2nd Internet Measurement Workshop (IMW) 2002.200211__000002 000259Programming language optimizations for modular router configurationsE. Kohler, R. Morris and B. ChenProceedings of the 10th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS-X), San Jose, California, October 2002, pages 251-263.200210__000002 000260Modular components for network address translationE. Kohler, R. Morris and M. PolettoProceedings of the 5th International Conference on Open Architectures and Network Programming (OPENARCH '02), New York, New York, June 2002, pages 39-50.Proceedings of the 5th International Conference on Open Architectures and Network Programming (OPENARCH '02), New York, New York, June 2002, pages 39-50.200206__000002 000261Explicit Transport Error Notification (ETEN) for Error-Prone Wireless and Satellite Networks - SummaryR. Krishnan, M. Allman, C. Partridge, J.P.G. Sterbenz and W. IvancicEarth Science Technology Conference, Pasadena, CA, June 2002.200206__000002 000262Explicit Transport Error Notification (ETEN) for Error-Prone Wireless and Satellite NetworksR. Krishnan, M. Allman, C. Partridge and J.P.G. SterbenzTechnical Report No. 8333, BBN Technologies, March 2002.200203__000002 000263Search and Replication in Unstructured Peer-to-peer NetworksQ. Lv, P. Cao, E. Cohen, K. Li and S. ShenkerSIGMETRICS 2002: 258-259.2002____000002 000264Can Heterogeneity Make Gnutella Scalable?Q. Lv, S. Ratnasamy and S. ShenkerThe 1st International Workshop on Peer-to-Peer Systems (IPTPS): 94-103, 2002.2002____000002 000265Controlling High Bandwidth Aggregates in the NetworkR. Mahajan, S. Bellovin, S. Floyd, J. Ioannidis, V. Paxson and S. ShenkerComputer Communication Review 32(3), July 2002.200207__000002 000266Route Flap Dampening Exacerbates Internet Routing ConvergenceZ.M. Mao, R. Govindan, G. Varghese and R. KatzIn Proceedings of the ACM SIGCOMM 2002.2002____000002 000267A Taxonomy of Traffic MatricesA. Medina, C. Fraleigh, N. Taft, S. Battacharyya and C. DiotSPIE Workshop on Scalability and Traffic Control in IP Networks II. Boston, MA. July 29-August 1, 2002.200207__000002 000268Traffic Matrix Estimation Techniques: Existing Techniques Compared and New DirectionsA. Medina, N. Taft, K.Salamatian, S. Bhattacharyyam and C. DiotTo appear on proceedings of SIGCOMM 2002, Pittsburgh, PA. August 2002.200208__000002 000269Comparative Analysis of Traffic Matrix Estimation MethodsA. Medina, N. Taft, K. Salamatian, S. Bhattacharyya and C. DiotDIMACS Workshop on Internet and WWW Measurement, Mapping and Modeling. February 13-15 2002.20020213000002 000270A Flow Table-Based Design to Approximate FairnessR. Pan, L. Breslau, B. Prabhakar and S. ShenkerIn Hot Interconnects: 10th Symposium on High Performance Interconnects (Hot-I) 2002.2002____000002 000271The Internet Under Crisis Conditions: Learning from September 11C. Partridge, P. Barford, D. Clark, S. Donelan, V. Paxson, J. Rexford, M. Vernon, J. Eisenberg, M. Blumenthal, D. Padgham, K. Batch, D. Drake and J. BriscoeComputer Science and Telecommunications Board, National Research Council, National Academy Press, Washington, DC, 2002.2002____000002 000272A Swifter Start for TCPC. Partridge, D. Rockwell, M. Allman, R. Krishnan and J.P.G. SterbenzTechnical Report No. 8339, BBN Technologies, March 2002200203__000002 000273Mobile Agent Interoperability Patterns and PracticeU. Pinsdorf and V. RothIn Proceedings 9th Annual IEEE International Conference and Workshops on the Engineering of Computer-Based Systems, Lund, Sweden, April 2002.200204__000002 000274Topologically-Aware Overlay Construction and Server SelectionS. Ratnasamy, M. Handley, R.M. Karp and S. ShenkerINFOCOM, 2002.2002____000002 000275GHT: A Geographic Hash-table for Data-centric Storage in SensornetsS. Ratnasamy, B. Karp, L. Yin, F. Yu, D. Estrin, R. Govindan and S. ShenkerIn First ACM International Workshop on Wireless Sensor Networks and Applications (WSNA), September 2002.200209__000002 000276Routing Algorithms for DHTs: Some Open QuestionsS. Ratnasamy, I. Stoica and S. ShenkerThe 1st International Workshop on Peer-to-Peer Systems (IPTPS): 45-52, 2002.2002____000002 000277Java Security Architecture and ExtensionsV. RothDr. Dobbs Journal, 2002(335), April 2002.200204__000002 000278Empowering mobile software agentsV. RothIn N. Suri, editor, Proceedings 6th IEEE Mobile Agents Conference, volume 2535 of Lecture Notes in Computer Science, pages 47-63. Springer Verlag, October 2002.200210__000002 000279Improved key management for digital watermark monitoringV. Roth and M. ArnoldProceedings of SPIE, San Jose, CA, January 2002.200201__000002 000280Data-centric Storage in SensornetsS. Shenker, S. Ratnasamy, B. Karp, R. Govindan and D. EstrinIn Workshop Record of the First Workshop on Hot Topics in Networks (HotNets-I), October 2002.200210__000002 000281How to Own the Internet in Your Spare TimeS. Staniford, V. Paxson and N. WeaverIn Proceedings of the 11th USENIX Security Symposium, August 2002.200208__000002Paper: http://www.cs.Berkeley.edu/~nweaver/cdc.web/ 000282Internet Indirection InfrastructureI. Stoica, D. Adkins, S. Ratnasamy, S. Shenker, S. Surana and S. ZhuangThe 1st International Workshop on Peer-to-Peer Systems (IPTPS): 191-202, 2002.2002____000002 000283Internet Indirection InfrastructureI. Stoica, D. Adkins, S. Zhuang, S. Shenker and S. SuranaIn Proceedings, ACM SIGCOMM, August 2002200208__000002 000284Routing Algorithms in DHTs: Some Open QuestionsI. Stoica, D. Adkins, S. Ratnasamy, S. Surana, S. Shenker and S. ZhuangIn Proceedings of the First International Workshop on Peer-to-Peer Systems (IPTPS) 2002, March 2002.200203__000002 000285Self-Verifying CSFQI. Stoica, H. Zhang and S. ShenkerINFOCOM, 2002.2002____000002 000286Network Topology Generators: Degree-Based vs. StructuralH. Tangmunarunkit, R. Govindan, S. Jamin, S. Shenker and W. WillingerIn Proceedings ACM SIGCOMM 2002.2002____000002 000287Scaling phenomena in the Internet: Critically examining criticalityW. Willinger, R. Govindan, S. Jamin, V. Paxson and S. ShenkerProceedings of Natl. Acad. Sci. USA, Vol. 99, Suppl. 1, 2573-2580, February 19, 2002.20020219000002 000288Long-range dependence and data network trafficW. Willinger, V. Paxson, R.H. Riedi and M.S. TaqquLong-range Dependence: Theory and Applications, P. Doukhan, G. Oppenheim and M. S. Taqqu, eds., Birkhauser, 2002.2002____000002 000289RR-TCP: A Reordering-Bobust TCP with DSACKM. Zhang, B. Karp, S. Floyd and L. PetersonICSI Technical Report TR-02-006, Berkeley, CA, July 2002.200207__000002 000290On the Characteristics and Origins of Internet Flow RatesY. Zhang, L. Breslau, V. Paxson and S. ShenkerProceedings of ACM SIGCOMM, August 2002.2002____000002 000291Measuring End-to-End Bulk Transfer CapacityM. AllmanACM SIGCOMM Internet Measurement Workshop. November 2001.200111__000002 000292Enhancing TCP's Loss Recovery Using Limited TransmitM. Allman, H. Balakrishnan and S. FloydRFC 3042, Proposed Standard, January 2001200101__000002 000293On Estimating End-to-End Network Path PropertiesM. Allman and V. PaxsonACM SIGCOMM Workshop on Data Communication in Latin American and the Caribbean, April 2001. This paper was originally presented at ACM SIGCOMM 1999.200104__000002 000294Dynamic Behavior of Slowly-Responsive Congestion Control AlgorithmsD. Bansal, H. Balakrishnan, S. Floyd and S. ShenkerSIGCOMM 20012001____000002 000295Dissemination of mutable sets of web objectsS. Buchholz, S. Gobel, A. Schill and T. Ziegert13th IASTED International Conference on Parallel and Distributed Computing Systems, 2001.2001____000002 000296Inferring link loss using striped unicast probesN. Duffield, F. Lo Presti, V. Paxson, and D. TowsleyProceedings of IEEE INFOCOM, April 2001.200104__000002 000297Approximation and collusion in multicast cost sharingJ. Feigenbaum, A. Krishnamurthy, R. Sami and S. ShenkerAbstratc in Proceedings of 3rd Conference on Electronic Commerce, 2001.2001____000002 000298Sharing the cost of multicast transmissionsJ. Feigenbaum, C. Papadimitriou and S. ShenkerJournal of Computer and System Sciences 63: 21-41, 2001.2001____000002 000299A Report on Some Recent Developments in TCP Congestion ControlS. FloydIEEE Communications Magazine, April 2001.200104__000002 000300Simulation is CrucialS. FloydSidebar, IEEE Spectrum, January 2001.200101__000002 000301Adaptive RED: An Algorithm for Increasing the Robustness of RED's Active Queue ManagementS. Floyd, R. Gummadi and S. ShenkerAugust 1, 2001. Simulation scripts. We have not yet finished the longer technical report.200108__000002 000302Difficulties in Simulating the InternetS. Floyd and V. PaxsonIEEE/ACM Transactions on Networking, Vol.9, No.4, pp. 392-403, August 2001. (An earlier version appeared in Proceedings of the 1997 Winter Simulation Conference, December 1997).200108__000002 000303IPNL: A NAT-extended Internet architectureP. Francis and R. GummadiSIGCOMM 20012001____000002 000304Network Intrusion Detection: Evasion, Traffic Normalization, and End-to-End Protocol Semantics (HTML)M. Handley, C. Kreibich and V. PaxsonProceedings of USENIX Security Symposium 2001.2001____000002 000305A Comparison of Incremental Deployment Strategies for Router-Assisted Reliable MulticastX. He, C. Papadopoulos, P. Radoslavov and R. GovindanTechnical Report 01-751, Computer Science Department, University of Southern California, July 2001.200107__000002 000306Building efficient wireless sensor networks with low-level namingJ. Heidemann, F. Silva, C. Intanagonwiwat, R. Govindan, D. Estrin and D. GanesanProceedings of the Symposium on Operating Systems Principles, Banff, Canada, October 2001.200110__000002 000307J. Ishac and M. AllmanOn the Performance of TCP Spoofing in Satellite NetworksIEEE Milcom. October 2001.200110__000002 000308Experimentation and Modeling of HTTP Over Satellite ChannelsH. Kruse, M. Allman, J. Griner and D. TranInternational Journal of Satellite Communications, 19(1), January/February 2001.200101__000002 000309Controlling High-Bandwidth Flows at the Congested RouterR. Mahajan and S. FloydICSI Tech Report TR-01-001, April 2001. (A revised version appeared in ICNP 2001.)200104__000002 000310Controlling high-bandwidth flows at the congested routerR. Mahajan, S. Floyd and D. WetherallICNP 20012001____000002 000311A Framework for Defining Empirical Bulk Transfer Capacity MetricsM. Mathis and M. AllmanRFC 3148. July 2001.200107__000002 000312BRITE: An Approach to Universal Topology GenerationA. Medina, A. Lakhina, I. Matta and J. ByersIn Proceedings of MASCOTS 2001. IEEE Computer Society, August 2001.200108__000002 000313BRITE: Universal Topology Generation from a User's PerspectiveA. Medina, A. Lakhina, I. Matta and J. ByersBU-CS-TR-2001-003. April 05, 2001.20010405000002 000314Identifying the TCP Behavior of Web ServersJ. Padhye and S. FloydSIGCOMM 2001, August 2001. Or the ICSI Technical Report 01-002, February 2001.2001____000002 000315An Analysis of Using Reflectors for Distributed Denial-of-Service AttacksV. PaxsonComputer Communication Review 31(3), July 2001.200107__000002 000316The Relationship Between Topology and Protocol Performance: Case StudiesP. RadoslavovPhD Dissertation, Computer Science Department, University of Southern California, December 2001.200112__000002 000317Topology-Informed Internet Replica PlacementP. Radoslavov, R. Govindan and D. EstrinProceedings of the Sixth International Workshop on Web Caching and Content Distribution, Boston University, Boston, Massachusetts, USA, June 20-22, 2001.20010620000002 000318A Comparison of Application-Level and Router-Assisted Hierarchical Schemes for Reliable MulticastP. Radoslavov, C. Papadopoulos, R. Govindan and D. EstrinProceedings of the IEEE Infocom 2001, Anchorage, Alaska, USA, April 22-26, 2001.20010422000002 000319The Addition of Explicit Congestion Notification (ECN) to IPK.K. Ramakrishnan, S. Floyd and D. BlackRFC 3168, Proposed Standard, September 2001.200109__000002 000320A Scalable Content-Addressable NetworkS. Ratnasamy, P. Francis, M. Handley, R.M. Karp and S. ShenkerIn proceedings of ACM SIGCOMM 20012001____000002 000321S. Ratnasamy, M. Handley, R.M. Karp and S. ShenkerApplication-level Multicast using Content-Addressable NetworksIn Proceedings of 3rd International Workshop on Networked Group Communication, London, Nov 2001.200111__000002 000322Adaptive power control for ad-hoc networksM. Sanchez5th International Conference on Sytemics Cybernetics and Informatics, SCI 2001.2001____000002 000323Petri net based performance evaluation of USAIA's bandwidth partitioning for the wireless cell levelJ. Sokol and D. TutschPNPN Petri nets and Performance Models, September 2001.200109__000002 000324Does AS size determine degree in AS topology?H. Tangmunarunkit, J. Doyle, R. Govindan, S. Jamin, S. Shenker and W. WillingerACM Computer Communication Review, October 2001.200110__000002 000325Internet pat inflation due to policy routingH. Tangmunarunkit, R. Govindan and S. ShenkerIn Proceedings of SPIE ITCom 2001, Denver, CO. 19-24. August 2001.200108__000002 000326The Impact of Routing Policy on Internet PathsH. Tangmunarunkit, R. Govindan, S. Shenker and D. EstrinINFOCOM 2001: 736-742.2001____000002 000327Reliable Multicast Transport Building Blocks for One-to-Many Bulk-Data TransferB. Whetten, L. Vicisano, R. Kermode, M. Handley, S. Floyd and M. LubyRFC 3048, Informational, January 2001.200101__000002 000328Extending Equation-Based Congestion Control to Multicast ApplicationsJ. Widmer and M. HandleyProceedings of ACM SIGCOMM 2001.2001____000002 000329On the Constancy of Internet Path PropertiesY. Zhang, N. Duffield, V. Paxson and S. ShenkerProceedings of ACM SIGCOMM Internet Measurement Workshop, November 2001.200111__000002 000330The Use of End-to-end Multicast Measurements for Characterizing Internal Network BehaviorA. Adams, T. Bu, R. Caceres, N. Duffield, T. Friedman, J. Horowitz, F. Lo Presti, S.B. Moon, V. Paxson and D. TowsleyIEEE Communications, 38(5), May 2000.200004__000002 000331Parallel Sorting with Limited BandwidthM. Adler, J. Byers and R.M. KarpSIAM Journal of Computing 29(6): 1997-2015, 2000.2000____000002 000332A Web Server's View of the Transport LayerM. AllmanACM Computer Communication Review, 30(5), October 2000.200010__000002 000333Ongoing TCP Research Related to SatellitesM. Allman, S. Dawkins, D. Glover, J. Griner, D. Tran, T. Henderson, J. Heidemann, J. Touch, H. Kruse, S. Ostermann, K. Scott and J. SemkeFebruary 2000. RFC 2760.200002__000002 000334TCP Behavior in Networks with Dynamic Propagation DelayM. Allman, J. Griner and A. RichardProceedings of Globecom 2000, November 2000.200011__000002 000335A History of the Improvement of Internet Protocols Over Satellites Using ACTSM. Allman, H. Kruse and S. OstermannInvited paper for ACTS Conference 2000, May 2000.200005__000002 000336FTP Extensions for Variable Protocol SpecificationM. Allman and S. OstermannTechnical Report CR-209414, NASA Glenn Research Center, February 2000.200002__000002 000337Universal DNA Tag Systems: A Combinatorial Design SchemeA. Ben-Dor, R.M. Karp, B. Schwikowski and Z. YakhiniRECOMB, 2000.2000____000002 000338IANA Allocation Guidelines For Values In the Internet Protocol and Related HeadersS. Bradner and V. PaxsonRFC 2780, Best Current Practice, March 2000.200003__000002 000339Advances in Network SimulationL. Breslau, et al.IEEE Computer, May 2000, pp. 59-67. Or the technical report: Technical Report 99-702, University of Southern California, March 1999.200005__000002 000340Comments on the Performance of Measurement-Based Admission Control AlgorithmsL. Breslau, S. Jamin and S. ShenkerINFOCOM 2000, Tel Aviv, Israel, March 2000.200003__000002 000341Endpoint Admission Control: Architectural Issues and PerformanceL. Breslau, E.W. Knightly, S. Shenker, I. Stoica and H. ZhangACM SIGCOMM 2000, pages 57-69, Stockholm, Sweeden, October 2000.200010__000002 000342An Optimal Algorithm for Monte-Carlo EstimationP. Dagum, R.M. Karp, M. Luby and S. RossSIAM Journal of Computing, 29(5):1484-1496, 2000.2000____000002 000343Advantages of Parallel Processing and the Effects of Communications TimeW. Eddy and M. AllmanTechnical Report CR-209455, NASA Glenn Research Center, February 2000.200002__000002 000344Network Visualization with the Nam, VINT Network AnimatorD. Estrin, M. Handley, J. Heidemann, S. McCanne, Y. Xu and H. YuIEEE Computer, 33(11), 63-68, November 2000.200011__000002 000345Sharing the Cost of Multicast TransmissionsJ. Feigenbaum, C. Papadimitriou and S. ShenkerThirty-Second Annual ACM Symposium on Theory of Computing (STOC00), May 2000.200005__000002 000346Congestion Control PrinciplesS. FloydRFC 2914, Best Current Practice, September 2000.200009__000002 000347Equation-Based Congestion Control for Unicast ApplicationsS. Floyd, M. Handley, J. Padhye and J. WidmerIn Proceeding of ACM SIGCOMM 2000.2000____000002 000348An Extension to the Selective Acknowledgement (SACK) Option for TCPS. Floyd, J. Mahdavi, M. Mathis and M. PodolskyRFC 2883, Proposed Standard, July 2000.200007__000002 000349TCP Congestion Window ValidationM. Handley, S. Floyd and J. PadhyeRFC 2861, Experimental, June 2000.200006__000002 000350The Reliable Multicast Design Space for Bulk Data TransferM. Handley, S, Floyd, B. Whetten, R. Kermode, L. Vicisano and M. LubyRFC 2887, Informational, August 2000.200008__000002 000351TCP Congestion Window ValidationM. Handley, J. Padhye and S. FloydRFC 2861, Experimental, June 2000. Or the technical report: UMass CMPSCI Technical Report 99-77, September 1999.200006__000002 000352Session Announcement ProtocolM. Handley, C. Perkins, E. WhelanRFC 2974, Experimental, October 2000.200210__000002 000353Multicast-Scope Zone Announcement Protocol (MZAP)M. Handley, D. Thaler and R. KermodeRFC 2776, Proposed Standard, February 2000.200002__000002 000354Discovery of Rgulatory Interactions Through Perturbation: Inference and Experimental DesignT.E. Ideker, V. Thorsson and R.M. KarpPacific Symp. Biocomputing, 2000.2000____000002 000355GPSR: Greedy Perimeter Stateless Routing for Wireless networksB. Karp and H.T. KungMOBICOM, 2000.2001____000002 000356Optimization Problems in Congestion ControlR.M. Karp, E. Koutsoupias, C.H. Papadimitriou and S. ShenkerFOCS 2000: 66-74.2000____000002 000357Randomized Rumor SpreadingR.M. Karp, C. Schindelhauer, S. Shenker and B. VockingFOCS 2000, 565-574.2000____000002 000358Algorithms for Optical MappingR.M. Karp and R. ShamirJournal of Computational Biology 7(1,2) 2000.2000____000002 000359The Click modular routerE. KohlerPh.D. thesis, Massachusetts Institute of Technology, November 2000.200011__000002 000360Programming language techniques for modular router configurationsE. Kohler, B. Chen, M.F. Kaashoek, R. Morris and M. PolettoMIT Laboratory for Computer Science technical report MIT-LCS-TR-812, August 2000.200008__000002 000361The Click modular routerE. Kohler, R. Morris, B. Chen, J. Jannotti and M.F. KaashoekACM Transactions on Computer Systems 18 (3), August 2000, pages 263-297.200008__000002 000362On the Performance of TCP-based Data Transfers on a Faded Ka-Band Satellite LinkH. Kruse, S. Ostermann and M. Allman. Proceedings of the 6th Ka-Band Utilization Conference, June 2000.200006__000002 000363On the Origin of Power Laws in Internet TopologiesA. Medina, I. Matta and J. ByersACM Computer Communications Review, April 2000. Also BU-CS-TR-2000-004. January 21, 2000.200004__000002 000364BRITE: A Flexible Generator of Internet TopologiesA. Medina and I. MattaBU-CS-TR-2000-005. January 21, 2000.20000121000002 000365Experiences with NIMIV. Paxson, A. Adams and M. MathisProceedings of Passive & Active Measurement: PAM-2000.2000____000002 000366Computing TCP's Retransmission TimerV. Paxson and M. AllmanRFC 2988, Proposed Standard, November 2000.200011__000002 000367The Multicast Address-Set Claim (MASC) ProtocolP. Radoslavov, D. Estrin, R. Govindan, M. Handley, S. Kumar and D. ThalerRFC 2909, Experimental, September 2000.200009__000002 000368On Characterizing Network Topologies and Analyzing Their Impact on Protocol DesignP. Radoslavov, H. Tangmunarunkit, H. Yu, R. Govindan, S. Shenker and D. EstrinTechnical Report 00-731, Computer Science Department, University of Southern California, February 2000.200002__000002 000369Multimedia Proxy Caching for Quality Adaptive Streaming Applications in the InternetR. Rejaie, H. Yu, M. Handley and D. EstrinIn Proceedings of Infocom 2000.2000____000002 000370Stream Control Transmission ProtocolR. Stewart, Q. Xie, K. Morneault, C. Sharp, H. Schwarzbauer, T. Taylor, I. Rytina, M. Kalla, L. Zhang and V. PaxsonRFC 2960, Proposed Standard, October 2000.200010__000002 000371On the Aggregatability of Multicast Forwarding StateD. Thaler and M. HandleyIn Proceedings of Infocom 2000.2000____000002 000372The Internet Multicast Address Allocation ArchitectureD. Thaler, M. Handley and D. EstrinRFC 2908, Informational, September 2000.200009__000002 000373A Comparison of the AES Candidates Amenability to FPGA ImplementationN. Weaver and J. WawrzynekThe Third Advanced Encryption Standard (AES) Candidate Conference, 2000.2000____000002 000374Reliable Multicast Transport Building Blocks for One-to-Many Bulk-Data TransferB Whetton, L. Vicisano, R. Kermode, M. Handley, S. Floyd and M. LubyExperimental RFC, October 2000.200010__000002 000375TCP Processing of the IPv4 Precedence FieldX. Xiao, A. Hannan, V. Paxson and E. CrabbeRFC 2873, Proposed Standard, June 2000.200006__000002 000376Ms.Detecting BackdoorsY. Zhang and V. PaxsonIn Proceedings of the 9th USENIX Security Symposium, August 2000.200008__000002 000377Detecting Stepping StonesY. Zhang and V. PaxsonIn Proceedings of the 9th USENIX Security Symposium, August 2000.200008__000002 000378The Stationarity of Internet Path Properties: Routing, Loss, and ThroughputY. Zhang, V. Paxson and S. ShenkerACIRI Technical Report, May 2000.200005__000002 000379TCP Byte Counting RefinementsM. AllmanACM Computer Communication Review, 29(3), July 1999.199907__000002 000380On the Effective Evaluation of TCPM. Allman and A. FalkACM Computer Communication Review, 29(5), October 1999.199910__000002 000381Enhancing TCP Over Satellite Channels using Standard Mechanisms.M. Allman, D. Glover and L. SanchezRFC 2488, BCP 28, January 1999.199901__000002 000382FTP Security ConsiderationsM. Allman and S. OstermannRFC 2577, May 1999.199905__000002 000383On Estimating End-to-End Network Path PropertiesM. Allman and V. PaxsonACM SIGCOMM, September 1999, Cambridge, MA.199909__000002 000384TCP Congestion ControlM. Allman, V. Paxson and W.R. StevensRFC 2581, Proposed Standard, April 1999.199904__000002 000385Web Caching and Zipf-like Distributions: Evidence and ImplicationsL. Breslau, P. Cao, L. Fan, G. Phillips and S. ShenkerIn Proceedings of Infocom 1999.1999____000002 000386Internetworking MultimediaJ. Crowcroft, M. Handley and I. WakemanMorgan Kaufmann, ISBN 1-55860-584-3, 1999.1999____000002 000387A Dynamic Bootstrap Mechanism for Rendezvous-based Multicast RoutingD. Estrin, M. Handley, A. Helmy, P. Huang and D. ThalerIn Proceedings of Infocom 1999.1999____000002 000388Promoting the Use of End-to-End Congestion Control in the InternetS. Floyd and K. FallIEEE/ACM Transactions on Networking, August 1999. Winner of the Communications Society William R. Bennett Prize Paper Award, 1999.199908__000002 000389The NewReno Modification to TCP's Fast Recovery AlgorithmS. Floyd and T. HendersonRFC 2582, Experimental, April 1999.199904__000002 000390An architecture for a global Internet host distance estimation serviceP. Francis, S. Jamin, V. Paxson, L. Zhang, D. Gryniewicz and Y. JinIn Proceedings of IEEE INFOCOM, March 1999.199903__000002 000391A readable TCP in the Prolac protocol languageE. Kohler, M.F. Kaashoek and D.R. MontgomeryIn Proceedings of the ACM SIGCOMM '99 Conference: Applications, Technologies, Architectures, and Protocols for Computer Communication, Cambridge, Massachusetts, August 1999, pages 3-13.199908__000002 000392Evolving software with an application-specific languageE. Kohler, M, Poletto and D.R. MontgomeryWorkshop Record of WCSSS '99: The 2nd ACM SIGPLAN Workshop on Compiler Support for Systems Software, Atlanta, Georgia, May 1999, pages 94-102.199905__000002 000393Satellite Network Performance Measurements Using Simulated Multi-User Internet TrafficH. Kruse, M. Allman, J. Griner, S. Ostermann and E. HelveyIn Proceedings of the Seventh International Conference on Telecommunication Systems. March 1999.199903__000002 000394IPPM Metrics for Measuring ConnectivityJ. Mahdavi and V. PaxsonRFC 2678, Proposed Standard, September 1999.199909__000002 000395The Click modular routerR. Morris, E. Kohler, J. Jannotti and M.F. KaashoekIn Proceedings of the 17th ACM Symposium on Operating Systems Principles (SOSP '99) , Kiawah Island, South Carolina, December 1999, pages 217-231.199912__000002 000396Bro: A System for Detecting Network Intruders in Real-TimeV. PaxsonComputer Networks, 31(23-24), pp. 2435-2463, 14 Dec. 1999.19991214000002 000397End-to-End Internet Packet DynamicsV. PaxsonIEEE/ACM Transactions on Networking, Vol.7, No.3, pp. 277-292, June 1999. An earlier version appeared in Proc. ACM SIGCOMM '97, September 1997, Cannes, France.199906__000002 000398Known TCP Implementation ProblemsV. Paxson, M. Allman, S. Dawson, W. Fenner, J. Griner, I. Heavens, K. Lahey, J. Semke and B. VolzRFC 2525, Informational, March 1999.199903__000002 000399Scaling of Multicast Trees: Comments on the Chuang-Sirbu Scaling LawG. Phillips, S. Shenker and H. TangmunarunkitSIGCOMM, September 1999.199909__000002 000400Exploiting the Bandwidth-Memory Tradeoff in Multicast State AggregationP. Radoslavov, D. Estrin and R. GovindanTechnical Report 99-697, Computer Science Department, University of Southern California, July 1999.199907__000002 000401A Proposal to add Explicit Congestion Notification (ECN) to IPK.K. Ramakrishnan and S. FloydRFC 2481, Experimental, January 1999. Obsoleted by RFC 3168, Proposed Standard.199901__000002 000402Quality Adaptation for Congestion Controlled Playback Video over the InternetR. Rejaie, M. Handley and D. EstrinIn Proceedings of ACM SIGCOMM '99, Cambridge, September 1999.199909__000002 000403RAP: An End-to-end Rate-based Congestion Control Mechanism for Realtime Streams in the Internet.R. Rejaie, M. Handley and D. EstrinIn Proceedings of Infocom 1999.1999____000002 000404Proxy Caching Mechanism for Multimedia Playback Streams in the InternetR. Rejaie, M. Handley, H. Yu and D. EstrinIn Proc. 4th International Web Caching Workshop, San Diego, California, March 31 - April 2, 1999.199903__000002 000405A Scalable Web Cache Consistency ArchitectureH. Yu, L. Breslau and S. ShenkerIn Proceedings of ACM SIGCOMM'99, Cambridge, Sept. 1999.199909__000002 000406Selfish behavior and stability of the Internet: a game-theoretic analysis of TCPA. Akella, R.M. Karp, S. Seshan, S. Shenker and C. Papadimitriou.In Proceedings ACM SIGCOMM 2002.2002____000003 000407Multiple sequence alighment with arbitrary gap costs: computing an optimal solution using polydhedral combinatoricsE. Althaus, A. Caprara, H.P. Lenhof and K. ReinertIn ECCB, 4-16, 2002.2002____000003 000408A polyhedral approach to surface reconstruction from planar contoursE. Althaus and C. FinkIn Proceedings of the Ninth Conference on Integer Programming and Combinatorial Optimization. Lecture Notes in Computer Science, 2002.2002____000003 000409Constant-factor approximation of vertex-cuts in planar graphsE. Amir, R. Krathgamer and S. RaoManuscript, 2002.2002____000003 000410The efficiency of resolution and Davis-Putnamn proceduresP. Beame, R.M. Karp, T. Pitassi and M. SaksIn SIAM Journal of Computing 31(4) 1048-1075, 2002.2002____000003 000411Discovering local structure in gene expression data: the order preserving submatrix problemA. Ben-Dor, B. Chor, R.M. Karp and Z. YakhiniIn Proceedings of the Sixth Annual International Conference on Research in Computational Molecular Biology (RECOMB'02), 2002.2002____000003 000412The restriction scaffold problemA. Ben-Dor, R.M. Karp, B Schwikowski and R. ShamirIn Proceedings of the Sixth Annual International Conference on Research in Computational Molecular Biology (RECOMB'02), 2002.2002____000003 000413Idempotents among Partesan GamesE. BerlekampIn More Games of No Chance, MSRI Publicatons vol. 42, pp 3-23, Cambridge University Press, 2002.2002____000003 000414Large-scale reconstruction of haplotype structure via perfect phylogenyE. Eskin, E. Halperin and R.M. KarpTechnical report UCB/CSD 2-1196, August 2002.200208__000003 000415Large-scale recovery of haplotypes from genotype data using imperfect phylogenyE. Halperin and E. EskinTechnical report no. UCB/CSD-1-1196, August 2002.200208__000003 000416Improved approximation algorithms for the partial vertex cover problemE. Halperin and R. SrinivasanIn 5th International Workshop on Approximation Algorithms for Combinatorial Optimization, (APPROX), 185-199, Springer, 2002.2002____000003 000417Mathematical challenges from genomics and molecular biologyR.M. KarpIn Notices of the American Mathematical Society, 49(5) 544-553, 2002.2002____000003 000418Hardness of approximation for vertex-connectivity network design problemsG. Kortsarz, R. Krauthgamer and J.R. LeeIn 5th International Workshop on Approximation Algorithms for Combinatorial Optimization (APPROX), 185-199, Springer, 2002.2002____000003 000419The intrinsic dimensionality of graphsR. Krauthgamer and J.R. LeeManuscript, 2002.2002____000003 000420Online scheduling for sorting buffersH. Racke, C. Sohler and M. WestermannEuropean Symposium on Algorithms, 820-832, 2002.2002____000003 000421Go Thermography: The 4/12/98 Jiang-Rui Environmental EndgameB. SpightMore Games of No Chance, MSRI Publictations vo. 42, Cambridge University Press, 2002.2002____000003 000422Distributed caching independent of the network sizeM. WestermannSymposium on Parallel Algorithms and Archiectures, 31-40, 2002.2002____000003 000423Distance metric learning, with application to clustering with side-informationE.P. Xing, A.Y. Ng, M.I. Jordan and S. RussellTo appear in Neural Information Processing Systems, 2002.2002____000003 000424A hierarchical Bayesian Markovian model for motifs in biopolymer sequencesE.P. Xing, M.I. Jordan, R.M. Karp and S. RussellTo appear in Neural Information Processing Systems, 2002.2002____000003 000425Curve reconstruction and the traveling salsman problem.E. AlthausPh.D. Thesis, Universitat des Saarlandes. 2001.2001____000003 000426TSP-based curve reconstruction in polynomial timeE. Althaus and K. MelhornSIAM Journal of Computing 31(1), 2001.2001____000003 000427Developments in Forecast Combination and Portfolio ChoiceC. Dunis, A. Timmermann and J. Moody, edsWiley Financial Economics, London, 2001.2001____000003 000428A Maximun Likelihood Polynomial Time Syndrome Decoder to Correct Linearly Independent ErrorsG.B. Horn and R.M. KarpIn Proceedings of International Symp. on Information Theory, 2001.2001____000003 000429Approximation algorithms for data management in networksC. Krick, H. Racke and M. WestermannIn Proceedings of the 13th ACM Symposium on Parallel Algorithms and Architectures (SPAA) pp. 237-246, 2001.2001____000003 000430Approximation algorithms for data management in networksC. Krick, H. Racke and M. WestermannTheory of Computing Systems, 2001.2001____000003 000431Neural Networks for Time Series AnalysisY Liao, J. Moody and L. WuIn Handbook on Neural Network Signal Processing, edited by Y-H Hu and J-N Hwang, CRC Press 2001.2002____000003 000432Learning to Trade via Direct ReinforcementJ. Moody and M. SaffellIEEE Transactions on Neural Networks, Vol. 12, No. 4, July 2001.200107__000003 000433On approximating optimal auctionsA. RonenIn The Third ACM Conference on Electronic Commerce (EC01), 11-17, 2001.2001____000003 000434TSP-based curve reconstruction in polynomial timeE. Althaus and K. MelhornIn Proceedings of the 11th Symp. Discrete Algorithms, pp. 686-695, 2000.2000____000003 000435Experiments on curve reconstructionE. Althaus, K. Melhorn. S Naher and S. SchirraIn Proceedings of the 2nd Workshop Algorithm Engineering and Experiments (ALENEX00). Lecture Notes in Computer Science, Springer-Verlag, 2000.2000____000003 000436A branch and cut algorithm for the optimal solution of the side chain placement problemE. Althaus, O. Kohlbacher, H.P. Lenhof and P. MullerTech report Vol.2000-1-001, Max-Planck-Institut fur Informatik: Foschungsbericht, 2000.2000____000003 000437A combinatorial approach to protein docking wit flexible side-chainsE. Althaus, O. Kohlbacher, H.P. Lenhof and P. MullerIn Proceedings of the 4th Annual International conference on Computational Molecular Biology (RECOMB-00), Tokyo, Japan, 2000.2000____000003 000438Universal DNA Tag Systems: A Combinatorial Design SchemeA. Ben-Dor, R. Karp, B. Schwikowski and Z. YakhiniProceedings of RECOMB 2000.2000____000003 000439An optimal algorithm for Monte Carlo estimationP. Dagum, R.M. Karp, M. Luby and S. RossSIAM Journal on Computing, SIAM J. Comput. (USA), vol.29, (no.5), SIAM, 2000. p.1484-96.2000____000003 000440Topologically-aware overlay construction and server selectionM. Handley, R.M. Karp, S. Ratnasamy and S. ShenkerIn Proceedings, INFOCOM 2000.2000____000003 000441Discovery of Regulatory Interactions Through Perturbation: Inference and Experimental DesignT.E. Ideker, V. Thorsson and R.M. KarpProceedings of Pacific Symposium on Biocomputing (2000).2000____000003 000442Constructing Heterogeneous Committees via Input Feature GroupingY. Liao and J. MoodyIn Advances in Neural Information Processing Systems, Vol.12, S.A. Solla, T.K. Leen and K.-R. Muller (eds.),MIT Press, 2000.2000____000003 000443Minimizing Downside Risk via Stochastic Dynamic ProgrammingJ. Moody and M. SaffellIn Computational Finance 1999, edited by Y. S. Abu-Mostafa, B. LeBaron, A. W. Lo, and A. S. Weigend, MIT Press, Cambridge, MA, 2000.2000____000003 000444Term Structure of Interactions of Foreign Exchange RatesJ. Moody and H. YangIn Computational Finance 1999, edited by Y. S. Abu-Mostafa, B. LeBaron, A. W. Lo, and A. S. Weigend, MIT Press, Cambridge, MA, 2000.2000____000003 000445Discrete Prediction games with Arbitrary Feedback and LossA. Piccolboni and C. SchindelhauerTechnical Report AIIM-TR-A-00-18, Universitat zu Lubeck, Novermber 2000.200011__000003 000446Data Visualization and Feature Selection: New Algorithms for Nongaussian DataH. Yang and J. MoodyIn Advances in Neural Information Processing Systems, Vol.12, S.A. Solla, T.K. Leen and K.R. Muller (eds.), MIT Press, 2000.2000____000003 000447Challenges for theory of computingA. Condon, H. Edelsbrunner, E.A. Emerson, L. Fortnow, et.al.SIGACT News, June 1999, vol.30, (no.2):62-76.199906__000003 000448Algorithms for graph partitioning on the planted partition model. (Randomization, Approximation, and Combinatorial Optimization. Algorithms and Techniques.A. Condon and R.M. KarpEdited by: Hochbaum, D., Jansen, K., Rolim, J.D.P., Sinclair, A. Berlin, Germany: Sprnger-Verlag, 1999. p.221-32. ix+287 pp. 141999____000003 000449Error-resilient DNA computation. Random Structures & Algorithms, Random StructR.M. Karp, C. Kenyon and O. WaartsAlgorithms (USA), vol.15, (no.3-4), Wiley, Oct.-Dec. 1999. p.450-66. 19199910__000003 000450Reinforcement Learning for Trading, Advances in Neural Information Processing Systems 11M.S. Kearns, S.A. Solla and D.A. Cohn, eds.MIT Press, Cambridge, MA 1999.1999____000003 000451Predicting Blood Glucose Metabolism in Diabetics -- A Machine Learning SolutionV. Tresp, T. Briegel and J. MoodyIEEE Transactions on Neural Networks, v. 10, n. 5, pp. 1204--1213, 1999.1999____000003 000452Feature Selection Based on Joint Mutual InformationH. Yang and J. MoodyIn Advances in Intelligent Data Analysis (AIDA), Computational Intelligence Methods and Applications (CIMA), International Computer Science Conventions, Rochester, New York, June 22-25, 1999.199906__000003 000453On the complexity of unsatisfiability proofs for random k-CNF formulasP. Beame, R.M. Karp, T. Pitassi and M. SaksProceedings of the Thirtieth Annual ACM Symposium on Theory of Computing. May 1998. New York, NY, USA: ACM, 1998. p. 561-71.199805__000003 000454Branch and infer: A unifying framework for integer and nite domain constraint programmingA. Bockmayr and T. KasperINFORMS Journal on Computing, 10(3): 287-300, 1998.1998____000003 000455Constructing maps using the span and inclusion relationsD. Fasulo, T. Jiang, R.M. Karp and N. SharmaRECOMB 98. Proceedings of the Second Annual International Conference on Computational Molecular Biology. March 1998. Edited by: Istrail, S.; Pevzner, P.; Waterman, M. New York, NY, USA: ACM, 1998. p. 64-73.1998____000003 000456Graph traversals, genes and matroids: an efficient case of the travelling salesman problem.D. Gusfield, R.M. Karp, L. Wang and P. StellingDiscrete Applied Mathematics, 9 Nov. 1998, vol.88, (no.1-3):167-80.1998____000003 000457Mapping clones with a given ordering or interleavingT. Jiang and R.M. KarpAlgorithmica, July 1998, vol.21, (no.3):262-84.1998____000003 000458Variations on the theme of "twenty questions"R.M. KarpProceedings. 1998 IEEE International Symposium on Information Theory, Aug. 1998. New York, NY, USA: IEEE, 1998. p. 3.199808__000003 000459Algorithms for optical mappingR.M. Karp and R. ShamirRECOMB 98. Proceedings of the Second Annual International Conference on Computational Molecular Biology. 22-25 March 1998. Edited by: Istrail, S.; Pevzner, P.; Waterman, M. New York, NY, USA: ACM, 1998. p. 117-24.1998____000003 000460On parallel evaluation of game treesR.M. Karp and Y. ZhangJournal of the ACM, Nov. 1998, vol.45, (no.6):1050-75.1998____000003 000461Forecasting the Economy with Neural Nets: A Survey of Challenges and SolutionsJ. MoodyIn Neural Networks: Tricks of the Trade, Genevieve B. Orr and Klaus-Robert Muller, eds., Springer Verlag, pp. 347-371, 1998.1998____000003 000462Reinforcement Learning for Trading: Immediate vs. Future RewardsJ. Moody and M. SaffellKnowledge Discovery and Datamining, Proceedings of the 1998 New York Conference, AAAI Press, 1998.1998____000003 000463Reinforcement Learning for Trading Systems and PortfoliosJ. Moody, M. Saffell, Y. Liao and L. WuDecision Technologies for Computational Finance, Proceedings of the London Conference, A.N. Refenes, N. Burgess and J. Moody, eds., Kluwer Financial Publishing, 1998.1998____000003 000464Performance Functions and Reinforcement Learning for Trading Systems and PortfoliosJ. Moody, L. Wu, Y. Liao and M. SaffellJournal of Forecasting, vol. 17, pp. 441-470, 1998.1998____000003 000465High Frequency Foreign Exchange Rates: Price Behavior Analysis and `True Price' ModelsJ. Moody and L. WuChapter 2 of Nonlinear Modelling of High Frequency Financial Data, Christian Dunis and Bin Zhou, editors, Wiley Financial Publishing, London, 1998.1998____000003 000466Decision Technologies for Computational Finance, Proceedings of the London ConferenceA. Refenes, N. Burgess, and J. Moody, eds.Kluwer Financial Publishing, The Netherlands, 1998.1998____000003 000467Emerging opportunities for theoretical computer scienceA.V. Aho, D.S. Johnson, R.M. Karp, S.R. Kosaraju, and othersSIGACT News, Sept. 1997, vol.28, (no.3):65-74.1997____000003 000468The rank of sparse random matrices over finite fieldsJ. Blomer, R.M. Karp and E. WelzlRandom Structures & Algorithms, July 1997, vol.10, (no.4):407-19.199707__000003 000469Nearly optimal competitive online replacement policiesR. El-Yaniv and R.M. KarpMathematics of Operations Research, Nov. 1997, vol.22, (no.4):814-39.1997____000003 000470Mapping clones with a given ordering or interleavingT. Jiang and R.M. KarpProceedings of the Eighth Annual ACM-SIAM Symposium on Discrete Algorithms. Jan. 1997. New York, NY, USA: ACM, 1997. p. 400-9.199701__000003 000471Stochastic Manhattan Learning: Time-Evolution Operator for the Ensemble DynamicsT. Leen and J. MoodyPhysical Review E, 1997.1997____000003 000472Smoothing Regularizers for Projective Basis Function NetworksJ. Moody and T. RognvaldssonIn Advances in Neural Information Processing Systems 9, M.C. Mozer, M.I. Jordan and T. Petsche, eds, MIT Press, Cambridge, 1997.1997____000003 000473Optimization of Trading Systems and PortfoliosJ. Moody and L. WuIn Decision Technologies for Financial Engineering, Y. Abu-Mostafa, A. N. Refenes, and A. S. Weigend, eds., World Scientific, London, 1997.1997____000003 000474What is the True Price? -- State Space Models for High Frequency FX RatesJ. Moody and L. WuIn Decision Technologies for Financial Engineering, Y. Abu-Mostafa, A. N. Refenes, and A. S. Weigend, eds., World Scientific, London, 1997.1997____000003 000475Multi-Effect Decompositions for Financial Data ModelingL. Wu and J. Moodyin Advances in Neural Information Processing Systems 9, M.C. Mozer, M.I. Jordan and T. Petsche, eds, MIT Press, Cambridge, 1997.1997____000003 000476Fast and intuitive clustering of Web documentsO. Zamir, O. Etzioni, O. Madani and R.M. KarpProceedings of the Third International Conference on Knowledge Discovery and Data Mining. 14-17 Aug. 1997. Edited by: Heckerman, D.; Mannila, H.; Pregibon, D.; Uthurusamy, R. Menlo Park, CA, USA: AAAI Press, 1997. p. 287-90.199708__000003 000477A method for obtaining randomized algorithms with small tail probabilitiesH. Alt, L. Guibas, K. Mehlhorn, R.M. Karp and othersAlgorithmica, Oct.-Nov. 1996, vol.16, (no.4-5):543-7.199610__000003 000478LogP: a practical model of parallel computationD.E. Culler, R.M. Karp, D. Patterson, A. Sahay, and othersCommunications of the ACM, Nov. 1996, vol.39, (no.11):78-85.199611__000003 000479Efficient information gathering on the InternetO. Etzioni, S. Hanks, T. Jiang, R.M. Karp and othersProceedings. 37th Annual Symposium Foundations of Computer Science, 14-16 Oct. 1996. Los Alamitos, CA, USA: IEEE Comput. Soc. Press, 1996. p. 234-43.199610__000003 000480Graph traversals, genes, and matroids: an efficient case of the travelling salesman problemD. Gusfield, R.M. Karp, W. Lusheng and P. StellingCombinatorial Pattern Matching. 7th Annual Symposium, CPM 96. Proceedings. 10-12 June 1996. Berlin, Germany: Springer-Verlag, 1996. p. 304-19.199606__000003 000481Efficient PRAM simulation on a distributed memory machine.R.M. Karp, M. Luby and F. Meyer auf der HeideAlgorithmica, Oct.-Nov. 1996, vol.16, (no.4-5):517-42.199610__000003 000482A Neural Network Visualization and Sensitivity Analysis ToolkitY. Liao and J. MoodyProceedings of the International Conference on Neural Information Processing, Hong Kong, Sun-ichi Amari, Lei Xu, Laiwan Chan, Irwin King, and Kwong-Sak Leung, eds. Springer Verlag Singapore Pte. Ltd. pp. 1069-74, Sept. 1996.199609__000003 000485Incorporating Tandem/HATs MLP Features into SRI's Conversational Speech Recognition SystemQ. Zhu, A. Stolcke, B. Y. Chen, and N. MorganProceedings of the EARS RT-04F Workshop, Palisades, New York, November 2004.200411__000001http://www.icsi.berkeley.edu/ftp/global/pub/speech/papers/EARS-RT04f-na-system.pdf 000486The ICSI Meeting Corpus: Close-talking and Far-field, Multi-channel Transcriptions for Speech and Language ResearchersJ.A. EdwardsLREC 2004, Workshop on Compiling and Processing Spoken Language Corpora, Lisbon, Portugal, May 2004.200405__000001http://www.icsi.berkeley.edu/ftp/global/pub/speech/papers/edwards-lrec2004.pdf 000487Auditory-based Automatic Speech RecognitionW. Hemmert, M. Holmberg and D. GelbartProc. ISCA Tutorial and Research Workshop on Statistical and Perceptual Audio Processing, Jeju, Korea, October 2004.200410__000001http://www.icsi.berkeley.edu/ftp/global/pub/speech/papers/sapa04-hemmert.pdf 000488Vocabulary and Language Model Adaptation using Information RetrievalB. Bigi, Y. Huang and R. De MoriProc. Intl. Conf. Spoken Language Processing, Jeju, Korea, October 2004.200410__000001http://www.icsi.berkeley.edu/Speech/papers/icslp2004-bigi.pdf 000489Learning Long-Term Temporal Features in LVCSR Using Neural NetworksB. Chen, Q. Zhu and N. MorganProc. Intl. Conf. Spoken Language Processing, Jeju, Korea, October 2004.200410__000001http://www.icsi.berkeley.edu/ftp/global/pub/speech/papers/icslp2004-byc.pdf 000490On using MLP features in LVCSRQ. Zhu, B. Chen, N. Morgan and A. StolckeProc. Intl. Conf. Spoken Language Processing, Jeju, Korea, October 2004.200410__000001http://www.icsi.berkeley.edu/ftp/global/pub/speech/papers/icslp2004-mlp-lvcsr.pdf 000491Direct Modeling of Prosody: An Overview of Applications in Automatic Speech ProcessingE. Shriberg and A. StolckeProc. International Conference on Speech Prosody, Nara, Japan, March 2004.200403__000001http://www.icsi.berkeley.edu/ftp/global/pub/speech/papers/shriberg_SP04_v3.pdf 000492Prosody Modeling for Automatic Speech Recognition and UnderstandingE. Shriberg and A. StolckeMathematical Foundations of Speech and Language Modeling, M. Johnson, M. Ostendorf, S. Khudanpur, R. Rosenfeld (eds.), Volume 138 in IMA Volumes in Mathematics and its Applications, pp. 105-114, Springer-Verlag.2004____000001http://www.speech.sri.com/cgi-bin/run-distill?papers/IMA-prosody.ps.gz 000484A Minimax Arc Theorem for Reducible Flow GraphsVijaya Ramachandrantr-87-001198711__000006We establish a conjecture of Frank and Gyarfas byproving that the cardinality of a minimum feedbackarc set in a reducible flow graph is equal to thecardinality of a maximum collection of arc disjointcycles. 000493Monitoring and Management-Support of Distributed SystemsDieter Haban, Dieter Wybranietz, Amnon Baraktr-88-007198811__000006This paper describes a tool for on-line monitoring ofdistributed systems. The tool consists of a hardwarecomponent and software level, i.e., a hybridmonitor, which is capable of presenting theinteractive user and the local operating system witha high-level information and performanceevaluation of the activities in the host system withminimal interferences. A special hardware support,which consists of a test and measurement processor(TMP), was designed and has been implemented in thenodes of an experimental multicomputer system. Themain function of the TMP is to execute low leveloperating system functions, to manage localresources and to trigger time driven events in orderto reduce the overhead of the host operating system.The operations of the TMP are completely transparentto the users with a minimal, less that 0.1%, overheadto the hardware system. In the experimental system,all the TMPs were connected with a central monitoringstation, using an independent communicationnetwork in order to provide a global view of themonitored system. The central monitoring stationdisplays the resulting information in easy-to-readcharts and graphs. Our experience with the TMP showsthat it promotes an improved understanding ofrun-time behavior and performance measurements, toderive qualitative and quantitative assessments ofdistributed systems. 000494Links Between Markov Models and Multilayer PerceptronsHerve Bourlard, C. J. Wellekenstr-88-008198811__000006Hidden Markov models are widely used for automaticspeech recognition. They inherently incorporatethe sequential character of speech signal and arestatistically trained. However, the a priori choiceof a model topology limits the flexibility of theHMM's. Another drawback of these models is their weakdiscriminating power.

Multilayer perceptrons arenow promising tools in the connectionist approachfor classification problems and have already beensuccessfully tested on speech recognitionproblems. However, the sequential nature of thespeech signal remains difficult to handle in thatkind of machine.

In this paper, a discriminant hiddenMarkov model is defined and it is shown how aparticular multilayer perceptron with contextualand extra feedback input units can be considered as ageneral form of such Markov models. Relations withother recurrent networks commonly used in speechrecognition are also pointed out. 000495Designing Computers to Check Their WorkManuel Blumtr-88-009198811__000006Students, engineers, programmers...are taught tocheck their work. Computer programs are not. Thereare several reasons for this:

1. Computer hardwarealmost never makes errors -- but that fails torecognize that programmers unfortunately do!

2.Programs are hard enough to write without having toalso write program checkers for them -- but that is theprice of increased confidence!

3. There is no clearnotion what constitutes a good checker. Indeed, thesame students and engineers who are cautioned tocheck their work are rarely informed what it is thatmakes for a good procedure to do so -- but that is justthe sort of problem that computer scientists shouldbe able to solve!

In the view of the author, the lack ofcorrectness checks in programs is an oversight.Programs have bugs that could perfectly well becaught by such checks. This paper urges that programsbe written to check their work, and outlines apromising and rigorous approach to the study of thisfascinating new area. 000496Knowledge-Intensive Recruitment LearningJoachim Diederichtr-88-010198811__000006The model described in this paper is aknowledge-intensive connectionist learningsystem which uses a built-in knowledgerepresentation module for inferencing, and thisreasoning capability in turn is used forknowledge-intensive learning. On theconnectionist network level, the central process isthe recruitment of new units and the assembly of unitsto represent new conceptual information. Free,uncommitted subnetworks are connected to thebuilt-in knowledge network during learning. Thegoal of knowledge-intensive connectionistlearning is to improve the operationality of theknowledge representation: mediated inferences,i.e., complex inferences which require severalinference steps, are transformed into immediateinferences; in other words, recognition is based onthe immediate excitation from features directlyassociated with a concept. 000497Time, Space and Form in VisionJerome A. Feldmantr-88-011198812__000006The prodigious spatial capabilities of the primatevisual system are even more remarkable when temporalconsiderations are taken into account. Recentadvances in neurophysiology, psychophysics andcomputer vision provide significant constraints onhow the system could work. This paper presents afairly detailed connectionist computational modelof how the perception and recognition of objects iscarried out by primate brains. The model is claimed tobe functionally adequate and to satisfy all theconstraints established by the variousdisciplines. One key notion introduced is amulti-input, multi-output network for invertingspatio-temporal cues. The central construct inintermediate level motion vision is taken to be thetrajectory and these are used in recognition ofdynamic situations called scenarios. The entiredevelopment is an extension of the author's 1985 FourFrames model, which required relatively littlemodification to accommodate temporal change(eventually). 000498On a Theory of Computation and Complexity Over the Real Numbers; NPCompleteness, Recursive Functions and Universal MachinesLenore Blum, Mike Shub, Steve Smaletr-88-012198812__000006We present a model for computation over the reals or anarbitrary (ordered) ring R. In this general setting,we obtain universal machines, partial recursivefunctions, as well as NP complete problems. While ourtheory reflects the classical theory over Z (e.g.,the computable functions are the recursivefunctions) it also reflects the specialmathematical character of the underlying ring R(e.g., complements of Julia sets provide naturalexamples of R.E. undecidable sets over the reals) andprovides a natural setting for studyingfoundational issues concerning algorithms innumerical analysis. 000499Program Correctness Checking and the Design of Programs That Check TheirWorkManuel Blum, Sampath Kannantr-88-013198812__000006A program correctness checker is an algorithm forchecking the output of a computation. This paperdefines the concept of a program checker. It designsprogram checkers for a few specific and carefullychosen problems in the class P of problems solvable inpolynomial time. It also applies methods of moderncryptography, especially the idea of aprobabilistic interactive proof, to the design ofprogram checkers for group theoretic computations.Finally it characterizes the problems that can bechecked. 000500Guaranteeing Performance for Real-Time Communication in Wide-Area NetworksDomenico Ferraritr-89-001198901__000006The increasing importance of distributedmultimedia applications and the emergence of userinterfaces based on digital audio and digital videowill soon require that computer communicationnetworks offer real-time services. This paperargues that the feasibility for providingperformance guarantees in a wide-area networkshould be investigated, and describes a possibleapproach. We present a model of the network to bestudied, and discuss its generality, as well as thepresumable limits to its validity in the future. Wealso give a careful formulation of the problem,including a precise definition of the guarantees tobe provided and a provably correct scheme for theestablishment of real-time connections withdeterministic, statistical, and best-effort delaybounds. 000501Pseudo-Random Number Generator From ANY One-Way FunctionRussell Impagliazzo, Mike Lubytr-89-002198902__000006We construct a pseudo-random number generator fromANY one-way function. Previous results show how toconstruct pseudo-random number generators fromone-way functions that have special properties(Blum and Micali [BM], Yao [Y], Levin [L1],[Goldreich, Krawczyk and Luby [GKL]). We usetechniques borrowed from the theory ofslightly-random sources (Santha and Vazirani [SV],Vazirani and Vazirani [VV], Vazirani [V], Chor andGoldreich [CG]) and from the theory of universal hashfunctions (Carter and Wegman [CW]).

We alsointroduce a weaker kind of one-way function, that wecall an informationally one-way function. For aninformationally one-way function f, given y = f(x)for a randomly chosen x, it is hard to generateuniformly a random preimage of y. We show that theexistence of an informationally one-way functionyields a one-way function in the usual sense, andhence a pseudo-random number generator. Theseresults can be combined to show that the following areequivalent: (1) private key encryption; (2) bitcommitment; (3) pseudo-random number generators;(4) one-way functions; (5) informationally one-wayfunctions. 000502Parallel Search for Maximal Independence Given Minimal DependencePaul Beame, Michael Lubytr-89-003198902__000006We consider the problem of finding a maximalindependent set fast in parallel when theindependence system is presented as an explicit listof minimal dependent sets. Karp and Wigderson [KW]were the first to find an NC algorithm for the specialcase when the size of each minimal dependent set is atmost two, and subsequent work by Luby [Lu1], by Alon,Babai and Itai[ABI] and Goldberg and Spencer [GS]have introduced substantially better algorithmsfor this case. On the other hand, no previous work onthis problem extends even to the case when the size ofeach minimal dependent set is at most a constant, andwe conjecture that this algorithm is a randomized NCalgorithm for the general case. 000503Towards a Theory of Average Case ComplexityShai Ben-David, Benny Chor, Oded Goldreich, Michael Lubytr-89-004198902__000006This paper takes the next step in developing thetheory of average case complexity, a study initiatedby Levin. Previous works have focused on theexistence of complete problems [Le,Gu,VL]. We widenthe scope to other basic questions in computationalcomplexity. For the first time in the context ofaverage case complexity, we show the equivalence ofsearch and decision problems, analyze the structureof NP under P reductions, and relate the NP versusaverage-P to non-deterministic versusdeterministic (worst case) exponential time. Wealso present definitions and basic theoremsregarding other complexity classes, such as averagelog-space. 000504A Study of Password SecurityMichael Luby, Charles Rackofftr-89-005198902__000006We prove relationships between the security of afunction generator when used in an encryption schemeand the security of a function generator when used in aUNIX-like password scheme. 000505Fault-Tolerant Routing in Hypercube Multicomputers Using Depth-First SearchMing-Syan Chen, Kang G. Shintr-89-006198902__000006A fault-tolerant routing scheme for hypercubemulticomputers is developed using the depth-firstsearch. The routing scheme requires a node to knowonly the condition (faulty or not) of its own links,and adds information on the components traversed toeach message as it is routed toward the destinationnode.

Performance of the proposed routing scheme isrigorously analyzed. We derive an exact expressionfor the probability of routing messages via optimalpaths (of length identical to the Hamming distancebetween the corresponding pair of nodes) from thesource node to an obstructed node, the first node on apath determined by the above routing scheme fromwhich no optimal path to the destination exists.Moreover, bounds for this probability are derived inclosed form. The probability of routing messages viaoptimal paths between the source and destination canbe obtained from this expression by replacing theobstructed node with the destination node. Thelengths of paths obtained from this scheme areanalyzed, and the scheme, despite its simplicity, isshown to be able to route messages via optimal pathswith a very high probability.

Due to the absence ofinformation at each node on components other than itsown links, the actual paths chosen by the above schemecould sometimes be longer than the desired. Toalleviate this deficiency, we also present a simplemodification to the above routing scheme in whichevery node is made aware of not only the condition ofits own links but also that of links one hop away fromthe node. The improvement of routing efficiency withthis additional information at each node isanalyzed. 000506A Linear-Algorithm for Enumerating Perfect Matchings in Skew BipartiteGraphsPaul Dagumtr-89-007198902__000006Let G = (U,V,E) be a bipartite graph with |E| = m, U unionV = {v(subscript 1),..., v(subscript 2n)} and withthe bipartition U consisting of all odd indexedvertices and V consisting of all even indexedvertices. An edge in G is always assumed to be orientedtowards the endpoint with the larger index. We referto the up (resp. down) edges of G as the edges which areoriented from an even (resp. odd) indexed vertex. Ifall the up edges are nested among themselves and amongthe down edges we say G is a skew graph. The main resultof this paper is to give an O(m) algorithm to enumerateperfect matchings in skew graphs. Applications toouterplanar graphs and some problems in chemistryare given. 000507Spreading Activation and Connectionist Models for Natural LanguageProcessingJoachim Diederichtr-89-008198902__000006High level cognitive tasks performed by anartificial neural network require both knowledgeover a domain and inferencing abilities. To operatein a complex, natural environment neural networksmust have robust, reliable and massively parallelinference mechanisms. This paper describes variousspreading activation and connectionist mechanismsfor inferencing as part of natural languageprocessing systems, including possible techniquesto enrich these systems by machine learning. Inparticular models which attack one or more importantproblems such as variable binding,knowledge-intensive learning, avoidance ofcross-talk and false classifications are selectedfor this overview. 000508Constructive Omega(t(superscript 1.26)) Lower Bound for the Ramsey Number R(3,t)Richard Cleve, Paul Dagumtr-89-009198902__000006We present a feasibly constructive proof that R(3,t)> 5((t-1)/2)(superscript (log4/log3)) ElementOmega (t(superscript 1.26)). This is, as far as weknow, the first constructive superlinear lowerbound for R(3,t). Also, our result yields the firstfeasible method for constructing triangle-freek-chromatic graphs that are polynomial-size in k. 000509Conceptual Hierarchies in Classical and Connectionist ArchitectureAlfred Kobsatr-89-010198902__000006Representation systems for conceptual hierarchieshave been used in the field of ArtificialIntelligence for nearly two decades. They are basedon symbolic representation structures andsequential processes operating upon thesestructures. Recently, a number of networkstructures have been developed in the field ofConnectionism which are also claimed to be able torepresent conceptual hierarchies. Processes inthese networks operate in a parallel way and largelywithout a global control mechanism. This paperinvestigates the expressive power,interpretation, and inferential capabilities ofthese networks as compared to traditionalrepresentations, of concept hierarchies inparticular to KL-ONE, a standard representationlanguage for conceptual hierarchies in the field ofnatural-language processing. Although thecapabilities of current connectionist hierarchiesfall short of traditional representations, threeinference processes will be described which can bevery easily and elegantly realized in aconnectionist architecture whilst they are hard andcumbersome to implement in traditional knowledgerepresentation systems. 000510Preemptive Ensemble Motion Planning on a TreeGreg N. Frederickson, D. J. Guantr-89-011198903__000006Consider the problem of transporting a set of objectsbetween the vertices of a tree by a vehicle thattravels along the edges of the tree. The vehicle cancarry only one object at a time, and it starts andfinishes at the same vertex of the tree. It is shownthat if objects can be dropped at intermediatevertices along its route and picked up later, then theproblem can be solved in polynomial time. Twoefficient algorithms are presented for thisproblem. The first algorithm runs in O(k + qn) time,where n is the number of vertices in the tree, k is thenumber of objects to be moved, and q is less than orequal to min{k,n} is the number of nontrivialconnected components in a related directed graph.The second algorithm runs in O(k + nlogn) time.

* Hassince been revised by author. Contact him via"gnf at cs.purdue.edu" for a current copy. 000511Nonpreemptive Ensemble Motion Planning on a TreeGreg N. Frederickson, D. J. Guantr-89-012198903__000006Consider the problem of transporting a set of objectsbetween the vertices of a tree by a vehicle thattravels along the edges of the tree. The vehicle cancarry only one object at a time, and it starts andfinishes at the same vertex of the tree. It is shownthat if each object must be carried directly from itsinitial vertex to its destination, then finding aminimum cost transportation is NP-hard. Severalfast approximation algorithms are presented forthis problem. The fastest runs in O(k + n) time andgenerates a transportation of cost at most 3/2 timesthe cost of an optimal transportation, where n is thenumber of vertices in the tree, k is the number ofobjects to be moved. Another runs in O(k +nlogbeta(n,q)) time, and generates atransportation of cost at most 5/4 times the cost of anoptimal transportation, where q is less than or equalto min{k,n} is the number of nontrivial connectedcomponents in a related directed graph.

* Has sincebeen revised by author. Contact him via"gnf at cs.purdue.edu" for a current copy. 000512The Establishment of the International Computer Science Institute inBerkeley, California: Venturing with NorbertRon Kaytr-89-013198903__000006This is an account of the events and considerationswhich led to the establishment of the InternationalComputer Science Institute in Berkeley,California. The initiative for this undertakingcame from Norbert Szyperski, as Managing Director ofthe German National Center for Computer Science(GMD). He also took the lead in assuring support on thepart of German industry and government. Copies of themost important source documents are included as anappendix to this account. 000513Subtree Isomorphism is in Random NCPhilip Gibbons, Richard M. Karp, Gary L. Miller, Danny Sorokertr-89-014198903__000006Given two trees, a guest tree G and a host tree H, thesubtree isomorphism problem is to determine whetherthere is a subgraph of H that is isomorphic to G. Wepresent a randomized parallel algorithm for findingsuch an isomorphism, if it exists. The algorithm runsin time O(log(superscript 3)n) on a CREW PRAM, where nis the number of nodes in H. The number of processorsrequired by the algorithm is polynomial in n.Randomization is used (solely) to solve each of aseries of bipartite matching problems during thecourse of the algorithm. We demonstrate the closeconnection between the two problems by presenting alog-space reduction from bipartite perfectmatching to subtree isomorphism. Finally, wepresent some techniques to reduce the number ofprocessors used by the algorithm. 000514Planar Graph Decomposition and All Pairs Shortest PathsGreg N. Fredericksontr-89-015198903__000006An algorithm is presented for generating a succinctencoding of all pairs shortest path information in adirected planar graph G with real-valued edge costsbut not negative cycles. The algorithm runs in O(pn)time, where n is the number of vertices in G, and p is theminimum cardinality of a subset of the faces thatcover all vertices, taken over all planar embeddingsof G. The algorithm is based on a decomposition of thegraph into O(pn) outerplanar subgraphs satisfyingcertain separator properties. Linear-timealgorithms are presented for various subproblemsincluding that of finding an appropriate embeddingof G and a corresponding face-on-vertex covering ofcardinality O(p), and of generating all pairsshortest path information in a directed outerplanargraph.

* Has since been revised by author. Contact himvia "gnf at cs.purdue.edu" for a current copy. 000515Explanation and Connectionist SystemsJoachim Diederichtr-89-016198904__000006Explanation is an important function in symbolicartificial intelligence (AI). For example,explanation is used in machine learning and for theinterpretation of prediction failures incase-based reasoning. Furthermore, theexplanation of results of a reasoning process to auser who is not a domain expert must be a component ofany inference system. Experience with expertsystems has shown that the ability to generateexplanations is absolutely crucial for theuser-acceptance of AI systems (Davis, Buchanan &Shortliffe 1977). In contrast to symbolic systems,neural networks have no explicit, declarativeknowledge representation and therefore haveconsiderable difficulties in generatingexplanation structures. In neural networks,knowledge is encoded in numeric parameters(weights) and distributed all over the system.

It isthe intention of this paper to discuss the ability ofconnectionist systems to generate explanations. Itwill be shown that connectionist systems benefitfrom the explicit encoding of relations and the use ofhighly structured networks in order to realizeexplanation and explanation components.Furthermore, structured connectionist systemsusing spreading activation have the advantage thatany intermediate state in processing issemantically meaningful and can be used forexplanation. The paper describes severalsuccessful applications of explanation componentsin connectionist systems which use highlystructured networks, and discusses possible futurerealizations of explanation in neural networks. 000516Generalization and Parameter Estimation in Feedforward Nets: SomeExperimentsN. Morgan, H. Bourlardtr-89-017198904__000006We have begun an empirical study of the relation of thenumber of parameters (weights) in a feedforward netto generalization performance. Two experiments arereported. In one, we use simulated data sets withwell-controlled parameters, such as thesignal-to-noise ratio of continuous-valued data.In the second, we train the network onvector-quantized mel cepstra from real speechsamples. In each case, we use back-propagation totrain the feedforward net to discriminate in amultiple class pattern classification problem. Wereport the results of these studies, and show theapplication of cross-validation techniques toprevent overfitting. 000517A Parallel Algorithm for Maximum Matching in Planar GraphsMarek Karpinski, Elias Dahlhaus, Andrzej Lingastr-89-018198904__000006We present a new parallel algorithm for finding amaximum (cardinality) matching in a planarbipartite graph G. Our algorithm is processor-timeproduct efficient if the size l of a maximum matchingof G is large. It runs in time O((n/2-l + (the squareroot of n))log (superscript 7)n) on a CRCW PRAM withO(n(superscript 1.5)log (superscript 3)n)processors. 000518A More Practical PRAM ModelPhillip B. Gibbonstr-89-019198904__000006This paper introduces the Asynchronous PRAM model ofcomputation, a variant of the PRAM in which theprocessors run asynchronously and there is anexplicit charge for synchronization. A family ofAsynchronous PRAM's are defined, varying in thetypes of synchronization steps permitted and thecosts for accessing the shared memory. Algorithms,lower bounds, and simulation results are presentedfor an intersting member of the family. 000519Multiple Network Embeddings into HypercubesAjay Gupta, Susanne E. Hambruschtr-89-020198904__000006In this paper we study the problem of how toefficiently embed r interconnection networksG(subscript 0),...G(subscript r-1), r is less thanor equal to k, into a k-dimensional hypercube H so thatevery node of the hypercube is assigned at most r nodesall of which belong to different G(subscript i)s.When each G(subscript i) is a complete binary tree or aleap tree of 2(superscript k)-1 nodes, we describe anembedding achieving a dilation of 2 and a load of 5 and6, respectively. For the cases when each G(subscripti) is a linear array of a 2-dimensional mesh of2(superscript k) nodes, we describe embeddings thatachieve a dilation of 1 and an optimal load of 2 and 4,respectively. Using these embeddings, we also showthat r(subscript 1) complete binary trees,r(subscript 2) leap trees, r(subscript 3) lineararrays, and r(subscript 4) meshes cansimultaneously be embedded into H with constantdilation and load, (4 over sum over i=1) (r(subscripti)) is less than or equal to k. 000520Learning Read-Once Formulas Using Membership QueriesLisa Hellerstein, Marek Karpinskitr-89-021198904__000006In this paper we examine the problem of exact learning(and inferring) of read-once formulas (also calledmu-formulas or boolean trees) using membershipqueries. The power of membership queries in learningvarious classes of formulas was studied by Angluin[A]. Valiant proved that, using three powerfuloracles, read-once formulas can be learned inpolynomial time [V]. Pitt and Valiant proved that ifRP is not equal to NP, read-once formulas cannot belearned by example in polynomial time [PV,KLPV]. Weshow that given explicitly a boolean formula fdefining a read-once function, if RP is not equal toNP, then there does not exist a polynomial timealgorithm for inferring an equivalent read-onceformula. An easy argument on the cardinality of theset of all (read-once) 1-term DNF formulas implies anexponential lower bound on the number of membershipqueries necessary to learn read-once formulas.Angluin showed that it takes time 2(superscriptOmega(n)) to learn monotone n-term DNf formulasusing membership queries [A]. We prove that,surprisingly, it is possible to learn monotoneread-once formulas in polynomial time usingmembership queries. We present an algorithm thatruns in time O(n(superscript 3)) and makesO(n(superscript 3)) queries to the oracle. It isbased on a combinatorial characterization ofread-once formulas developed by Karchmer et. al.[KLNSW]. We also use the combinatorialcharacterization to prove two other results. We showthat read-once formulas can be learned in polynomialtime using only one of the three oracles used inValiant's polynomial time algorithm. In addition,we show that given an arbitrary boolean formula f, theproblem of deciding whether f defines a read-oncefunction is complete in the class D(superscript P)under randomized NC(superscript 1)-reductions.The main results of this paper can also be interpretedin terms of efficient input oracle algorithms forboolean function interpolation (cf. [KUW],[GKS]. 000521Real-Time Communication in Packet-Switching Wide-Area NetworksDomenico Ferraritr-89-022198905__000006The increasing importance of distributedmultimedia applications and the emergence of userinterfaces based on digital audio and digital videowill soon require that computer communicationnetworks offer real-time services. This paperargues that the feasibility of providingperformance guarantees in a packet-switchingwide-area network should be investigated, anddescribes a possible approach. We present a model ofthe network to be studied, and discuss itsgenerality, as well as the presumable limits to itsvalidity in the future. We also formulate theproblem, give a definition of the guarantees to beprovided, and describe a correct scheme for theestablishment of real-time connections withdeterministic, statistical, and best-effort delaybounds. 000522Approximating the Permanent of Graphs with Large FactorsPaul Dagum, Michael Lubytr-89-023198904__000006Let G = (U,V,E) be a bipartite graph with |U|=|V|=n.The factor size of G,f, is the maximum number of edgedisjoint perfect matchings in G. We characterize thecomplexity of counting the number of perfectmatchings in classes of graphs parameterized byfactor size. We describe the simple algorithm, whichis an approximation algorithm for the permanent thatis a natural simplification of the algorithmsuggested in [Broder 86] and analyzed in [Jerrum,Sinclair 88 a,b]. A combinatorial lemma is used toprove that the simple algorithm runs in timen(superscript O(n/f)). Thus (1) for all constantsalpha > 0, the simple algorithm runs in polynomialtime for graphs with factor size at least alpha(n);(2) for some constant c, the simple algorithm is thefastest known approximation for graphs with factorsize at least c log n. (Compare with the approximationalgorithms described in [Karmarkar, et. al. 88).

Weprove the following complementary hardnessresults. For functions f such that 3 is less than orequal to f(n) is less than or equal to n-3, the exactcounting problem for f(n)-regular bipartite graphsis #P-complete. For any epsilon > 0, for any function fsuch that 3 is less than or equal to f(n) is less than orequal to n (superscript 1-epsilon), approximatecounting for f(n)-regular bipartite graphs is ashard as approximate counting for all bipartitegraphs. 000523An Efficient Parallel Algorithm for the Minimal Elimination Ordering (MEO)of an Arbitrary GraphElias Dahlhaus, Marek Karpinskitr-89-024198905__000006We design the first efficient parallel algorithm forcomputing Minimal Elimination Ordering (MEO) of anarbitrary graph.

The algorithm works inO(log(superscript 3)n) parallel time and O(nm)processors on a CRCW PRAM, for an n-vertex, m-edgegraph, and is optimal up to polylogarithmic factorwith respect to the best sequential algorithm ofRose, Tarjan and Lueker.

The MEO Problem forarbitrary graphs arises in a number of combinatorialoptimization problems, as well as in databaseapplications, scheduling problems, and the sparseGaussian elimination of symmetric matrices. It wasbelieved before to be inherently sequential andstrongly resisting sublinear parallel time(sublinear sequential storage) algorithms.

As anapplication, this paper gives the first efficientparallel solutions to the problem of Minimal Fill-Infor arbitrary graphs (and connected combinatorialproblems, cf. [RTL 76],[Ta 85]), and to the problem ofthe Gaussian elimination of sparse symmetricmatrices [Ro 70], [Ro 73]. (The problem of computingMinimum Fill-In is known to be NP-complete [Ya 81]).It gives also an alternative to [GM 87] efficientparallel algorithm for computing Breadth-FirstSearch (BFS) trees in arbitrary graphs using O(nm)processors on a CRCW PRAM.

The method of solutioninvolves a development of new techniques for solvingconnected minimal set system problem, and combiningit with some new divide-and-conquer methods. 000524On Parallel Evaluation of Game TreesRichard M. Karp, Yanjun Zhangtr-89-025198905__000006We present parallel algorithms for evaluating gametrees. These algorithms parallelize the"left-to-right" sequential algorithm forevaluating AND/OR trees and the alpha-beta pruningnprocedure for evaluating MIN/MAX trees. We showthat, on every instance of a uniform tree, theseparallel algorithms achieve a linear speed-up overtheir corresponding sequential algorithms, ifnumber of processors used is close to the height of theinput tree. These are the first non-trivialdeterministic speed-up bounds known for the"left-to-right" algorithm and the alpha-betapruning procedure. 000525Separating Abstraction from Implementation in Communication Network DesignRamon Cacerestr-89-026198905__000006Datagrams and visual circuits are not disjointconceptual models for data communication, butrather inhabitants of a wide design space containingmany other viable networking solutions. Many designchoices often closely associated with these twocommunication styles can be decoupled from thedatagram and virtual circuit abstractions, andcombined to form new and effective networkimplementations. This paper examines several keyelements of network architecture. For each element,it shows how certain characteristics often thoughtto differentiate datagrams and virtual circuits areindependent of these two concepts and form amulti-valued spectrum of design choices. Thisdiscussion is motivated by the current drive todesign a new generation of high-speed wide-areanetworks, and the observation that this effort wouldbenefit from a more systematic evaluation ofexisting and future network design alternatives. 000526Boolean Circuit Complexity of Algebraic Interpolation ProblemsMarek Karpinskitr-89-027198905__000006We present here some recent results on fast parallelinterpolation of multivariate polynomials overfinite fields. Some applications towards thegeneral conversion algorithms for booleanfunctions are also formulated. 000527Application of Real-Time Monitoring to Scheduling Tasks with RandomExecution TimesDieter Haban, Kang Shintr-89-028198905__000006A real-time monitor is employed to aid in schedulingtasks with random execution times in a real-timecomputing system. Scheduling algorithms areusually based on the worst-case execution time (WET)of each task. Due to data-dependent loops andconditional branches in each program and resourcesharing delay during execution, this WET is usuallydifficult to obtain and could be several orders ofmagnitude larger than the true exception time. Thus,scheduling tasks based on WET could result in a severeunderutilization of CPU cycles andunder-estimation of systems schedulability.

Toalleviate the above problem, we propose to use areal-time monitor as a scheduling aid. The real-timemonitor is composed of dedicated hardware, calledTest and Measurement Processor (TMP), and used tomeasure accurately, with minimal interference, thetrue execution time which consists of the pureexecution time and resource sharing delay. Themonitor is a permanent and transparent part of areal-time system, degrades system performance byless than 0.1 percent, and does not interfere with thehost system's execution.

Using the measured pureexecution time and resource sharing delay for eachtask, we have developed a mechanism which reduces thediscrepancy between the WET and the estimatedexecution time. This result is then used to decide atan earliest possible time whether or not a task canmeet its deadline. 000528Behavior and Performance Analysis of Distributed Systems Using a HybridMonitorDieter Haban, Dieter Wybranietztr-89-029198905__000006This paper describes a hybrid monitor for measuringthe performance and observing the behavior ofdistributed systems during execution. We emphasizedata collection, analysis and presentation ofexecution data. A special hardware support, whichconsists of a test and measurement processor (TMP),was designed and has been implemented in the nodes ofan experimental multicomputer system consisting ofeleven nodes. The operations of the TMP arecompletely transparent with a minimal, less than0.1%, overhead to the measured system. In theexperimental system, all the TMPs were connectedwith a central monitoring station, using anindependent communication network, in order toprovide a global view of the monitored system. Thecentral monitoring station displays the resultinginformation in easy-to-read charts and graphs. Ourexperience with the TMP shows that it promotes animproved understanding of run-time behavior andperformance measurements, to derive qualitativeand quantitative assessments of distributedsystems. 000529Monitoring and Measuring Parallel Systems Using a Non-Intrusive Rule-BasedSystemDieter Haban, Dieter Wybranietztr-89-030198903__000006This paper describes a tool for on-line monitoring ofdistributed systems and the evaluation of thecollected data. The hybrid monitor is capable ofpresenting the interactive user and the localoperating system with high-level information of thebehavior and the activities in the host system withminimal interferences. A special hardware support,which consists of a test and measurement processor(TMP), was designed and has been implemented in thenodes of an experimental multicomputer system. Theoperations of the TMP are completely transparent tousers with a minimal, less than 0.1 percent, overheadto the hardware system. To provide a global view of themonitored system, a central monitoring stationevaluates the locally collected data and displaysthe resulting information in charts and graphs. Arule-based evaluation system assists in improvingthe understanding of run-time behavior and in easilyassessing performance measurements. Flexibilityis achieved by rules given in tables which control theevaluation and the display of monitored andprocessed data. These rules represent expert-levelknowledge about the evaluation of distributedsystems. 000530One-Way Functions are Essential for Complexity Based Cryptography (Extended)Russell Impagliazzo, Michael Lubytr-89-031198905__000006In much of modern cryptography, the security of aprotocol is based on the intractability of a problemsuch as factorization of randomly chosen largenumbers. The problems assumed intractable all havethe same form; they are based on a one-way function,i.e. one that is easy to compute but hard to invert.This is not a coincidence. We show that for manycryptographic tasks any secure protocol for the taskcan be converted into a one-way function, and thus anyproposed protocol for these tasks is implicitlybased on a one-way function. Tasks examined here arechosen to cover a spectrum of cryptographicapplications: private-key encryption,identification/authentication, bit commitmentand coin-flipping by telephone. Thus, unlessone-way functions exist, secure protocols for thesetasks are impossible. 000531A Connectionist Model of UnificationAndreas Stolcketr-89-032198905__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1989/tr-89-032.pdfA general approach to encode and unify recursivelynested feature structures in connectionistnetworks is described. The unification algorithmimplemented by the net is based on iterativecoarsening of equivalence classes of graph nodes.This method allows the reformulation of unificationas a constraint satisfaction problem and enables theconnectionist implementation to take fulladvantage of the potential parallelism inherent inunification, resulting in sublinear timecomplexity. Moreover, the method is able to processany number of feature structures in parallel,searching for possible unifications and makingdecisions among mutually exclusive unificationswhere necessary.

Keywords: Unification, constraint satisfaction, connectionism, feature structures. 000532Merging Multilayer Perceptrons and Hidden Markov Models: Some Experiments inContinuous Speech RecognitionHerve Bourlard, Nelson Morgantr-89-033198905__000006The statistical and sequential nature of the humanspeech production system makes automatic speechrecognition difficult. Hidden Markov Models (HMM)have provided a good representation of thesecharacteristics of speech, and were a breakthroughin speech recognition research. However, the apriori choice of a model topology and weakdiscriminative power limit HMM capabilities.Recently, connectionist models have beenrecognized as an alternative tool. Their main usefulproperties are their discriminative power and theirability to capture input-output relationships.They have also proved useful in dealing withstatistical data. However, the sequentialcharacter of speech is difficult to handle withconnectionist models. We have used a classic form of aconnectionist system, the Multilayer Perceptron(MLP), for the recognition of continuous speech aspart of an HMM system. We show theoretically andexperimentally that the outputs of the MLPapproximate the probability distribution overoutput classes conditioned on the input (i.e., theMaximum a Posteriori (MAP) probabilities). We alsoreport the results of a series of speech recognitionexperiments. By using contextual information at theinput of the MLP, frame classification performancecan be achieved which is significantly improved overthe corresponding performance for simple MaximumLikelihood probabilities, or even MAPprobabilities without the benefit of context.

However, it was not so easy to improve the recognitionof words in continuous speech by the use of an MLP,although it was clear that the classification at theframe and phoneme levels was better than we achievedwith our HMM system. We present severalmodifications of the original methods that wererequired to achieve acceptable performance at theword level. Preliminary results are reported for a1000 word vocabulary, phoneme based,speaker-dependent continuous speech recognitionsystem embedding MLP into HMM. These results showequivalent recognition performance using eitherthe Maximum Likelihood or the outputs of an MLP toestimate emission probabilities of an HMM. 000533A Survey of Optical Fibers in CommunicationRamesh Govindan, Srinivasan Keshav, Dinesh C. Vermatr-89-034198905__000006In recent years there has been a major effort tointegrate fiber optic media into existingcommunication systems. In this survey, we outlinethe physics behind fiber optic media and opticalinterfaces. Different types of optical interfacesand optical media are considered and the advantagesand disadvantages of each are listed. We then discusstopologies and protocols suitable for opticalfibers in communication. We also take a detailed lookinto the new Fiber Distributed Data Interface (FDDI)Standard for fiber-optic token rings. Finally, welist off-the-shelf fiber networks available as of September 1988. 000534Conjectures on Representations in Backpropagation NetworksPaul W. Munrotr-89-035198905__000006The pros and cons of the backpropagation learningprocedure have been the subject of numerous debatesrecently. Some point out its promise as a powerfulinstrument for finding the weights in aconnectionist network appropriate to a givenproblem, and the generalizability of the solution tonovel patterns. Others claim that it is an algorithmfor fitting data to a function by error correctionthrough gradient descent. The arguments in thispaper focus on the latter (curve-fitting) point ofview, but take the point of view that the power of backpropagation comes from carefully choosing the formof the function to be fit. This amounts to choosing thearchitecture and the activation functions of theunits (nodes) in the net. A discussion of the role ofthese two network features motivates twoconjectures identifying the form of the squashingfunction as an important factor in the process. Somepreliminary simulations in support of theseconjectures are presented. 000535A Scheme for Real-Time Channel Establishment in Wide-Area NetworksDomenico Ferrari, Dinesh C. Vermatr-89-036198905__000006Multimedia communication involving digital audioand/or digital video has rather strict delayrequirements. A real-time channel is defined in thispaper as a simplex connection between a source and adestination characterized by parametersrepresenting the performance requirements of theclient. A real-time service is capable of creatingreal-time channels on demand and guaranteeing theirperformance. These guarantees often take the form ofdelay bounds that the service enforces in exchangefor offered load bounds specified and enforced by theclient.

In this paper, we study the feasibility ofproviding real-time services on a packet-switchedstore-and-forward wide-area network with generaltopology. We describe a scheme for the establishmentof channels with deterministic or statistical delaybounds, and present the results of the simulationexperiments we ran to evaluate it. The results areencouraging: our approach is correct (i.e.,satisfies the guarantees even in worst-casesituations), uses the network's resources to a fairextent, and efficiently handles channels with avariety of offered load and burstinesscharacteristics. The packet transmission overheadis quite low, whereas the channel establishmentoverhead may occasionally become too large; anapproximation method is therefore needed to reducethe latter overhead to an acceptable level even inthose cases. 000536A Tagging Method for Distributed Constraint SatisfactionHans Werner Guesgentr-89-037198906__000006Local propagation algorithms such as Waltz'filtering and Mackworth's AC-x algorithms have beensuccessfully applied in AI for solving constraintsatisfaction problems (CSPs). In general, thesealgorithms can only be used as preprocessing methodsas they do not compute a global consistent solutionfor a CSP; they result in local consistency also knownas arc consistency.

In this paper, we introduce anextension of local constraint propagation toovercome this drawback, i.e., to compute globalconsistent solutions for a CSP. The advantage overbacktracking approaches is that the methodintroduced here is easy to implement on parallelmachines with an arbitrary number of processors. Theunderlying idea is to associate recursive tags withthe values during the propagation process so thatglobal relationships among the values aremaintained. 000537Metric Constraint Satisfaction with IntervalsPeter B. Ladkintr-89-038198906__000006We show how algorithms in Dechter, Meiri and Pearl'srecent paper on constraint satisfaction techniquesfor metric information on time points [DeMePe89] maybe adapted to work directly with metric constraintson intervals. Inter alia we show termination ofpath-consistency algorithms if range intervals inthe problem contain only rational number endpoints. 000538Fast Parallel Algorithms for the Clique Separator DecompositionElias Dahlhaus, Marek Karpinski, Mark B. Novicktr-89-039198907__000006We give an efficient NC algorithm for finding a cliqueseparator decomposition of an arbitary graph, thatis, a series of cliques whose removal disconnects thegraph. This algorithm allows one to extend a largebody of results which were originally formulated forchordal graphs to other classes of graphs. Ouralgorithm is optimal to within a polyalgorithmicfactor of Tarjan's O(nm) time sequential algorithm.The decomposition can also be used to find NCalgorithms for some optimization problems onspecial families of graphs, assuming these problemscan be solved in NC for the prime graphs of thedecomposition. These optimization problemsinclude: finding a maximum weight clique, a minimumcoloring, a maximum-weight independent set, and aminimum fill-in elimination order. We also give thefirst parallel algorithms for solving theseproblems by using the clique separatordecomposition. Our maximum independent setalgorithm applied to chordal graphs yields the mostefficient known parallel algorithm for finding amaximum-weight independent set of a chordal graph. 000539The Possibility of an Executable Specification LanguagePeter B. Ladkintr-89-040198907__000006We consider what it takes to build an executablespecification language for concurrent systems. Thekey ingredients are executability andvery-high-level specification. Many researchershave concluded that one can't have both in anyreasonable way. We consider a number of criteria foran executable specification language. We concludethat it is possible to build such a language, and thusthat executability should be a criterion forevaluating any specification language forconcurrent systems. 000540Geometric Learning Algorithms;Stephen M. Omohundrotr-89-041198906__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1989/tr-89-041.pdfEmergent computation in the form of geometriclearning is central to the development of motor andperceptual systems in biological organisms andpromises to have a similar impact on emergingtechnologies including robotics, vision, speech,and graphics. This paper examines some of thetrade-offs involved in different implementationstrategies, focussing on the tasks of learningdiscrete classifications and smooth nonlinearmappings. The trade-offs between local and globalrepresentations are discussed, a spectrum ofdistributed network implementations are examined,and an important source of computationalinefficiency is identified. Efficient algorithmsbased on k-d trees and the Delaunay triangulation arepresented and the relevance to biological networksis discussed. Finally, extensions of both the tasksand the implementations are given.

Keywords: learning algorithms, neural networks, computational geometry, emergent computation, robotics. 000541Optimal Parallel Algorithm for the Hamiltonian Cycle Problem on Dense GraphsElias Dahlhaus, Peter Hajnal, Marek Karpinskitr-89-042198906__000006Dirac's classical theorem asserts that, if everyvertex of a graph G on n vertices has degree at leastn/2, then G has a Hamiltonian cycle. We give a fastparallel algorithm on a CREW-PRAM to find aHamiltonian cycle in such graphs. Our algorithm usesa linear number of processors and is optimal up to apolylogarithmic factor. The algorithm works inO((log (superscript 4)) n) parallel time and useslinear number of processors on a CREW-PRAM. Ourmethod bears some resemblance to Anderson's RNCalgorithm [An] for maximal paths: we, too, start froma system of disjoint paths and try to glue themtogether. We are, however, able to perform the basestep (perfect matching) deterministically. We alsoprove that a perfect matching in dense graphs can befound in NC(superscript 2). The cost of improved timeis a quadratic number of processors.

On the negativeside, we prove that finding an NC algorithm forperfect matching in slightly less dense graphs (1/2 -epsilon) |V| is as hard as the same problem for allgraphs, and interestingly the problem of finding aHamiltonian cycle becomes NP-complete. 000542Parallel Asynchronous Connected Components in a MeshSusan Hambrusch, Michael Lubytr-89-043198907__000006Levialdi [6] introduced a parallel synchronousalgorithm for counting the number of connectedcomponents in a binary image embedded in an n x n mesh ofprocessors that runs in time O(n). We describe aparallel asynchronous algorithm for the sameproblem achieving the same time 000543Removing Randomness in Parallel Computation Without a Processor PenaltyMichael Lubytr-89-044198907__000006We develop some general techniques for convertingrandomized parallel algorithms into deterministicparallel algorithms without a blowup in the number ofprocessors. One of the requirements for theapplication of these techniques is that the analysisof the randomized algorithm uses only pairwiseindependence. Our main new result is a parallelalgorithm for coloring the vertices of an undirectedgraph using at most delta + 1 distinct colors in such away that no two adjacent vertices receive the samecolor, where delta is the maximum degree of any vertexin the graph. The running time of the algorithm isO((log (superscript 3)) n log log n) using a linearnumber of processors on a concurrent read, exclusivewrite (CREW) parallel random access machine (PRAM).Our techniques also apply to several other problems,including the maximal independent set problem andthe maximal matching problem. The application of thegeneral technique to these last two problems ismostly of academic interest because parallelalgorithms that use a linear number of processorswhich have better running times have been previouslyfound [Israeli, Siloach86], [Goldberbg, Spencer87]. 000544Parallel Path-Consistency Algorithms for Constraint SatisfactionPeter B. Ladkin, Roger D. Madduxtr-89-045198908__000006This paper concerns heuristic algorithms used forsolution of Boolean Constraint SatisfactionProblems, or CSPs [Mon74, Mac77, Fre78, Mac87]. CSPsoccur particularly in areas of artificialintelligence such as vision, temporal reasoning,and truth-maintenance systems. The most common forminvolves binary constraints and we considerproperties of binary CSPs only (we shall omit theadjective from now on). CSPs may be represented bylabeled digraphs called binary constraintnetworks, or BCNs. Many constraint satisfactiontechniques operate upon BCNs. An important propertyof BCNs is that of path-consistency, which is usedextensively as a heuristic for solving CSPs (manyclasses of CSPs are NP-hard, e.g. [VilKau86]).nEvery BCN has a path-consistent reduction, and it isknown that algorithms for computing it are serial O(nsuperscript 3) in the number of variables [Mac77,Fre78, All83, MacFre85, MohHen86].

We haveformulated CSPs and path-consistency computationsin the framework of Tarski's relation algebra, andgive a brief overview below [Tar41, LadMad88.2]. Wegive a parallel O((n superscript 2) log n) algorithmfor achieving path-consistency. We also give a classof hard examples on which all algorithms proposed sofar, and possible parallelisations of them, taketime 0(n superscript 2). This effectivelyconstrains parallel path- consistency algorithmsof the most common form (which we glorify with the nameof reduction-type) within a fairly narrowasymptotic range.

In the next section, we introducethe relation-algebraic formulation of CSPs. Weformulate some algorithms in the following section,ending with the O((n superscript 2) log n) parallelpath-consistency algorithm. In the final section,we describe the class of problems on which thereduction-type algorithms take 0(n superscript 2)time. 000545On Zero-Testing and Interpolation of k-Sparse Multivariate Polynomials overFinite FieldsMichael Clausen, Andreas Dress, Johannes Grabmeier, Marek Karpinskitr-89-046198907__000006Given a black box which will produce the value of ak-sparse multivariate polynomial for any givenspecific argument, one may ask for optimalstrategies (1) to distinguish such a polynomial fromthe zero-polynomial, (2) to distinguish any two suchpolynomials from one other and (3) to (uniformly)reconstruct the polynomial from such an informationsource. While such strategies are known already forpolynomials over fields of characteristic zero, theequally important, but considerably morecomplicated case of a finite field K of smallcharacteristic is studied in the present paper. Theresult is that the time complexity of such strategiesdepends critically on the degree m of the extensionfield of K from which the arguments are to be chosen;e.g., if m equals the number n of variables, then (1)can be solved by k+1 and (2) as well as (3) by 2k+1queries, while in case m=1 essentially 2(superscript log n log k) queries are needed. 000546The Transitive Closure of a Random Digraph;Richard M. Karptr-89-047198908__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1989/tr-89-047.pdfIn a random $n$-vertex digraph, each arc is presentwith probability $p$, independently of the presenceor absence of other arcs. We investigate thestructure of the strong components of a randomdigraph and present an algorithm for theconstruction of the transitive closure of a randomdigraph. We show that, when $n$ is large and $np$ isequal to a constant $c$ greater than 1, it is verylikely that all but one of the strong components arevery small, and that the unique large strongcomponent contains about $ size 9 {\(*H} sup 2 n$vertices, where $ size 9 {\(*H}$ is the unique root in$[0,1]$ of the equation $1~-~x~-~e sup -cx ~=~0$.Nearly all the vertices outside the large strongcomponent lie in strong components of size 1.Provided that the expected degree of a vertex isbounded away from 1, our transitive closurealgorithm runs in expected time $O(n)$. For allchoices of $n$ and $p$, the expected execution time ofthe algorithm is $O(w(n)~(n^ log ^n) sup 4/3 )$, where$w(n)$ is an arbitrary nondecreasing unboundedfunction. To circumvent the fact that the size of thetransitive closure may be $\(*W (n sup 2 )$ thealgorithm presents the transitive closure in thecompact form $(A~ times ~B)~\(cu~C$, where $A$ and$B$ are sets of vertices, and $C$ is a set of arcs. 000547Parallel Heuristics for the Steiner Tree Problem in Images without Sortingor RoutingSusanne Hambrusch, Lynn TeWinkeltr-89-048198908__000006In this paper we consider the problem of determining aminimum-cost rectilinear Steiner tree when theinput is an n X n binary array I which is stored in an n X nmesh of processors. We present several heuristicmesh algorithms for this NP-hard problem. A majordesign criteria of our algorithms is to avoid sortingand routing which are expensive operations inpractice. All of our algorithms have a O(n log k)running time, where k is the number of connectedcomponents formed by the entries of value `1'. Themain contribution of the paper are two conceptuallydifferent methods for connecting components in animage. 000548Spatial Reasoning Based on Allen's Temporal LogicHans Werner Guesgentr-89-049198907__000006

"If one were to categorize the behavior of theintelligent machine of the future, one might do so onthe basis of the machine's capabilities to carry outtemporal reasoning over interrelated entities thatchange with time; to carry out spatial reasoning forsolving problems dealing with entities occupyingspace; and, on a more complex level, to reason overinterrelated entities occupying space and changingin time with respect to their attributes and spatialinterrelationships." --Avi Kak [12]


There are a lot ofapproaches to spatial reasoning which are more orless efficient. Nevertheless, they are not alwaysadequate from the cognitive point of view. What wewant to suggest in this paper is reasoning based onqualitative descriptions of spatialrelationships. We introduce a set of basic relationssimilar to the one Allen suggested for temporalreasoning and we show how inferences can be performedon this set.

We start with one dimensionaldescriptions which we extend to more-dimensionalones in various ways. A theoretical base is providedand the soundness of our approach is proven. Althoughwe do not claim our approach to be suitable in general,it is an efficient and straightforward way in manysituations to handle spatial knowledge. 000549Learning Read-Once Formulas with QueriesDana Angluin, Lisa Hellerstein, Marek Karpinskitr-89-050198907__000006A read-once formula is a boolean formula in which eachvariable occurs at most once. Such formulas are alsocalled m-formulas or boolean trees. This papertreats the problem of exactly identifying an unknownread-once formula using specific kinds of queries.The main results are a polynomial time algorithm forexact identification of monotone read-onceformulas using only membership queries, and apolynomial time algorithm for exact identificationof general read-once formulas using equivalence andmembership queries (a protocol based on the notion ofa minimally adequate teacher[1]). Our resultsimprove on Valiant's previous results for read-onceformulas [18]. We also show that no polynomial timealgorithm using only membership queries or onlyequivalence queries can exactly identify allread-once formulas. 000550A Note on Computational IndistinguishabilityOded Goldreichtr-89-051198907__000006We show that the following two conditions areequivalent:

1) The existence of pseudorandomgenerators.

2) The existence of a pair of efficientlyconstructible distributions which arecomputationally indistinguishable butstatistically very different.

000551An Efficient Parallel Algorithm for the 3MIS ProblemElias Dahlhaus, Marek Karpinskitr-89-052198909__000006The paper considers the problem of computing amaximal independent set in hypergraphs (see [Karp,Ramachandran 88] and [Beame, Luby 89]). We present anefficient deterministic parallel algorithm for thecase when the maximal cardinality of any hyperedge is3. The algorithm works in O((log superscript 4) n)parallel time with O(n + m) processors on a CREW PRAMand is optimal up to a polylogarithmic factor. 000552Supporting Formal Program Developments: the DEVA EnvironmentStefan Jahnichen, Robert Gabriel, Matthias Weber, Matthias Anlaufftr-89-053198909__000006The project ToolUse aims at providing means foractive assistance in the design, implementation andevolution of software. This is achieved andsupported by a formal development language calledDeva. As Deva uses two-dimensional notations to getbetter structured and surveyable representationsof developments, and as different Devaimplementations have been used within the project,both internal and external integration play crucialroles in the project ToolUse. The paper shortlyintroduces the language DEVA, sketches one of itsimplementations, and discusses both kinds ofintegration. 000553Fast Evaluation of Boolean Formulas by CREW-PRAMsRudiger Reischuktr-89-054198909__000006We extend the result of Cook, Dwork and Reischuk[CDR86] that a CREW-PRAM with a linear number ofprocessors can computer the or of n bits in less thanlog(subscript 2)n time to arbitrary Booleanformulas of logarithmic depth. Furthermore amatching lower bound for the or shown by Kutylowski[K89] is generalized to probabilistic andnondeterministic computations. 000554On the Theory of Average Case Complexity (Revised Edition)Shai Ben-David, Benny Chor, Oded Goldreich, Michael Lubytr-89-055198909__000006This paper takes the next step in developing thetheory of average case complexity initiated byLeonid A. Levin. Previous works [Levin 84, Gurevich87, Venkatesan and Levin 88] have focused on theexistence of complete problems. We widen the scope toother basic questions in computational complexity.Our results include: 000555Fast Establishment of Real-Time ChannelsSpiridon Damaskos, Dinesh C. Vermatr-89-056198910__000006A real-time channel [Fer89a] is a simplex connectionbetwen two nodes characterized by parametersrepresenting the performance requirements of theclient. In this paper, we consider fastestablishment of real-time channels, i.e., data canbe sent on a real-time channel without waiting for aconnection establishment to be confirmed by thedestination. 000556Multiplexing Real-Time ChannelsSpiridon Damaskos, Dinesh C. Vermatr-89-057198910__000006A real-time channel is a simplex connection betwentwo nodes characterized by parameters representingthe performance requirements of the client. Such aconnection may be established through the schemedescribed in [Fer89a]. In this paper, we study thefeasibility of multiplexing real-time channels on alower-layer real-time channel. Sufficientconditions for multiplexing channels are obtainedas an extension of the establishment algorithm.

Theextension is based on two observations: (1) areal-time channel can be looked upon as a network withbounded delays connecting the multiplexing point (avirtual source) to the demultiplexing point (avirtual destination); and the parameters of thephysical channel can be used to define the servicetime at the virtual source and sink. Multiplexing isnothing but channel establishment over thisnetwork. By a judicious definition of the parameterspecifying service times, it is possible to makemultiplexing decisions at the multiplexing point(source) without consulting the destination, whichis merely informed about the new multiplexedchannel. 000557Controlled Gradual Disclosure Schemes for Random Bits and Their ApplicationsRichard Clevetr-89-058198910__000006We construct a protocol that enables a secret bit to berevealed gradually in a very controlled manner. Inparticular, if Alice possesses a bit S that wasgenerated randomly according to the uniformdistribution and 1/2 < p(subscript 1) < ... <p(subscript m) = 1 then, using our protocol with Bob,Alice can achieve the following. The protocolconsists of m stages and after the i-th stage, Bob'sbest prediction of S, based on all his interactionswith Alice, is correct with probability exactlyp(subscript i) (and a reasonable condition issatisfied in the case where S is not initiallyuniform). Furthermore, under an intractabililityassumption, our protocol can be made "oblivious" toAlice and "secure" against an Alice or Bob that mighttry to cheat in various ways. Previous proposedgradual disclosure schemes for single bits releaseinformation in a less controlled manner: theprobabilities that represent Bob's confidence ofhis knowledge of S follow a random walk thateventually drifts towards 1, rather that apredetermined sequence of values.

Using controlledgradual disclosure schemes, we show how to constructan improved version of the protocol proposed by Luby,Micali and Rackoff for two-party secret bitexchanging ("How to Simultaneously Exchange aSecret Bit by Flipping a Symmetrically-BiasedCoin," Proc. 22nd Ann. IEEE Symp. on Foundations ofComputer Science, 1983, pp. 11-21) that is secureagainst additional kinds of attacks that theprevious protocol is not secure against. Also, ourprotocol is more efficient in the number of roundsthat it requires to attain a given level of security,and is proven to be asymptotically optimal in thisrespect.

We also show how to use controlled gradualdisclosure schemes to improve existing protocolsfor other cryptographic problems, such asmulti-party function evaluation. 000558Accessing and Customizing Services in Distributed SystemsRalf Guido Herrtwich, Uwe Wolfgang Brandenburgtr-89-059198910__000006In a distributed system, entities access servicesprovided to them by other entities at remote sites.While it may be unimportant to the service users whichentities act as service providers, they often haveother requirements on the services they use. On theother hand, service providers only have certainpossibilities. Both the requirements andpossibilities can be described by means ofquality-of-service parameters (QOSPs), which haveto be determined for each service session. In thispaper we design a session establishment service(SES) which takes QOSP values into account. The SEScan be used for any kind of QOSPs since it uses badnessspecifications as a uniform means to identify theusefulness of a certain QOSP value to a service user,to determine the relative importance of singleQOSPs, and to calculate the overall quality of aservice. Three kinds of QOSPs are distinguished:Static parameters do not change as long as the serviceis available, dynamic parameters depend on thecurrent state of a service provider, andretrospective parameters result from evaluationsof the service which are obtained from previousservice users. While some QOSP values are availableothers can only be accomplished if the serviceprovider schedules its resources appropriately.The reservation of resources can be integratedwithin the SES. This is especially important forreal-time services. 000559VC Dimension and Learnability of Sparse Polynomials and Rational FunctionsMarek Karpinski, Thorsten Werthertr-89-060198911__000006We prove upper and lower bounds on the VC dimension ofsparse univariate polynomials over reals, and applythese results to prove uniform learnability ofsparse polynomials and rational functions. Asanother application we solve an open problem ofVapnik [Vapnik 82] on uniform approximation of thegeneral regression functions, a central problem ofcomputational statistics (cf. [Vapnik 82], p. 256). 000560On Space-bounded Learning and the Vapnik-Chervonenkis Dimension (Thesis)Sally Floydtr-89-061198912__000006This thesis explores algorithms that learn a conceptfrom a concept class of Vapnik-Chervonenkis (VC)dimension d by saving at most d examples at a time. Theframework is the model of probably approximatelycorrect (pac) learning introduced by Valiant [V84].A maximum concept class of VC dimension d is defined.For a maximum class C of VC dimension d, we give analgorithm for representing a finite set of positiveand negative examples of a concept by a subset of dlabeled examples of that set. This data compressionscheme of size d is used to construct a space-boundedalgorithm called the iterative compressionalgorithm that learns a concept from the class C bysaving at most d examples at a time. These d examplesrepresent the current hypothesis of the learningalorithm. A space-bounded algorithm is calledacyclic if a hypothesis that has been rejected asincorrect is never reinstated. We give a sufficientcondition for the iterative compression algorithmto be acyclic on a maximum class C. Classes for whichthe iterative compression algorithm is acyclicinclude positve half-spaces in Euclidean spaceE(superscript n), balls in E(superscript n), andarbitrary rectangles and triangles in the plane. Theiterative compression algorithm can be thought of aslearning a boundary between the positive and thenegative examples. 000561The Asynchronous PRAM: A Semi-Synchronous Model for Shared Memory MIMD Machines (Thesis)Phillip Baldwin Gibbonstr-89-062198912__000006This thesis introduces the Asynchronous PRAM modelof computation, of the design and analysis ofalgorithms that are suitable for large parallelmachines in which processors communicate via adistributed, shared memory. The Asynchronous PRAMis a variant of the well-studied PRAM model whichdiffers from the PRAM in two important respects: (i)the processors run asynchronously and there is anexplicit charge for synchronization, and (ii) thereis a non-unit time cost to access the shared memory.

Many new algorithms are presented for theAsynchronous PRAM model. We modify a number of PRAMalgorithms for improved asymptotic time andprocessor complexity in the Asynchronous PRAM. Weshow general classes of problems for which the timecomplexity can be improved by restructuring thecomputation. We prove lower bounds that reflectlimitation on information flow and load balancing inthis model. Simulation results between theAsynchronous PRAM and various known synchronousmodels are presented as well.

We introduce a postoffice gossip game for studying the inherentsynchronization complexity of coordinatingprocessors using pairwise synchronizationprimitives. Results are presented that compare therelative power of various such primitives. Theseresults and techniques are used to reduce the amountof synchronization in Asynchronous PRAMalgorithms.

Furthermore, we discuss a programmingmodel based on the Asynchronous PRAM. We introducethe notion of a semi-synchronous programming model,a model for repeatable asynchronous programs.Repeatable programs, in which the output and allintermediate results are the same every time theprogram is run on a particular input, greatlysimplify the tasks of writing, debugging,analyzing, and testing programs.

Finally, wediscuss hardware support for the Asynchronous PRAMmodel. In particular, we present a cache protocolsuitable for the Asynchronous PRAM and a newtechnique for barrier synchronous PRAM and a newtechnique for barrier synchronization. 000562Five Balltree Construction Algorithms;Stephen M. Omohundrotr-89-063198912__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1989/tr-89-063.pdfBalltrees are simple geometric data structures witha wide range of practical applications to geometriclearning tasks. In this report we compare 5 differentalgorithms for constructing balltrees from data. Westudy the tradeoff between construction time and thequality of the constructed tree. Two of thealgorithms are on-line, two construct thestructures from the data set in a top down fashion, andone uses a bottom up approach. 000563Program Checkers for Algebraic Problems (Thesis)Sampath Kanantr-89-064198902__000006In this thesis we explore a model of ensuring thecorrectness of results produced by programs. Thismodel called program checking is distinct from thetwo methods in the literature -- testing andverification. Testing does not providemathematical guarantees on the correctness ofcomputation. Verification requires going into theinner workings of a program to determine itscorrectness, and is infeasible to implement for allbut very simple programs.

Program checking treatsthe program as a black box. In the checking scenariothe program is run on the desired input and the outputis checked by a program checker. The checker isallowed to make other calls to the program to ensurethe correctness of the original computation withvery high probability. The theory of programchecking draws heavily from the theory ofinteractive proof systems and probabilisticalgorithms, but the model is intended to be verypractical as well.

Our focus in this thesis is onprogram checkers for algebraic problems. Theunifying theme amongst such problems is the conceptof random self-reducibility. A function f israndomly self-reducible if the computation of f(x)for any x can be reduced to the computation of several"randomly chosen" inputs. For most of the algebraicproblems considered in this thesis the checkers usethe fact that the problem is at least partiallyself-reducible. This allows us to construct sets ofinstances whose answers are related. Verifyingconsistency of the program's answers on theseinstances allows us to design checkers for problemsin linear algebra such as rank and determinant and forproblems such as graph isomorphism and groupintersection.

We also study the connection betweeninteractive proofs and program checking. Using thetwo step approach of designing an interactive proofand converting it into a checker, we design a checkerfor group intersection. We construct bounded roundinteractive proofs for a few other problemsincluding the problem of permutation groupnon-isomorphism. This interactive proof usesinteresting consequences of the classification offinite simple groups.

Finally we consider the notionof random self-reducibility in its own right andobtain negative results about the randomself-reducibility of certain functions. 000564Lectures on a Theory of Computation and Complexity over the Reals (or anArbitrary Ring)Lenore Blumtr-89-065198912__000006These lectures discuss a new theory of computationand complexity which attempts to integrate key ideasfrom the classical theory in a setting more amenableto problems defined over continuous domains. Thegoal is to develop theoretical foundations for atheory of computational complexity for numericalanalysis and scientific computation that mightembody some of the naturalness and strengths of theclassical theory.

We highlight key aspects of the newtheory as well as to give exposition, in this setting,of classical ideas and results. Indeed, one of ourthemes will be the comparison of results over theintegers with results over the reals and complexnumbers. Contrasting one theory with the other willhelp illuminate each, and give deeper understandingto such basic concepts as decidability,definability, computability and complexity. 000565The Delaunay Triangulation and Function Learning;Stephen M. Omohundrotr-90-001199001__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1990/tr-90-001.pdfIn this report we consider the use of the Delaunaytriangulation for learning smooth nonlinearfunctions with bounded second derivatives from setsof random input output pairs. We show that ifinterpolation is implemented by piecewise-linearapproximation over a triangulation of the inputsamples, then the Delaunay triangulation has asmaller worst case error at each point than any othertriangulation. The argument is based on a niceconnection between the Delaunay criterion andquadratic error functions. The argument also allowsus to give bounds on the average number of samplesneeded for a given level of approximation. 000566Speech Segmentation and Labeling on the NeXT MachineChuck Wooters, Nelson Morgantr-90-002199001__000006We are attempting to incorporate connectionistmodels into speech recognition algorithms. Sincethese models require a large amount of training data,it was necessary to build an automated speechlabeling/segmentation application. There were twosignificant system requirements for this program:

The NeXT machine fulfills both ofthese requirements. It has built in AD/DAcapabilities. Its object-oriented programmingenvironment and application-building modulespermit quick program development.

We report here on aprogram we have developed to integrate automaticlabeling and segmentation of continuous speech witha manual system for observing and correcting thesesignal annotations. The overall system hasfunctioned well enough to permit easy user marking of600 sentences in a reasonable amount of time. 000567Considerations for the Electronic Implementation of Artificial NeuralNetworksNelson Morgantr-90-003199001__000006Computer scientists and designers have long beeninterested in comparisons between artificialautomata and the human brain [Von Neumann, 1957].Mental activity is often characterized as the resultof the parallel operation of large numbers of neurons(~10 superscript 11 for the human brain). Neuronsinteract electrochemically on a time scale ofmilliseconds, and are jointly capable ofsignificant feats of pattern recognition (such asrecognizing a friend wearing an unusual costume).These commonplace human achievements are currentlyunattainable by large electronic computers builtfrom components with characteristic delays in thenanosecond range. Artificial Neural Network (ANN)researchers hope that simplified functional modelsof nervous tissue can help us to design algorithms andmachines that are better than conventionalcomputers for difficult problems in machineperception and intelligence.

However, engineeringconstraints for silicon implementations of thesesystems may suggest design choices which differ frommimicry of biology in significant ways. Inparticular, large silicon ANN systems may requiremultiplexing of communication AND CO andcomputation as a consequence of limitedconnectivity. This report discussesconsiderations such as these, and concludes with ashort description of an ongoing effort to designsilicon ANN building blocks using powerful CADtools. 000568On the Complexity of Genuinely Polynomial ComputationMarek Karpinski, Friedhelm Meyer auf der Heidetr-90-004199001__000006We present the separation results on genuinely (alsocalled strong) sequential, parallel, andnon-deterministic complexity classes for the set ofarithmetic RAM operations {+, -, *} and {+, -, DIVsubscript c}. In particular, we separatenon-uniform polynomial time from non-uniformparallel polynomial time for the set of operations{+, -, *}, answering a question posed in [Meyer auf derHeide 88]. 000569Interpolation of Sparse Rational Functions Without Knowing Bounds onExponentsDima Y. Grigoriev, Marek Karpinski, Michael F. Singertr-90-005199001__000006We present the first algorithm for the (black box)interpolation of t-sparse rational functionswithout knowing bounds on exponents of their sparserepresentations. 000570A Resource Reservation Protocol for Guaranteed-Performance Communication inthe InternetDavid P. Anderson, Ralf Guido Herrtwich, Carl Schaefertr-90-006199002__000006This report describes the Session Reservationprotocol (SRP). SRP is defined in the DARPA Internetfamily of protocols. It allows communicating peerentities to reserve the resources (CPU and networkbandwidth) necessary to achieve given performanceobjectives (delay and throughput). The immediategoal of SRP is to support continuous media (digitalaudio and video) in IP-based distributed systems.However, it is applicable to any application thatrequires guaranteed-performance networkcommunication.

The design goals of SRP include:independence from transport protocols (SRP can beused with standard protocols such as TCP or with newreal-time protocols); compatibility with IP(packets are not modified); and that a hostimplementing SRP can benefit from its use even whencommunicating with hosts not supporting SRP.

SRP isbased on a workload and scheduling model called theDASH resource model. This model defines aparameterization of client workload, an abstractinterface for hardware resources, and an end-to-endalgorithm for negotiated resource reservationbased on cost minimization. SRP implements thisend-to-end algorithm, handling those resourcesrelated to network communication. 000571Client Requirements for Real-Time Communication ServicesDomenico Ferraritr-90-007199003__000006A real-time communication service provides itsclients with the ability to specify theirperformance requirements and to obtain guaranteesabout the satisfaction of those requirements. Inthis paper, we propose a set of performancespecifications that seem appropriate for suchservices; they include various types of delaybounds, throughput bounds, and reliability bounds.We also describe other requirements and desirableproperties from a client's viewpoint, and the ways inwhich each requirement is to be translated to make itsuitable for lower levels in the protocol hierarchy.Finally, we present examples of requirementsspecification, and discuss some of the possibleobjections to our approach. 000572An Algebraic Approach to General Boolean Constraint ProblemsHans W. Guesgen, Peter B. Ladkintr-90-008199003__000006We consider an algebraic approach to the statementand solution of general Boolean constraintsatisfaction problems (CSPs). Our approach is toconsider partial valuations of a constraint network(including the relational constraints themselves)as sets of partial functions, with the operators ofjoin and projection. We formulate all the usualconcepts of CSPs in this framework, includingk-consistency, derived constraints, andbacktrack-freeness, and formulate an algorithmscheme for k-consistency which has thepath-consistency scheme in [LadMad88.2] as aspecial case. This algebra may be embedded in thecylindric algebra of Tarski [HeMoTa71, 85], via theembedding of [ImiLip84], and a connection withrelational database operations. CSPs are shown tocorrespond to conjunctive queries in relationaldatabase theory, and we formulate a notion ofequivalence of CSPs with hidden variables,following [ChaMer76, Ull80], and show that testingequivalence is NP-hard. 000573Miniature Language Acquisition: A touchstone for cognitive science;Jerome A. Feldman, George Lakoff, Andreas Stolcke, Susan Hollbach Webertr-90-009199003__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1990/tr-90-009.pdfCognitive Science, whose genesis was interdisciplinary, shows signs ofreverting to a disjoint collection of fields. This paper presents a compact,theory-free task that inherently requires an integrated solution. The basicproblem is learning a subset of an arbitrary natural language frompicture-sentence pairs. We describe a very specific instance of this taskand show how it presents fundamental (but not impossible) challenges toseveral areas of cognitive science including vision, language, inference andlearning. 000574L0: A Testbed for Miniature Language Acquisition;Susan Hollbach Weber, Andreas Stolcketr-90-010199005__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1990/tr-90-010.pdfL0 constitutes a recent effort in Cognitive Scienceto build a natural language acquisition system for alimited visual domain. As a preparatory step towardsaddressing the issue of learning in this domain, wehave built a set of tools for rapid prototyping andexperimentation in the areas of languageprocessing, image processing, and knowledgerepresentation. The special focus of our work was theintegration of these different components into aflexible system which would allow us to betterunderstand the domain given by L0 and experiment withalternative approaches to the problems it poses. 000575A Network for Extracting the Locations of Point Clusters Using SelectiveAttention;Subutai Ahmad, Stephen Omohundrotr-90-011199005__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1990/tr-90-011.pdfThis report explores the problem of dynamicallycomputing visual relations in connectionistsystems. It concentrates on the task of learningwhether three clumps of points in a 256x256 image forman equilateral triangle. We argue that feed-forwardnetworks for solving this task would not scale well toimages of this size. One reason for this is that localinformation does not contribute to the solution: itis necessary to compute relational information suchas the distances between points. Our solutionimplements a mechanism for dynamically extractingthe locations of the point clusters. It consists of anefficient focus of attention mechanism and a clusterdetection scheme. The focus of attention mechanismallows the system to select any circular portion ofthe image in constant time. The cluster detectordirects the focus of attention to clusters in theimage. These two mechanisms are used to sequentiallyextract the relevant coordinates. With this newrepresentation (locations of the points) very fewtraining examples are required to learn the correctfunction. The resulting network is also verycompact: the number of required weights isproportional to the number of input pixels. 000576A Connectionist Unification AlgorithmSteffen Hoelldoblertr-90-012199003__000006Unification plays an important role in many areas ofcomputer science, mathematical logic, andartificial intelligence. It is also at the heart ofconnectionist models concerned with knowledgerepresentation and inference. However, most ofthese models are severly restricted by theirpropositional fixation as they are defined over afinite set of constants and predicates. Thisrestriction is caused by the inability to unify termsbuilt from function symbols, constants andvariables. In this paper a connectionistunification algorithm is presented. It utilizes thefact that the most general unifier of two termscorresponds to a finest valid equivalence relationdefined on a occurrence-label representation of theunification problem. The algorithm exploits themaximal parallelism inherent in the computation ofsuch a finest valid equivalence relation while usingonly computational features of connectionism. Itcan easily be restricted to solve special forms of theunification problem such as the word problem, thematching problem, or the unification problem overinfinite trees. 000577Towards Optimal Simulations of Formulas by Bounded-Width ProgramsRichard Clevetr-90-013199003__000006We show that, over an arbitrary ring, for any fixedepsilon > 0, all balanced algebraic formulas of size sare computed by algebraic straight-line programsthat employ a constant number of registers and havelength O (s superscript(1+epsilon)). Inparticular, in the special case where the ring isGF(2), we obtain a technique for simulating balancedBoolean formulas of size s by bounded-widthbranching programs of length O(ssuperscript(1+epsilon)), for any fixed epsilon > 0.This is an asymptotic improvement in efficiency overprevious simulations in both the Boolean andalgebraic setting. 000578Dynamic ConstraintsHans Werner Guesgen, Joachim Hertzbergtr-90-014199004__000006Usually, a constraint describes a relation onvariables, and networks of constraints are obtainedby sharing variables among constraints.Manipulating a constraint or a constraint networkmeans manipulating the variables until a consistentassignment is found. There are, however, deviationsfrom this classical view, e.g., manipulating theconstraints themselves to make the computation ofconsistent assignments more efficient, or relaxingconstraints to make an overspecified constraintproblem solvable.

In this paper, we present aformalism that subsumes classical constraintsatisfaction, constraint manipulation, andconstraint relaxation. The idea is that theconstraints in a network are not static but that theirrelations can and must be manipulated and thatmanipulating relations subsumes manipulatingvariable values. We clarify the relation betweenclassical constraint networks and the newlydeveloped dynamical ones; we prove terminationproperties of dynamic constraint networks in thespecial case of filtering; and we show by examples howto express constraint manipulation and constraintrelaxation in the new formalism. 000579Learning Feature-based Semantics with Simple Recurrent Networks;Andreas Stolcketr-90-015199004__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1990/tr-90-015.pdfThe paper investigates the possibilities for usingsimple recurrent networks as transducers which mapsequential natural language input intonon-sequential feature-based semantics. Thenetworks perform well on sentences containing asingle main predicate (encoded by transitive verbsor prepositions) applied to multiple-featureobjects (encoded as noun-phrases with adjectivalmodifiers), and shows robustness againstungrammatical inputs. A second set of experimentsdeals with sentences containing embeddedstructures. Here the network is able to processmultiple levels of sentence-final embeddings butonly one level of center-embedding. This turns out tobe a consequence of the network's inability to retaininformation that is not reflected in the outputs overintermediate phases of processing. Two extensionsto Elman's \shortcite{Elman:88} originalrecurrent network architecture are introduced. 000580Temporal Reasoning Based on Semi-Intervals (Revised Version)Christian Freksatr-90-016199004__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1990/tr-90-016.pdfA generalization of Allen's interval-basedapproach to temporal reasoning is presented. Thescope of reasoning capabilities can be considerablyextended by using relations between semi-intervalsrather than intervals as the basic units ofknowledge. Semi-intervals correspond tobeginnings or endings of temporal events. We developa representational framework in which relationsbetween semi-intervals appear as coarse knowledgein comparison with relations between intervals. Wedemonstrate the advantages of reasoning on the basisof semi-intervals: 1) coarse knowledge can beprocessed directly; computational effort is saved;2) incomplete knowledge about temporal intervalscan be fully exploited; 3) incomplete inferencesmade on the basis of complete knowledge can be useddirectly for further inference steps; 4) there is notrade-off in computational strength for the addedflexibility and efficiency; 5) semi-intervalscorrespond to natural entities both from a cognitiveand from a computational point of view. The presentedscheme supports reasoning on the basis offine-grained or complete knowledge, on the basis ofcoarse or incomplete knowledge, and on combinationsof both kinds of knowledge. The notion of `conceptualneighborhood' is central to the presented approach.Besides enhancing the reasoning capabilities inseveral directions, this notion allows for a drasticcompaction of the knowledge base underlying Allen'sinference scheme. A connection to fuzzy reasoning onthe basis of `conceptual neighborhood' is drawn. Itis suggested that reasoning based on the simplifiedknowledge base may be particularly suited for theimplementation of parallel inference engines.

[Revised version was published as:
Freksa C, Temporal reasoning based on semi-intervals, ArtificialIntelligence 54 (1992) 199-227.] 000581Time Dated Streams in Continuous-Media SystemsRalf Guido Herrtwichtr-90-017199005__000006Data in continuous-media systems, such as digitalaudio and video, has time parameters associated withit that determine its processing and display. Wepresent the "time capsule" abstraction to describehow timed data shall be stored, exchanged, andaccessed in a real-time system. When data is writteninto a time capsule, a time stamp and a duration areassociated with the data item. When it is read, a timestamp is used to select the data item. The time capsuleabstraction includes the notion of "clocks" thatensure periodic data access that is typical forcontinuous-media applications. By modifying theparameters of a clock, effects such as time lapses orslow motion can be achieved. 000582A Connectionist Approach to Symbolic Constraint SatisfactionHans Werner Guesgentr-90-018199004__000006Algorithms for solving constraint satisfactionproblems, i.e., for finding one, several, or allsolutions for a set of constraints on a set ofvariables, have been introduced in a variety ofpapers in the area of Artificial Intelligence. Here,we illustrate how a connectionist network forconstraint satisfaction can be implemented.

Theidea is to use a connectionist node for each value ofeach variable and for each tuple of each constraint ofthe constraint satisfaction problem, and to connectthem according to the way in which the constraints arerelated to the variables. Goedel numbers are used aspotentials of the nodes that correspond tovariables, representing possible paths ofsolutions. 000583Applications of Topology to Lower Bound Estimates in Computer ScienceMichael D. Hirschtr-90-019199005__000006This research explores the relationship betweentopology and computer science by analyzing simpleproblems in which the role played by topology iscrucial, yet which can be approached usingtechniques that are not too esoteric. The goal is todevelop a set of topological tools which can then beapplied to other, more central, problems incomplexity theory.

We define the concepts of "aproblem" and "problem reduction" in computerscience in such a way as to make the techniques of pointset and algebraic topology applicable. FollowingSmale, we define "topological complexity" as theminimal number of branch nodes in an algebraiccomputation tree and relate it to the Schwartz genusof a map.

We introduce a new problem, the new pointproblem (NPP), and calculate its topologicalcomplexity for a variety of spaces. NPP has manyvariations. The most realistic and applicableversion is the following. Given a list of n distinctpoints in a metric space X with a known lower bounddelta for the distance between any two points, what isthe topological complexity of finding a new point ysuch that delta is still a lower bound for the distancebetween any two points.

We prove:

Theorem
Thetopological complexity of the above problem on theinterval [0,1], with n sufficiently small, is n.
Inthe final chapter, we show how to use the definition of"a problem" to get lower bounds on the non-linearcomplexity of many problems in Computer Science thatare slightly better than previous lower bounds. 000584Prototyping and Analysis of Non-Sequential Systems Using Predicate-EventNetsHeinz W. Schmidttr-90-020199005__000006The specific language SEGRAS is centered onPredicate-Event nets (PrE-nets), a class of Petrinets whose data and behavioral invariants aredefined using algebraic specification. This paperfocuses on the analysis methods we have developed forthese nets in the ESPRIT-project GRASPIN.

PrE-netsinherit from the algebraic theory of abstractdatatypes and from net-theory. From the side ofalgebraic specification notions like the modulardecomposition, initial models or consistency andcompleteness carry over to PrE-nets and preservetheir standard semantics. These notions are relatedto the static semantics and the invariants of thedynamic behavior of a non-sequential system. Fromthe net-theoretic side theorems and methods foranalysis of behavioral properties are applicable toPrE-nets in a straightforward way. Here we considerin particular net transformations anddecomposition methods. 000585Structure and Schedulingin Real-Time Protocol ImplementationsDavid P. Anderson, Luca Delgrossi, Ralf G. Herrtwichtr-90-021199006__000006Real-time network communication involves 1) theunderlying network and its contention mechanism, 2)the design of transport protocols, 3) the schedulingof CPU and network interface devices, and 4) theprocess/interrupt structure of protocolimplementations. This paper is concerned with 3) and4), in the context of network communication ofdigital audio and video data.

We describe the issuesand design alternatives for CPU and networkinterface scheduling in the sending host, and CPUscheduling for protocol processing in the receivinghost. We discuss how the proposed policies can beincorporated in existing operating systems such asUNIX. Our discussion is based on the "DASH resourcemodel", a workload and scheduling model designed forreal-time communication. 000586Buffer Space Allocation for Real-Time Channels in a Packet-Switching NetworkDomenico Ferrari, Dinesh C. Vermatr-90-022199006__000006Broadband integrated networks will have to offerreal-time communication services; that is, theywill have to transport information with performanceguarantees. A paper previously published by theauthors presented a scheme for establishingreal-time channels in a pure packet-switchingnetwork; that scheme did not include any method forallocating buffer space in the network's nodes to thechannels being established. This paper completesthe description and evaluation of that scheme, sinceit presents one such method, and some of the results ofthe extensive simulations performed to test it. Themethod is found to be correct and to have a lowoverhead. While the utilization of the buffer spaceallocated to the statistical channels is often quitelow, thereby indicating that our worst-caseapproach tends to overallocate space to thosechannels, the space our method gives todeterministic channels seems to be reasonably wellutilized. 000587On the Power of Randomization in Online Algorithms;S. Ben-David, A. Borodin, R. Karp, G. Tardos, A. Wigdersontr-90-023199006__000006Against an adaptive adversary, we show that the powerof randomization in online algorithms is severelylimited! We prove the existence of an efficient``simulation'' of randomized online algorithms bydeterministic ones, which is best possible ingeneral.

The proof of the upper bound is existential.We deal with the issue of computing the efficientdeterministic algorithm, and show that this ispossible in very general cases. 000588An Introduction to Randomized Algorithms;Richard M. Karptr-90-024199006__000006Research conducted over the past fifteen years hasamply demonstrated the advantages of algorithmsthat make random choices in the course of theirexecution. This paper presents a wide variety ofexamples intended to illustrate the range ofapplications of randomized algorithms, and thegeneral principles and approaches that are ofgreatest use in their construction. The examples aredrawn from many areas, including number theory,algebra, graph theory, pattern matching,selection, sorting, searching, computationalgeometry, combinatorial enumeration, and paralleland distributed computation. 000589Approximating the Number of Solutions of a GF[2] PolynomialMarek Karpinski, Michael Lubytr-90-025199007__000006We develop a polynomial time Monte-Carlo algorithmfor estimating the number of solutions to amultivariate polynomial over GF[2]. This gives thefirst efficient method for estimating the number ofpoints on algebraic varieties ove GF[2], which hasbeen recently proven to be #P-complete even for cubicpolynomials. There are a variety of applications ofour result, which will be discussed in the fullversion of the paper. 000590Audio and Video in Distributed Computer Systems: Why and How?Ralf Guido Herrtwichtr-90-026199007__000006Technological advances allow computer systems tohandle "continuous media" such as audio and video inaddition to "discrete media" such as text andgraphics. As with the introduction of computergraphics ten years ago, the integration ofcontinuous media will extend the range of computerapplications and change existing paradigms forcomputer usage and programming. Distributedcomputer systems that are capable of handlingcontinuous media can (1) unify the methods ofinformation distribution, (2) personalizeinformation services through interactive accessand individual information selection, and (3) makeinformation presentation more effective. The majorobstacles to using continuous media in today'scomputer systems are performance limitations. Inaddition to high-capacity and high-speed hardware,system software is needed that meets the real-timedemands of audio and video, and that providesapplication interfaces which take the specialrequirements of these new data types into account. 000591Complexity Theoretic Issues Concerning Block Ciphers Related to D.E.S.Richard Clevetr-90-027199007__000006The D.E.S. cipher is naturally viewed as acomposition of sixteen invertible transformationson 64-bit strings (where the transformations dependof the value of a 56-bit key). Each of thetransformations has a special form and satisfies theparticular property that each of its output bits isdetermined by a "small" number of its input bits. Weinvestigate the computational power of blockciphers on n-bit strings that can be expressed aspolynomial-length (with respect to n) compositionsof invertible transformations that have a formsimilar to those of D.E.S. In particular, we requirethat the basic transformations have the propertythat each of their output bits depends on the value of asmall number of their input bits (where "small" issomewhere in the range between O(1) and O(log n)). Wepresent some sufficient conditions for ciphers ofthis type to be "pseudorandom function generators"and, thus, to yield private key cryptosystems thatare secure against adaptive chosen plaintextattacks. 000592Temporal Resoning with Intervals in Branching TimePeter B. Ladkin, Frank D. Anger, Rita V. Rodrigueztr-90-028199007__000006Allen [ALLE83] adapted path-consistencytechniques [MACK77] to heuristic reasoningconcerning intervals over linear time, bycalculating the composition table of binaryrelations on intervals, and using it in thepath-consistency algorithm. We consider here amodel of branching time which is dense, unbounded,future branching, without rejoining branches. Thealgorithm in [ALLE83] works directly withbranching-time intervals, provided only that thecomposition table of the binary branching-timeinterval relations is used instead of Allen's table[LADK88]. Here we calculate the composition tablewhich has to be used, which is considerably morecomplex than the table for linear-time intervals.This provides a heuristic, cubic-time algorithm forreasoning with branch-time intervals. 000593On Location: Points About RegionsPeter B. Ladkin, Judith S. Crowtr-90-029199007__000006In this paper we formalize Whitehead's constructionfor inducing point structures from regionstructures using a primitive relation of connectionon regions [Whi79]. Our concern is to formulate aspatiotemporal analogue to the construction oftemporal periods/points from events, and isreminiscent of the temporal constructions of Kamp[Kam79] and van Benthem [vBen83]. We compare ourinterpretation of Whitehead with the Kamp/vanBenthem/Russell constructions and find someunresolved issues of interdefinability. Our goal isan appposite formulation of spatiotemporallocations as suggested for Situation Theory byBarwise and Perry [BP83]. 000594On the Magnification of Exchange Graphs with Applications to Enumeration Problems (Thesis)Paul Dagumtr-90-030199007__000006This thesis concerns the design of fully polynomialapproximation algorithms for some #P-completeenumeration problems. The types of enumerationproblems we consider can be regarded as instances ofcomputing |F| for set systems (V,F) having adescription in terms of a "complete set ofimplicants" I with |I| = O(|V| superscript 2). Bystudying the geometric quantities of adjacency andmagnification of the "exchange graph" of setsystems, we establish criteria for the design offully polynomial algorithms. 000595Fault Tolerance in Feed-foward Artificial Neural NetworksCarlo H. Sequin, Reed D. Claytr-90-031199007__000006The errors resulting from defective units and faultyweights in layered feed-forward ANN's are analyzed,and techniques to make these networks more robustagainst such failures are discussed. First, usingsome simple examples of pattern classificationtasks and of analog function approximation, it isdemonstrated that standard architecturessubjected to normal backpropagation trainingtechniques do not lead to any noteworthy faulttolerance. Additional, redundant hardware coupledwith suitable new training techniques are necessaryto achieve that goal. A simple and general procedureis then introduced that develops fault tolerance inneural networks: The type of failures that one mightexpect to occur during operation are introduced atrandom during the training of the network, and theresulting output errors are used in a standard way forbackpropagation and weight adjustment. The resultof this training method is a modified internalrepresentation that is not only more robust to thetype of failures encountered in training, but whichis also more tolerant of faults for which the networkhas not been explicitly trained. 000596A Note on Self-Testing/Correcting Methods for Trigonometric FunctionsRichard Cleve, Michael Lubytr-90-032199007__000006Blum, Luby and Rubinfeld (1990) introduced thenotion of self-testing/correcting for variousproblems. We show how to apply some of theirtechniques to construct a self-testing/correctingpair for the problem of computing the sin and cosfunctions. 000597The Computational Complexity of (XOR, AND)-Counting ProblemsAndrzej Ehrenfeucht, Marek Karpinskitr-90-033199007__000006We characterize the computational complexity ofcounting the exact number of satisfying assignmentsof (XOR, AND)-formulas in their RSE-representation(i.e., equivalently, polynomials in GF[2] [xsubscript 1, ..., x subscript n]. This problemrefrained for some time efforts to find a polynomialtime solution and efforts to prove the problem to be#P-complete. Both main results can be generalized tothe arbitrary finite fields GF[q]. Because countingthe number of solutions of polynomials over finitefields is generic for many other algebraic countingproblems, the results of this paper settle a borderline for the algebraic problems with a polynomialtime counting algorithms and for problems which are#P-complete. In [Karpinski, Luby 89] the countingproblem for arbitrary multivariate polynomialsover GF[2] has been proved to have randomizedpolynomial time approximation algorithms. 000598Finite Representations of Deformable FunctionsPeitro Peronatr-90-034199007__000006Starting from a `template' function F(x) andcomposing it with a family of transformations Tsubscript 0 (e.g., rotations, scalings) of itsdomain one obtains a family of `deformations' of F,F0T(x) spanning an n-dimensional space; n is ingeneral infinite. A technique is presented thatallows (1) to compute the best approximation of agiven family using linear combinations of a finitenumber of `basis' functions; (2) to characterizethose functions F generating finite-dimensionalfamilies. The technique applies to all cases where Tsubscript 0 belongs to a compact group oftransformations. The results presented here haveapplications in early vision and signal processingfor the computation of filters in a continuum oforientations and scales. 000599An Introduction to Real-Time SchedulingRalf Guido Herrtwichtr-90-035199007__000006Until now, real-time processing techniques wereonly used in more exotic computer applications suchas process automation. With the advent of computersystems capable of handling time-critical data suchas digital audio and video, they become important forgeneral-purpose computing as well. Real-timescheduling, i.e., assigning resources to processesin a way that takes the timing requirements of theseprocesses into account, is the single most importanttechnique in the construction of real-time systems.This tutorial introduces the most widely used systemmodels for real-time scheduling, describingresource characteristics, process parameters, andscheduling objectives. It summarizes,illustrates, and verifies essential findings aboutbasic real-time scheduling algorithms such asearliest-deadline-first, least-laxity-first,and rate-monotonic scheduling for both sporadic andperiodic processes. 000600The Goedel Incompleteness Theorem and Decidability over a RingLenore Blumtr-90-036199008__000006Goedel showed in 1931 that given any reasonable(consistent and effective) theory of arithmetic,there are true assertions about the natural numbersthat are not theorems in that theory. This"incompleteness theorem" ended Hilbert's programof formalizing mathematics and is rightfullyregarded as the most important result in thefoundations of mathematics in this century. Now theconcept of undecidability of a set plays an importantrole in understanding Goedel's work. On the otherhand, the question of the undecidability of theMandelbrot set has been raised by Roger Penrose.Penrose acknowledges the difficulty of formulatinghis question because "decidability" hascustomarily only dealt with countable sets, not setsof real or complex numbers.

Here we give an expositionof Goedel's result in an algebraic setting and also aformulation (and essentially an answer) toPenrose's problem. The notions of computability anddecidability over a ring R underly our point of view.Goedel's Theorem follow from the Main Theorem: Thereis a definable undecidable set over Z. By way ofcontrast, Tarski's Theorem asserts that everydefinable set over the reals or any real closed field Ris decidable over R. We show a converse to this result,namely: any sufficiently infinite ordered fieldwith this property is necessarily real closed. 000601Two Results on the List Update Problem;Sandy Iranitr-90-037199008__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1990/tr-90-037.pdfIn this paper we give a randomized on-line algorithmfor the list update problem. Sleator and Tarjan show adeterministic algorithm, Move-to-Front, thatachieves competitive ratio of (2L-1)/L for lists oflength L. Karp an Raghavan show that no deterministicalgorithm can beat 2L/(L+1). We show thatMove-to-Front in fact achieves an optimalcompetitive ratio of 2L/(L+1). We show a randomizedalgorithm that achieves a competitive ratio of (31 L +1 )/16(L+1) against an oblivious adversary. This isthe first randomized strategy whose competitivefactor beats a constant less than 2.

Keywords: Analysis of Algorithms, On-line Algorithms, Competitive Analysis, Amortized Analysis, Linear Lists. 000602Information-Based Complexity: New Questions for MathematiciansJ. F. Traub, H. Woznaikowskitr-90-038199008__000006[No Abstract] 000603The Monte Carlo Algorithm with a Pseudo-Random GeneratorJ. F. Traub, H. Woznaikowskitr-90-039199008__000006We analyze the Monte Carlo algorithm for theapproximation of multivariate integrals when apseudo-random generator is used. We establish lowerand upper bounds on the error of such algorithms. Weprove that as long as a pseudo-random generator iscapable of producing only finitely many points, theMonte Carlo algorithm with such a pseudo-randomgenerator fails for L subscript 2 or continuousfunctions. It also fails for Lipschitz functions ifthe number of points does not depend on the number ofvariables. This is the case if a linear congruentialgenerator is used with one initial seed. On the otherhand, if a linear congruential generator of period mis used for each component with independentuniformly distributed initial seeds, then the MonteCarlo algorithm with such a pseudo-random generatorusing n function values behaves as for the uniformdistribution and its expected error is roughly nsuperscript (-1/2) as long as the number n of functionvalues is less than m superscript 2. 000604Designing Checkers for Programs that Run in ParallelRonitt Rubinfeldtr-90-040199008__000006We extend the theory of program result checking toparallel programs, and find general techniques fordesigning such result checkers. We find resultcheckers for many basic problems in parallelcomputation. We show that there are P-completeproblems (evaluating straight-line programs,linear programming) that have very fast (evenconstant depth) parallel result checkers. Sorting,multiplication, parity, majority and the all pairsshortest path problem all have constant depth resultcheckers. In addition, the sequential versions ofthe parallel result checkers given for integersorting and the all pairs shortest path problems arethe first deterministic sequential result checkersfor those problems. 000605Self-Testing/Correcting with Applications to Numerical ProblemsManuel Blum, Michael Luby, Ronitt Rubinfeldtr-90-041199008__000006Suppose someone gives us an extremely fast program Pthat we can call as a black box to compute a function f.Should we trust that P works correctly? Aself-testing/correcting pair for f allows us to: (1)estimate the probability that P(x) is not equal tof(x) when x is randomly chosen; (2) on any input x,compute f(x) correctly as long as P is not too faulty onaverage. Furthermore, both (1) and (2) take time onlyslightly more than the original running time of P.

Wepresent general techniques for constructing simpleto program self-testing/correcting pairs for avariety of numerical functions, including integermultiplication, modular multiplication, matrixmultiplication, inverting matrices, computing thedeterminant of a matrix, computing the rank of amatrix, integer division, modular exponentiationand polynomial multiplication. 000606CHCL - A Connectionist Inference System for Horn Logic based on theConnection Method and using Limited ResourcesSteffen Hoelldoblertr-90-042199008__000006A connectionist inference system for a class of Hornclauses is presented. The system is based on aconnectionist unification algorithm forfirst-order terms and utilizes Bibel's connectionmethod. The resources of the system are limited inthat at most one instance of each clause may be used in aproof. 000607ODA-Based Data Modeling in Multimedia SystemsRalf Guido Herrtwich, Luca Delgrossitr-90-043199008__000006A multimedia system can handle both discrete media(text, graphics) and continuous media (audio,video). The design of a multimedia system comprisesprocessing and data modeling aspects. In this paper,we are concerned with data modeling only. We present aproposal to extend the ISO Office DocumentArchitecture (ODA) to accommodate continuousmedia. To provide media flexibility, the needs fornew ODA content architectures are identified. Totake into account the timing requirements ofcontinuous-media data, attributes for temporalsynchronization are introduced for the logical andlayout structure of an ODA document. To consider thatmultimedia information does not only appeal to thesense of vision, the layout structure is extendedfrom two-dimensional visual space to arbitrary"presentation space". In addition, the inclusion oflive information and hypertext features into ODAdocuments is proposed. 000608Continuous Speech Recognition on the Resource Management Database UsingConnectionist Probability EstimationN. Morgan, C. Wooters, H. Bourlard, M. Cohentr-90-044199009__000006Previous work has shown the ability of MultilayerPerceptrons (MLPs) to estimate emissionprobabilities for a Hidden Markov Model (HMM). Theadvantage to this approach is the ability toincorporate multiple sources of evidence(features, temporal context) without restrictiveassumptions of distribution or statisticalindependence.

In our earlier publications on thistopic, a hybrid MLP/HMM continuous speechrecognition algorithm was tested on the SPICOSGerman-language data base. In our recent work, wehave shifted to the speaker-dependent portion ofDARPA's English language Resource Management (RM)data base. Both consist of continuous utterances(sentences) and incorporate a lexicon of roughly1000 words. Preliminary results appear to supportthe previously reported utility of MLP probabilityestimation for continuous speech recognition (atleast for the case of this simple form of HMM). 000609SPOONS '90: The SPeech recOgnition frOnt eNd workShopN. Morgan H. Hermansky, C. Wooterstr-90-045199009__000006An appropriate input representation is crucial forpattern classification. In spite of this, we findthat feature extraction, transformation, andselection tend to be under-represented aspects ofthe speech recognition literature. Therefore, theauthors decided to gather together a group ofinterested parties for a dialog on the subject. Weultimately invited a group of about 30 researchers,and on July 6, 1990, held a 1-day workshop which wecalled SPOONS. This document is a brief summary ofthat day, including the abstract for each talk. 000610Space-Frequency Distributions in the Analysis and Modeling of Early VisionGabriel Cristobaltr-90-046199009__000006The use of the joint space-spatial frequencyrepresentations has recently receivedconsiderable attention; especially in those areasof science and engineering where nonstationarysignals appear. In that case, local energydistribution representations based in the localspectra computation would be more appropriate. TheWigner Distribution (WD) which gives a jointrepresentation in the space and spatial frequencydomain entails a rigorous mathematical framework inthe study of these local representations. In thispaper, texture recognition is performed through theextraction of features from the WD and a comparativestudy with other methods is presented. A review of thestate-of-the-art of the joint representations indifferent areas of research namely signal, speechand vision processing is presented. Afterwards, theimportance of these distributions in the modeling ofearly vision processes is considered, and a briefreview about the physiological findings ispresented in order to have a quantitative measure ofthe degree of biological plausibility. 000611The Ring Array Processor (RAP): Algorithms and ArchitectureNelson Morgantr-90-047199009__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1990/tr-90-047.pdfWe have designed and implemented a Ring ArrayProcessor (RAP) for fast implementation of ourcontinuous speech recognition training algorithmswhich are currently dominated by layered neuralnetwork calculations. The RAP is a multi-DSP systemwith a low-latency ring interconnection schemeusing programmable gate array technology and asignificant amount of local memory per node (4-16MBytes of dynamic memory and 256 KByte of fast staticRAM). Theoretical peak performance is 128MFlops/board, and test runs with the first workingboard show a sustained throughput of roughly 30-90percent of this for algorithms of current interest.

This report describes the motivation for the RAPdesign, and shows how the architecture matches thetarget algorithm. Technical reports from othermembers of the RAP team focus on the hardware andsoftware specifics for the system. 000612The Ring Array Processor (RAP): Hardware;J. Becktr-90-048199009__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1990/tr-90-048.pdfThe ICSI Ring Array Processor, or RAP, is a system ofhardware and software specifically designed for ourresearch in speech processing using neuralnetworks. This technical report describes the RAPhardware, paying particular attention to thefeatures that may be unusual in a system of this type.Other features and design decisions thatfacilitated realization of the RAP are alsodescribed. Technical reports from other members ofthe RAP team focus on the architecutre and algorithmsof the RAP, and the software specifics for the system. 000613Ring Array Processor (RAP): Software User's Manual Version 1.0;P. Kohn, J. Bilmestr-90-049199009__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1990/tr-90-049.pdfThe RAP machine is a high performance parallelprocessor developed at ICSI as described in previoustechnical reports. This report documents the RAPsoftware environment. It is intended for themoderately experienced C programmer who wishes toprogram the RAP. The RAP software environment is verysimilar to the UNIX C programming environment.However, there are some differences arising from thehardware that the programmer must keep in mind. Alsodescribed is the RAP library which containshand-optimized matrix, vector and inter-processorcommunications routines. Single Program MultipleDatastream (SPMD) programs can be developed underUNIX with a simulated RAP library and then recompiledto run on the RAP. 000614Ring Array Processor (RAP): Software Architecture;Jeff Bilmes, Phil Kohntr-90-050199009__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1990/tr-90-050.pdfThe design and implementation of software for theRing Array Processor (RAP), a high performanceparallel computer, involved development for threehardware platforms: Sun SPARC workstations,Heurikon MC68020 boards running the VxWorksreal-time operating system, and Texas InstrumentsTMS320C30 DSPs. The RAP now runs in Sun workstationsunder UNIX and in a VME based system using VxWorks. Aflexible set of tools has been provided both to the RAPuser and programmer. Primary emphasis has beenplaced on improving the efficiency of layeredartificial neural network algorithms. This was doneby providing a library of assembly languageroutines, some of which use node-customcompilation. An object-oriented RAP interface inC++ is provided that allows programmers toincorporate the RAP as a computational server intotheir own UNIX applications. For those not wishing toprogram in C++, a command interpreter has been builtthat provides interactive and shell-script styleRAP manipulation. 000615Characterizing the Variability of Arrival Processes with Indices ofDispersionRiccardo Gusellatr-90-051199009__000006We propose to characterize the burstiness of packetarrival processes with indices of dispersion forintervals and for counts. These indices, which arefunctions of the variance of intervals and counts,are relatively straightforward to estimate andconvey much more information than simpler indices,such as the coefficient of variation, that are oftenused to describe burstiness quantitatively.

Wedefine and evaluate the indices of dispersion forsome of the simple analytical models that arefrequently used to represent highly variableprocesses. We then estimate the indices for a numberof measured point processes which were generated byworkstations communicating to file servers over alocal-area network.

We show that nonstationarycomponents in the measured packet arrival datadistort the shape of the indices and propose ways tohandle nonstationary data. Finally, to show how toincorporate measures of variability intoanalytical models and to offer an example of how tomodel our measured packet arrival processes, wedescribe a fitting procedure based on the index ofdispersion for counts for the Markov-modulatedPoisson process. 000616On Semi-Algebraic Decision ComplexityThomas Lickteigtr-90-052199009__000006The topic of this paper is the lower bound question forcomposition trees that solve certain semialgebraicdecision problems. 000617A Pipelining Model Which Pipelines Blocks of CodeJoachim Beertr-90-053199010__000006This paper presents a new technique of softwarepipelining and an architecture to support thistechnique. Rather than attempting to pipeline asequence of individual instructions, the presentedtechnique tries to pipeline entire blocks of code,i.e. the units to be pipelined are chunks of code,instructions within each code block might or mightnot be pipelined themselves. In this model blocks ofcode are identified which can be executed in apipelined fashion. Neighboring blocks of code do notneed to be data independent; pipeline stages can feedresults and/or synchronization markers on to thenext pipeline stage. The architecture can be seen asan attempt to use classical pipelining techniques ina multiprocessor system. The architecture consistsof a circular pipeline of ordinary microprocessors.Advantages of the architecture are: unlikesupercomputers and VLIW architectures the systemcan be based on commercial micro-processors, itavoids the high overhead of process startup, and it isnot restricted to vectorizing only inner-loops.Simulation studies show the viability of thearchitecture and the associated execution model. 000618A Mathematical Theory of Self-Checking, Self-Testing and Self-Correcting Programs (Thesis)Ronitt Rubinfeldtr-90-054199010__000006Suppose someone gives us an extremely fast program Pthat we can call as a black box to compute a function f.Rather than trust that p works correctly, aself-testing/correcting pair for f allows us to: (1)estimate the probability that P(x) is not equal tof(x) when x is randomly chosen; (2) on any input x,compute f(x) correctly as long as P is not too faulty onaverage. Furthermore, both (1) and (2) require only asmall multiplicative overhead (usually constant)over the running time of P. A program result checkerfor f (as introduced by Manuel Blum) allows us to checkthat on particular input x, P(x) = f(x).

We presentgeneral techniques for constructing simple toprogram self-testing/correcting pairs for avariety of numerical functions. Theself-testing/correcting pairs introduced for manyof the problems are based on the property that thesolution to a particular instance of the problem canbe expressed as the solution to a few random instancesof the same size. An important idea is to designself-testing/correcting pairs for an entirelibrary of functions rather than for each functionindividually.

We extend these notions and some of thegeneral techniques to check programs for somespecific functions which are only intended to givegood approximations to f(x). We extend the abovemodels and techniques of program result checking andself-testing/correcting to the case where thebehavior of the program is modelled as beingadaptive, i.e., the program may not always give thesame answer on a particular input. These strongercheckers provide multi-prover interactive proofsfor these problems.

The theory of checking is alsoextended to parallel programs [Rubinfeld]. Weconstruct parallel checkers for many basic problemsin parallel computation.

We show that for someproblems, result checkers that are much moreefficient can be constructed if the answers arechecked in batches, i.e., many answers are checked atthe same time. For these problems, themultiplicative overhead of checking the result canbe made arbitrarily small. 000619ICSIM: Initial Design of An Object-Oriented Net SimulatorHeinz W. Schmidttr-90-055199010__000006ICSIM is a connectionist net simulator beingdeveloped at ICSI. It is object-oriented to meet therequirements for flexibility and reuse of models andto allow the user to encapsulate efficientcustomized implementations perhaps running ondedicated hardware. Nets are composed by combiningoff-the-shelf library classes and, if necessary, byspecializing some of their routines.

The reportgives an overview of the simulator. The classstructure and some important design decisions aresketched and a number of example nets are used toillustrate how net structure, connectivity andbehavior are defined. 000620How Fast Can A Threshold Gate Learn?Wolfgang Maass, Gyoergy Turantr-90-056199010__000006It is shown that a threshold gate with d Boolean inputvariables can learn any halfspace in polynomially ind many steps in the common on-line learning model(worst case analysis). This is achieved by acomputationally feasible learning algorithm thatexploits geometrical properties in the versionspace. This positive result can be extended to thecase of input variables that range over {0,...,n-1},and to threshold gates with more than two differentoutput values (these gates can learn arbitrarydiscrete approximations to sigmoid thresholdfunctions).

On the other hand we show that all knowndistributed learning algorithms for thresholdgates (delta-rule, WINNOW 1, WINNOW 2) areinherently slow. 000621Learning Spatial Terms without Explicit Negative InstancesTerry Regiertr-90-057199010__000006A method is presented for learning to associatescenes with spatial terms, in the absence of explicitnegative instances, using error back-propagation.A straightforward approach, in the learning of agiven term, is to take all positive instances for anyother term to be implicit negative instances for theterm in question. While this approach is inadequate,a variation on it is shown to work well: error signalsfrom implicit negative instances are attenuated, sothat an implicit negative instance will have lesseffect on the network's weights than will a positiveinstance of the same error magnitude. It is also shownthat "a priori" knowledge of which pairs of spatialterms are antonyms facilitates the learningprocess. 000622A Theory of Computation and Complexity over the Real NumbersLenore Blumtr-90-058199010__000006The classical theory of computation and complexitypresupposes all underlying spaces are countable andhence ipso facto cannot handle arbitrary sets of realor complex numbers. Thus e.g., Penrose (1990)acknowledges the difficulty of formulatingclassically his question: Is the Mandelbrot setrecursive? On the other hand, this as well as a numberof other inherent questions of decidability andcomputability over the reals or complex number can benaturally posed and settled within the frameworkpresented in this paper. 000623Constraint Reasoning With Intervals: A Tutorial, Survey and BibliographyPeter B. Ladkintr-90-059199011__000006A version of this work was presented at the 1990Berkeley Workshop on Temporal and Real-TimeSpecification, held at ICSI, Berkeley. In Part I, wepresent a short tutorial on constraint reasoningwith time intervals, of the sort initiallyintroduced by James Allen, and continued by manyothers. The tutorial concentrates on the generalmathematical expression of common algorithms, inparticular path-consistency algorithms, forconstraint satisfaction using the thirteeninterval relations. We use the relation algebra ofTarski to express the important concepts. In Part II,we survey important research in this field to date,focusing on mathematical results and algorithms forreasoning directly with intervals, although we doattempt to include as much literature as the author isaware of. Part III is a select bibliography. Threeappendices include the mathematical background,and the operation tables for the Point Algebra andInterval Algebra, which form the focus of Part I. 000624Proceedings of the Berkeley Workshop on Temporal and Real-TimeSpecification, August 9-10, 1990P. B. Ladkin, F. H. Vogttr-90-060199011__000006This report contains papers presentedby participants at the workshop, with anintroduction, a participant list, a synopsis of theworkshop, and a short summary of the problem sessiondiscussion. The workshop brought togetherpractitioners with different interests in temporaland real-time specification, from simulation,testing and verification to theoretical issues suchas relative strengths of theories. The papersconcern interval logic, theories of intervals,real-time temporal logic and automata, a real-timesystems simulation language, and a causalityproblem in robot motion planning. 000625Stochastic Model-Based Image Segmentation Using Markov Random Fields andMulti-layer PerceptronsJun Zhang, Nelson Morgantr-90-061199011__000006Recently, there has been much interest in Markovrandom field (MRF) model-based techniques for image(texture) segmentation. MRF models are used toenforce reasonable physical constraints onsegmented regions, such as the continuity of theregions, and have been shown to improve segmentationresults. However, in these techniques, parametricprobability models which do not have sufficientphysical justifications are often used to modelobserved image data because they arecomputationally tractable. In this paper, weoutline an MRF approach to image segmentation inwhich the probability distribution of observedimage data is modeled by using a multi-layerperceptron (MLP) which can "learn" the distributionfrom training data. Furthermore, we propose atechnique to achieve unsupervised imagesegmentation using this approach. We hope that thiswill improve the current MRF image segmentationtechniques by providing a better model for observedimage data. 000626Proceedings of the First International Workshop on Network Operating SystemSupport for Digital Audio and Video[Proceedings Editor]tr-90-062199011__000006Held at the International Computer ScienceInstitute November 8-9, 1990. 000627A Monte-Carlo Algorithm for Estimating the Permanent;N. Karmarkar, R. Karp, R. Lipton, L. Lovasz, M. Lubytr-90-063199011__000006Let $A$ be an $n \times n$ matrix with 0-1 valuedentries, and let $\PER(A)$ be the permanent of $A$. Wedescribe a Monte-Carlo algorithm which produces a``good in the relative sense'' estimate of $\PER(A)$and has running time $\POLY(n) 2^{n/2}$, where$\POLY(n)$ denotes a function that growspolynomially with $n$.

Key Words: permanent,matching, Monte-Carlo algorithm, algorithm,bipartite graph, determinant. 000628Quality of Service in ATM NetworksDomenico Ferrari, Dinesh Vermatr-90-064199012__000006B-ISDN networks of the future will have to handletraffic with a wide range of traffic characteristicsand performance requirements. In view of the highbandwidth of these networks and the relatively largepropagation delays involved in wide-area B-ISDNnetworks, the performance requirements can only beprovided by reserving resources to communicatingclients at the connection establishment time.However, reservation mechanisms for heterogenousbursty traffic usually result in a rather poorutilization of network resources.

In this paper, wepropose a simple admission control criterion thatcan be used to reserve resources for bursty as well assmooth traffic with delay and loss sensitivities.Our scheme leads to a reasonable value of the maximumutilization of network bandwidth (about 40 percent)for delay sensitive traffic with moderateburstiness (peak-to-average bandwidth ratios ofabout 4), even under the worst possible conditions.Actual utilizations can be higher if there is smoothtraffic or traffic which is not delay-sensitive. Ouradmission control algorithm uses a well-definedtraffic specification scheme which is easy toenforce and verify, and able to accommodatearbitrary degrees of burstiness. Extensivesimulation experiments failed to show that ouradmission control criterion are incorrect, in thesense that the quality of service requirements of thetraffic was always met,even in the worst case.Moreover, the scheme is simple and feasible at thehigh speeds required of B-ISDN networks. 000629Developments in Digital VLSI Design for Artificial Neural NetworksNelson Morgan, Krste Asanovic, Brian Kingsbury, John Wawrzynektr-90-065199012__000006Artificial Neural Networks (ANNs) have beenheralded as a form of massive parallelism that maysignificantly advance the state of the art in machineintelligence and perception. While theseexpectations may or may not be realistic, this classof algorithms has already been useful for difficultproblems in signal processing and patternrecognition over the last 25 years. However, forextension to a wider class of problems, a keyrequirement is the parallel hardwareimplementation of such systems, since ANNimplementation on conventional Von Neumannmachines is often prohibitively slow. While the ANNmainstream has focused on analog VLSI ANNs, someprojects have shown the potential of a fully digitalapproach. We report here on progress in developing amethodology for digital ANN design, including a newobject-oriented CAD interface, and a set ofANN-specific library cells. A new measure forefficiency of silicon ANNs is also described. 000630Automatic Worst Case Complexity Analysis of Parallel ProgramsWolf Zimmermanntr-90-066199012__000006This paper introduces a first approach for theautomatic worst case complexity analysis. It is anextension of previous work on the automaticcomplexity analysis of functional programs. Thelanguage is a first order parallel functionallanguage which allows the definition of indexed datatypes and parallel execution of indexed terms. Themachine model is a parallel reduction system based oneager evaluation. It is shown how parallel programsbased on the basic design principles balanced binarytree technique, divide-and-conquer technique andpointer jumping technique can be analyzedautomatically. The analysis techniques aredemonstrated by various examples. Finally it isshown that an average case analysis of parallelprograms is difficult. 000631The Mean Field Theory in EM Procedures for Markov Random FieldsJun Zhangtr-91-001199101__000006The EM (expectation maximization) algorithm is amaximum-likelihood parameter estimationprocedure for incomplete data problems in which partof the data is hidden, or unobservable. In many signalprocessing and pattern recognition applications,the hidden data are modeled as Markov processes andthe main difficulty of using the EM algorithm forthese applications is the calculation of thecondition expectations of the hidden Markovprocesses. In this paper, we show how the mean fieldtheory from statistical mechanics can be used toefficiently calculate the conditionalexpectations for these problems. The efficacy of themean field theory approach is demonstrated on theparameter estimation for one-dimensional mixturedata and two-dimensional unsupervised stochasticmodel-based image segmentation. Experimentalresults indicate that in the 1-D case, the mean fieldtheory approach provides comparable results tothose obtained by Baum's algorithm, which is known tobe optimal. In the 2-D case, where Baum algorithm canno longer be used, the mean field theory provides goodparameter estimates and image segmentation for bothsynthetic and real-world images. 000632Protocols for Providing Performance Guarantees in a Packet SwitchingInternetCarlyn M. Lowerytr-91-002199101__000006As advances in technology enable us to implement veryhigh speed computer networks, we expect to use ournetworks for more diverse applications. While theInternet was designed with textual data processingin mind, future networks will carry information suchas voice, music, images, and video, along withtextual data. Many new applications will havereal-time performance requirements, where thetiming of data arrival is crucial to its usefulness.

This paper describes a methodology developed at theUniversity of California at Berkeley to support suchapplications, reviews related research work, andproposes a real-time delivery system, composed of anew protocol for administration of real-timeconnections, combined with modifications to theInternet Protocol (IP) to support such connections.Transport protocol requirements are alsodiscussed. This work is intended to facilitateexperiments with real-time communication over theExperimental University Network (XUNET). 000633On-Line Learning with an Oblivious Environment and the Power ofRandomizationWolfgang Maasstr-91-003199101__000006A new model for on-line learning is introduced. Inthis model the environment is assumed to be"oblivious" to the learner: it supplies an arbitrary(not necessarily random) sequence of examples forthe target concept which does not depend on thesequence of hypotheses of the learner. This modelprovides a framework for the design and analysis ofon-line learning algorithms which acquireinformation not just from counter examples, but alsofrom examples which "support" their currenthypothesis. It is shown that for various conceptclasses C an arbitrary target concept from C can belearned in this model by a randomized learningalgorithm (which uses only hypotheses from C) withsubstantially fewer prediction errors than inAngluin's classical model for on-line learning withan adaptive worst-case environment. In particularany target-setting of weights and threshholds in afeed forward neural net can be learned by a randomizedlearning algorithm in this model with an expectednumber of prediction errors that is polynomial in thenumber of units of the neural net.

For comparison wealso examine the power of randomization forAngluin's model for learning with an adaptiveenvironment. 000634Real-Time Transmission and Software Decompression of Digital Video in aWorkstationK. Umemura, A. Okazaketr-91-004199101__000006This paper describes an experiment in whichcompressed video data is transformed via Ethernet toa workstation, and uncompressed and displayed on theworkstation. The workstation has no specialhardware. The video data is 192x114 pixel gray scale,30 frames per second. The data consists of a humanspeaker with a static background. It is displayed onmonochrome display, with dithering, in a 768x576rectangle. This decompression and display usesabout 10 MIPS. The quality of output is suitable forapplications such as conferencing, telephony, andpresentations. 000635Some Computational Problems in Linear Algebra as Hard as MatrixMultiplicationPeter Buergisser, Marek Karpinski, Thomas Lickteigtr-91-005199101__000006We define the complexity of a computational problemgiven by a relation using the model of a computationtree with Ostrowski complexity measure. To asequence of problems we assign an exponent similar asfor matrix multiplication. For the complexity of thefollowing computational problems in linearalgebra:

Weprove relative lower bounds of the form ((a)(Msubscript n)) - b and absolute lower bounds where (Msubscript n) denotes the complexity of matrixmultiplication and a, b, d are suitably chosenconstants. We show that the exponents of the problemsequences KER, OGB, SPR are the same as the exponentomega of matrix multiplication. 000636Parallel Combinatorial ComputingRichard M. Karptr-91-006199101__000006In this article we suggest that the application ofhighly parallel computers to applications with acombinatorial or logical flavor will grow inimportance. We briefly survey the work oftheoretical computer scientists on theconstruction of efficient parallel algorithms forbasic combinatorial problems. We then discuss atwo-stage algorithm design methodology, in which analgorithm is first designed to run on a PRAM and thenimplemented for a distributed-memory machine.Finally, we propose the class of node expansionalgorithms as a fruitful domain for the applicationof highly parallel computers. 000637Delay Jitter Control for Real-Time Communication in a Packet SwitchingNetworkDinesh C. Verma, Hui Zhang, Domenico Ferraritr-91-007199101__000006A real-time channel is a simplex connection betweentwo nodes characterized by parameters representingthe performance requirements of the client. Theseparameters may include a bound on the minimumconnection bandwidth, a bound on the maximum packetdelay, and a bound on the maximum packet loss rate.Such a connection may be established in apacket-switching environment by means of theschemes described by some of the authors in previouspapers.

In this paper, we study the feasibility ofbounding the delay jitter for real-time channels in apacket-switched store-and-forward wide-areanetwork with general topology, extending the schemeproposed in the previous papers. We prove thecorrectness of our solution, and study itseffectiveness by means of simulations. The resultsshow that the scheme is capable of providing asignificant reduction in delay jitter, that there isno accumulation of jitter along the path of a channel,and that jitter control reduces the buffer spacerequired in the network significantly. 000638A Study of I/O Architecture for High Performance Next Generation ComputersAnurag Sah, Vojin G. Oklobdjiza, Dinesh C. Vermatr-91-008199101__000006We describe an I/O architecture for a highperformance next generation computer. Thearchitecture proposed in this paper makes specialprovisions for communication networks. In order toallow for the expected multi-media andtime-critical components of future computer usage,we propose the concept of "Illogical buses" whichgives the illusion that there are a number ofdedicated buses between the components of a system. Alogical bus has a number of performance parametersassociated with it, and the system architectureensures that the performance parameters for eachlogical bus are satisfied during the operation of thesystem. 000639Bumptrees for Efficient Function, Constraint, and Classification LearningStephen M. Omohundrotr-91-009199101__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1991/tr-91-009.pdfA new class of data structures called "bumptrees" isdescribed. These structures are useful forefficiently implementing a number of neural networkrelated operations. An empirical comparison withradial basis functions is presented on a robot armmapping learning task. Applications to densityestimation, classification, and constraintrepresentation and learning are also outlined. 000640How Receptive Field Parameters Affect Neural LearningStephen M. Omohundro, Bartlett W. Meltr-91-010199101__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1991/tr-91-010.pdfWe identify the three principle factors affectingthe performance of learning by networks withlocalized units: unit noise, sample density, and thestructure of the target function. We then analyze theeffect of unit receptive field parameters on thesefactors and use this analysis to propose a newlearning algorithm which dynamically altersreceptive field properties during learning. 000641Algorithms for Sparse Rational InterpolationDima Grigoriev, Marek Karpinskitr-91-011199101__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1991/tr-91-011.pdfWe present two algorithms on sparse rationalinterpolation. The first is the interpolationalgorithm in a sense of the sparse partial fractionrepresentation of rational functions. The second isthe algorithm for computing the entier and theremainder of a rational function. The firstalgorithm works without apriori known bound on thedegree of a rational function, the second one is in theclass NC provided the degree is known. The presentedalgorithms complement the sparse interpolationresults of [Grigoriev, Karpinski, and Singer(1990)].

Keywords: Algorithms, NC-Class, Sparse Rational Interpolation, Fraction Representation. 000642On Distributed Representation in Word SemanticsBurghard B. Riegertr-91-012199101__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1991/tr-91-012.pdfThe dualism of the rationalistic tradition ofthought is sketched in view of the "semiotic problem"of meaning constitution. Being a process ofcognition which is based upon communicativeinteraction by signs, their usages (in linear orderand selective combination), constitute languagestructures. Other than "symbolic"representational formats employed so far in naturallanguage processing by machine, it is argued herethat "distributional" representations corresponddirectly to the way word meanings are constituted andunderstood (as fuzzy structures of world knowledge)by (natural and artificial) information processingsystems. Based upon such systems' theoreticalperformance in general and the pragmatics ofcommunicative interaction by real language users inparticular, the notions of "situation" and"language game" as introduced by Barwise/Perry andWittgenstein respectively are combined to allow fora numerical reconstruction of processes thatsimulate the constitution of meaning and theinterpretation of signs. This is achieved bymodelling the linear or "syntagmatic" and selectiveor "paradigmatic" constraints which naturallanguage structure imposes on the formation of(strings of) linguistic entities. A formalism, arelated algorithm, and test results of itsimplementation are given in order to substantiatethe claim for an artificial "cognitive informationprocessing system" (CIPS) that operates in alinguistic environment as some meaning acquisitionand understanding device. 000643Short Proofs for Nondivisibility of Sparse Polynomials under the ExtendedRiemann HypothesisDima Grigoriev, Marek Karpinski, Andrew M. Odlyzkotr-91-013199102__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1991/tr-91-013.pdfSymbolic manipulation of sparse polynomials, givenas lists of exponents and nonzero coefficients,appears to be much more complicated than dealing withpolynomials in dense encoding (see e.g. [GKS 90, KT88, P 77a, P 77b]). The first results in this directionare due to Plaisted [P 77a, P 77b], who proved, inparticular, the NP-completeness of divisibility ofa polynomial x**n-1 by a product of sparsepolynomials. On the other hand, essentially nothingnontrivial is known about the complexity of thedivisibility problem of two sparse integerpolynomials. (One can easily prove that it is inPSPACE with the help of [M 86].) Here we prove thatnondivisibility of two sparse multivariablepolynomials is in NP, provided that the ExtendedRiemann Hypothesis (ERH) holds (see e.g. [LO 77]).

The divisibility problem is closely related to therational interpolation problem (whosedecidability and complexity bound are determined in[GKS 90]). In this setting we assume that a rationalfunction is given by a black box for evaluating it. Weprove also that the problem of deciding whether arational function given by a black box equals apolynomial belongs to the parallel class NC,provided the ERH holds and moreover, that we know thedegree of some sparse rational representation of it.

Keywords: Algorithms, NC-Class, Symbolic Manipulation, Nondivisibility, Short Proofs, Extended Riemann Hypothesis. 000644Computational Complexity of Learning Read-Once Formulas over Different BasesLisa Hellerstein, Marek Karpinskitr-91-014199102__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1991/tr-91-014.pdfWe study computational complexity of learningread-once formulas over different boolean bases. Inparticular we design a polynomial time algorithm forlearning read-once formulas over a threshold basis.The algorithm works in time O(n**3) using O(n**3)membership queries. By the result of [Angluin,Hellerstein, Karpinski, 1989] on the correspondingunate class of boolean functions, this gives apolynomial time learning algorithm for arbitraryread-once formulas over a threshold basis withnegation using membership and equivalence queries.Furthermore we study the structural notion ofnondegeneracy in the threshold formulasgeneralizing the result of [Heiman, Newman,Wigderson, 1990] on the uniqueness of read-onceformulas over different boolean bases and derive anegative result on learnability of nondegenerateread-once formulas over the basis (AND, XOR).

Keywords: Computational Complexity, Learning Algorithms, Read-Once Formulas, Queries. 000645A Control-Theoretic Approach to Flow ControlSrinivasan Keshavtr-91-015199103__000006This paper presents a control-theoretic approach toreactive flow control in networks that do not reservebandwidth. We assume a round-robin-like queueservice discipline in the output queues of thenetwork's switches, and propose deterministic andstochastic models for a single conversation in anetwork of such switches. We then construct astandard time-invariant linear model for thesimplified dynamics of the system. This is used todesign an optimal (Kalman) state estimator, aheuristic second-order state estimator as well as aprovably stable rate-based flow control scheme.Finally, schemes for correcting parameter drift andfor coordination with window flow control aredescribed. 000646Parallel Priority QueuesMaria Cristina Pinotti, Geppino Puccitr-91-016199103__000006This paper introduces the Parallel Priority Queue(PPQ) abstract data type. A PPQ stores a set ofinteger-valued items and provides operations suchas insertion of n new items or deletion of the nsmallest ones. Algorithms for realizing PPQoperations on an n-processor CREW-PRAM are based ontwo new data structures, the n-Bandwidth-Heap (n-H)and the n-Bandwidth-Leftist-Heap (n-L), that areobtained as extensions of the well known sequentialbinary-heap and leftist-heap, respectively. Usingthese structures, it is shown that insertion of n newitems in a PPQ of m elements can be performed inparallel time O(h+logn), where h=log(m/n), whiledeletion of the n smallest items can be performed intime O(h+loglogn). 000647Optimal Adaptive K-means Algorithm with Dynamic Adjustment of Learning RateChedsada Chinrungrueng, Carlo Sequintr-91-017199103__000006Adaptive k-means clustering algorithms have beenused in several artificial neural networkarchitectures, such as radial basis functionnetworks or feature-map classifiers, for acompetitive partitioning of the input domain. Thispaper presents a modification of the traditionalk-means algorithm. In approximates an optimalclustering solution with an efficient adaptivelearning rate, which renders it usable even insituations where the statistics of the problem taskslowly varies with time. This modification is basedon the optimality criterion for the k-meanspartition stating that all of the region in theoptimal k-means partition have the same "within-cluster variation" when the number of regions in thepartition is large and the underlying distributionfor generating input patterns is smooth. Thewithin-cluster variation of any cluster is definedas the expectation of the squared Euclidean distancebetween pattern vectors in that cluster and thecenter of that cluster. Simulations comparing thisimproved adaptive k-means algorithm with otherk-means variants are presented. 000648Computational Complexity of Sparse Rational InterpolationDima Grigoriev, Marek Karpinski, Michael F. Singertr-91-018199103__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1991/tr-91-018.pdfWe analyze the computational complexity of sparserational interpolation, and give the first genuinetime (arithmetic complexity does not depend on thesize of the coefficients) algorithm for thisproblem.

Keywords: Computational Complexity, Algorithms, Arithmetic Complexity, Sparse Rational Interpolation. 000649Probabilistic Recurrence RelationsRichard M. Karptr-91-019199103__000006This paper is concerned with recurrence relationsthat arise frequently in the analysis ofdivide-and-conquer algorithms. In order to solve aproblem instance of size $x$, such an algorithminvests an amount of work $a(x)$ to break the probleminto subproblems of sizes$h_1(x),h_2(x),\ldots,h_k(x)$, and then proceedsto solve the subproblems. Our particular interest isin the case where the sizes $h_i(x)$ are randomvariables; this may occur either because ofrandomization within the algorithm or because theinstances to be solved are assumed to be drawn from aprobability distribution. When the $h_i$ are randomvariables the running time of the algorithm oninstances of size $x$ is also a random variable$T(x)$. We give several easy-to-apply methods forobtaining fairly tight bounds on the upper tails ofthe probability distribution of $T(x)$, and presenta number of typical applications of these bounds tothe analysis of algorithms. The proofs of the boundsare based on an interesting analysis of optimalstrategies in certain gambling games. 000650The Design of a File System that Supports MultimediaVassilios G. Polimenistr-91-020199103__000006A multimedia file system is one that can supportreal-time sessions as well as normal disk traffic.When a request for a real-time session is accepted,the file system guarantees that, as long as the systemdoes not crash and the user process reads or writesdata at most as fast as the initially specified rate,starvation will never occur.

It is shown that the onlyhard requirement for the acceptance of a set ofreal-time sessions are that there is enough diskbandwidth and buffer space. A rigorous discussion ofthese requirements as well as the various parametersthat affect the system's behaviors are presented.

Finally and most importantly, a scheduler that usesthis theory to schedule the various disk transfers isdesigned. The scheduler guarantees thenon-starvation for multimedia sessions and alsothat interactive (non-real-time) jobs willexperience acceptable response delays. 000651Generalized Compact Multigrid (REVISED)Victor Pan, John Reiftr-91-021199212__000006Extending our recent work, based on the ideas of themultigrid iteration, we decrease the storage spacefor a smooth solution of a nonlinear PDE and,furthermore, for any smooth function on amultidimensional grid and on discretization setsother than grids. 000652An (epsilon, delta)--Approximation Algorithm of the Number of Zeros for aMultilinear Polynomial over GF[q]Marek Karpinski, Barbara Lhotzkytr-91-022199103__000006We construct a polynomial time (epsion,delta)-approximation for estimating the number ofzeros of an arbitrary multi-linear polynomial f((xsubscript 1), ..., (x subscript n)) over GF[q]. Thisextends the recent result of Karpinski/Luby [KL90]on approximating the number of zeros of polynomialsover the field GF[2]. 000653On the Average Case Complexity of Parallel Sublist SelectionGeppino Pucci, Wolf Zimmermantr-91-023199103__000006The "Sublist Selection Problem" (SSP) is thefollowing: Given an input list of nodes labelled Trueor False, extract the sublist of nodes labelled True.This paper analyzes the average case complexity of aparallel algorithm that solves SSP on the PRAM modelof computation. The algorithm is based on thewell-known "recursive doubling" technique. Doublylogarithmic upper and lower bounds are derived forthe average number of iterations needed to producethe output list, under the assumption that all thenodes of the input list are marked False withprabability p, independently of the other nodes.Finally, the exact number of iterations (up to lowerorder terms) is established in the case that the inputlist is drawn from the uniform distribution over allpossible labelings. 000654Large Comparison of Rate-Based Service DisciplinesHui Zhang, Srinivasan Kesahvtr-91-024199104__000006This paper compares six new queue servicedisciplines that are implemented at the outputqueues of switches in a connection-oriented packetswitched data network. These are Virtual Clock, FairQueueing, Delay-Earliest-Due-Date,Jitter-Earliest-Due-Date, Stop-and-Go andHierarchical Round Robin. We describe theirmechanisms, their similarities and differences,and some implementation strategies. In particular,we show why each discipline can or cannot providebandwidth, delay and delay jitter guarantees. Thisleads to some interesting conclusions about therelative strengths and weaknesses of each approach. 000655Limiting Fault-Induced Output Errors In ANNsReed D. Clay, Carlo H. Sequintr-91-025199104__000006The worst case output errors produced by the failureof a hidden neuron in layered feed-forward ANNs areinvestigated. These errors can be much worse thansimply the loss of the contribution of a neuron whoseoutput goes to zero. A much larger erroneous signalcan be produced when the failure sets the value of thehidden neuron to one of the power supply voltages.

Anew method is investigated that limits thefractional error in the output signal of afeed-forward net due to such saturated hidden unitfaults in analog function approximation tasks. Thenumber of hidden units is significantly increased,and the maximal contribution of each unit is limitedto a small fraction of the net output signal. Toachieve a large localized output signal, severalGaussian hidden units are moved into the samelocation in the input domain and the gain of the linearsumming output unit is suitably adjusted. Since thecontribution of each unit is equal in magnitude,there is only a modest error under any possiblefailure mode. 000656[REVISED:] New Resultant Inequalities and Complex Polynomial Factorization(formerly known as "Randomized Incomplete Numerical Factorization of aPolynomial Over the Complex Field")Victor Pantr-91-026199212__000006We deduce some new probabilistic estimates on thedistances between the zeroes of a polynomial p(x) byusing some properties of the discriminant of p(x) andapply these estimates to improve the fastestdeterministic algorithm for approximatingpolynomial factorization over the complex field. 000657An Approximation Algorithm for the Number of Zeros of Arbitrary Polynomialsover GF[q]Dima Grigoriev, Marek Karpinskitr-91-027199104__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1991/tr-91-027.pdfWe design the first polynomial time (for an arbitraryand fixed field GF[q])(epsilon,delta)-approximation algorithm for thenumber of zeros of arbitrary polynomial f(x_1, ...,x_n) over GF[q].It gives the first efficient methodfor estimating the number of zeros and nonzeros ofmultivariate polynomials over small finite fieldsother than GF[2] (like GF[3]), the case important forvarious circuit approximation techniques. Thealgorithm is based on the estimation of the number ofzeros of an arbitrary polynomial f(x_1, ... ,x_n)over GF[q] in the function on the number m of its terms.The bounding ratio number is proved to be m**((q-1)log q) which is the main technical contribution ofthis paper and could be of independent algebraicinterest.

Keywords: Approximation Algorithms, Counting Problems, Multivariate Polynomials, Finite Fields. 000658The Packet Pair Flow Control ProtocolSrinivasan Keshavtr-91-028199105__000006This paper presents a reactive flow controlmechanism for networks that do not reservebandwidth. We assume a round-robin-like FairQueueing service discipline in the output queues ofswitches and routers, which enables us to model aconversation as a sequence of D/D/1 queues. Thismodel is used to derive a rate-based flow controlprotocol called Packet-pair, or 2P. 2P uses shortpacket bursts to estimate the service rate of aconversation at its bottleneck, and to adapt itssending rate to the network state. We describe thedesign and impementation of 2P in detail.Simulations compare the scheme with some well knownflow control schemes in deterministic as well asstochastic scenarios. Anaysis and simulationsindicate that 2P is able to use available bandwidthefficiently and to achieve low queueing delays,particulary in networks where the bandwidth-delayproduct is large. Further, 2P responds quickly andcorrectly to dynamic changes in the network. 000659On the Decidability Problem for a Topological Syllogistic Involving theNotion of Topological ProductDomenico Cantone, Vincenzo Cutellotr-91-029199105__000006A two-level, multi-sorted language of sets withcartesian product is introduced. The solvability ofthe satisfiability problem for the correspondingclass of unquantified formulae is shown to be usefulin order to automatically verify the validity ofcertain topological statements involving thenotion of product of spaces.

The underlyingmotivation for this study is to enrich the class oftheoretical results that can be used for aset-theoretic proof verifier. 000660Probability estimation by feed-forward networks in continuous speechrecognitionSteve Renals, Nelson Morgan, Herve Bourlardtr-91-030199108__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1991/tr-91-030.pdfWe review the use of feed-forward networks asestimators of probability densities in hiddenMarkov modelling. In this paper we are mostlyconcerned with radial basis functions (RBF)networks. We note the isomorphism of RBF networks totied mixture density estimators; additionally wenote that RBF networks are trained to estimateposteriors rather than the likelihoods estimated bytied mixture density estimators. We show how theneural network training should be modified toresolve this mismatch. We also discuss problems withdiscriminative training, particularly the problemof dealing with unlabelled training data and themismatch between model and data priors. 000661pSather monitors: Design, Tutorial, Rationale and ImplementationJerome A. Feldman, Chu-Cheow Lim, Franco Mazzantitr-91-031198909__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1989/tr-91-031.pdfSather is a new object-oriented programminglanguage under development at the InternationalComputer Science Institute. The initial beta testrelease of the language was in June, 1991. From theoutset, one goal of the Sather project has been theincorporation of constructs to support parallelprogramming. pSather is a parallel extension ofSather aimed at shared memory parallelarchitectures. A prototype of the language iscurrently being implemented on a Sequent Symmetryand on SUN Sparc-Stations. pSather monitors are oneof the basic new features introduced in the languageto deal with parallelism. The current design ispresented and discussed in detail. 000662GAL: Networks that grow when they learn and shrink when they forgetEthem Alpaydintr-91-032199105__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1991/tr-91-032.pdfLearning when limited to modification of someparameters has a limited scope; the capability tomodify the system structure is also needed to get awider range of the learnable. In the case ofartificial neural networks, learning by iterativeadjustment of synaptic weights can only succeed ifthe network designer predefines an appropriatenetwork structure, i.e., number of hidden layers,units, and the size and shape of their receptive andprojective fields. This paper advocates the viewthat the network structure should not, as usuallydone, be determined by trial-and-error but should becomputed by the learning algorithm. Incrementallearning algorithms can modify the networkstructure by addition and/or removal of units and/orlinks. A survey of current connectionist literatureis given on this line of thought. ``Grow and Learn''(GAL) is a new algorithm that learns an association atone-shot due to being incremental and using a localrepresentation. During the so-called ``sleep''phase, units that were previously stored but whichare no longer necessary due to recent modificationsare removed to minimize network complexity. Theincrementally constructed network can later befinetuned off-line to improve performance. Anothermethod proposed that greatly increases recognitionaccuracy is to train a number of networks and vote overtheir responses. The algorithm and its variants aretested on recognition of handwritten numerals andseem promising especially in terms of learningspeed. This makes the algorithm attractive foron-line learning tasks, e.g., in robotics. Thebiological plausibility of incremental learning isalso discussed briefly.

Keywords: Incremental learning, supervised learning, classification, pruning, destructive methods, growth, constructive methods, nearest neighbor. 000663Polymorphic Processor ArraysMassimo Marescatr-91-033199105__000006A Polymorphic Processor Array (PPA) is atwo-dimensional mesh- connected array ofprocessors, in which each processor is equipped witha switch able to interconnect its four NEWS ports. PPAis an abstract architecture based upon theexperience acquired in the design and in theimplementation of a VLSI chip, namely thePolymorphic Torus (PT) chip, and, as a consequence,it only includes capabilities that have been provedto be supported by cost-effective hardwarestructures. The main claims of PPA are that 1) itmodels a realistic class of parallel computers, 2) itsupports the definition of high level programmingmodels, 3) it supports virtual parallelism and 4) itsupports low complexity algorithms in a number ofapplication fields. In this paper we present both thePPA computation model and the PPA programming model;we show that the PPA computation model is realistic byrelating it to the design of the PT chip and show thatthe PPA programming model is scalable bydemonstrating that any algorithm having 0(p)complexity on a virtual PPA of size (square root m) X(square root m), has 0(kp) complexity on a PPA of size(square root n) X (square root n), with m=kn and kinteger. We finally show some applicationalgorithms in the area of numerical analysis andgraph processing. 000664Sather Language Design and Performance EvaluationChu-Cheow Lim, Andreas Stolcketr-91-034199105__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1991/tr-91-034.pdfSather is an object-oriented language recentlydesigned and implemented at the InternationalComputer Science Institute in Berkeley. It compilesinto C and is intended to allow development ofobject-oriented, reusable software whileretaining C's efficiency and portability. Weinvestigate to what extent these goals were metthrough a comparative performance study andanalysis of Sather and C programs on a RISC machine.Several language design decisions in Sather aremotivated by the goal of efficient compilation tostandard architectures. We evaluate the reasoningbehind these decisions, using instruction set usagestatistics, cache simulations, and other datacollected by instrumented Sather-generated code.

We conclude that while Sather users still pay amoderate overhead for programming convenience (inboth run time and memory usage) the overall CPU andmemory usage profiles of Sather programs arevirtually identical to those of comparable Cprograms. Our analysis also shows that each of thechoices made in Sather design and implementation iswell justified by a distinctive performanceadvantage. It seems, then, that Sather proves thefeasibility of its own design goal of makingobject-oriented programming efficient on standardarchitectures using a combination of judiciouslanguage design and efficient implementation. 000665HiPNeT-1: A Highly Pipelined Architecture for Neural Network TrainingKrste Asanovic, Brian E. D. Kingsbury, Nelson Morgan, John Wawrzynektr-91-035199106__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1991/tr-91-035.pdfCurrent artificial neural network (ANN) algorithmsrequire extensive computational resources.However, they exhibit massive fine-grainedparallelism and require only moderate arithmeticprecision. These properties make possible customVLSI implementations for high performance, low costsystems. This paper describes one such system, aspecial purpose digital VLSI architecture toimplement neural network training in a speechrecognition application.

The network algorithm hasa number of atypical features. These include: sharedweights, sparse activation, binary inputs, and aserial training input stream. The architectureillustrates a number of design techniques to exploitthese algorithm-specific features. The result is ahighly pipelined system which sustains a learningrate of one pattern per clock cycle. At a clock rate of20MHz each "neuron" site performs 200 millionconnection updates per second. Multiple suchneurons can be integrated onto a modestly sized VLSIdie. 000666Experimental Determination of Precision Requirements for Back-PropagationTraining of Artificial Neural NetworksKrste Asanovic, Nelson Morgantr-91-036199106__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1991/tr-91-036.pdfThe impact of reduced weight and output precision onthe back-propagation training algorithm isexperimentally determined for a feed-forwardmulti-layer perceptron. In contrast with previoussuch studies, the network is large with over 20,000weights, and is trained with a large, real-world dataset of over 130,000 patterns to perform a difficulttask, that of phoneme classification for acontinuous speech recognition system.

The resultsindicate that 16b weight values are sufficient toachieve training and classification resultscomparable to 32b floating point, provided thatweight and bias values are scaled separately, andthat rounding rather than truncation is employed toreduce the precision of intermediary values. Outputprecision can be reduced to 8 bits withoutsignificant effects on performance. 000667A Brief History of the Association for Women in Mathematics: The Presidents'PerspectivesLenore Blumtr-91-037199106__000006A talk with transparencies presented at the 20thanniversary celebration of the Association forWomen in Mathematics, January, 1991. 000668Test Complexity of Generic PoynomialsPeter Buergisser, Thomas Lickteig, Michael Shubtr-91-038199107__000006We investigate the complexity of algebraic decisionmembership in a hypersurface X propersubset (Csuperscript m). We prove an optimal lower bound on thenumber of additions, subtractions and comparisonsand an asymptotically optimal lower bound on thenumber of multiplications, divisions andcomparisons that are needed to decide membership in ageneric subsurface X propersubset (C superscriptm).

In the situation over the reals where in additionto equality branching also \leq-branching allowed,we prove an analogous statement for irreducible"generic" hypersurfaces X propersubset (Rsuperscript m). In the case m=1 we give also a lowerbound for finite subsets of X propersubset R. 000669Verification Complexity of Linear Prime IdealsPeter Buergisser, Thomas Lickteigtr-91-039199107__000006The topic of this paper is the complexity of algebraicdecision trees deciding membership in an algebraicsubset X propersubset (R superscript m) where R is areal or algebraically closed field). We define anotion of verification complexity of a (real) primeideal (in a prime cone) which gives a lower bound on thedecision complexity. We exactly determine theverification complexity of some prime ideals oflineary type generalizing a result by Winograd[Win-70]. As an application we show uniformoptimality with respect to the number ofmultiplications and divisions needed for twoalgorithms:

000670Efficient Visual Search: A Connectionist SolutionSubutai Ahmad, Stephen Omohundrotr-91-040199107__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1991/tr-91-040.pdfSearching for objects in scenes is a natural task forpeople and has been extensively studied bypsychologists. In this paper we examine this taskfrom a connectionist perspective. Computationalcomplexity arguments suggest that parallelfeed-forward networks cannot perform this taskefficiently. One difficulty is that, in order todistinguish the target from distractors, acombination of features must be associated with asingle object. Often called the binding problem,this requirement presents a serious hurdle forconnectionist models of visual processing whenmultiple objects are present. Psychophysicalexperiments suggest that people use covert visualattention to get around this problem. In this paper wedescribe a psychologically plausible system whichuses a focus of attention mechanism to locate targetobjects. A strategy that combines top-down andbottom-up information is used to minimize searchtime. The behavior of the resulting system matchesthe reaction time behavior of people in severalinteresting tasks. 000671Virtual Parallelism Support in Reconfigurable Processor ArraysMassimo Maresa, Hungwen Litr-91-041199107__000006Reconfigurable Processor Arrays (RPAs) are aspecial class of mesh connected computers in whicheach node is equipped with a switching system able tointernally interconnect its NEWS ports and toestablish paths between non-neighborhood nodes.The best known proposals in the area of RPAs are theMesh with Reconfigurable Bus [Miller, et al., 1988],the Processor Arrays with Reconfigurable BusSystems [Wang and Chen, 1990], the Gated ConnectionNetwork [Shu and Nash] and Polymorphic ProcessorArray [Li and Maresca, 1989]. In this paper we showthat only one of these architectures, namely thePolymorphic Processor Array, supports virtualparallelism. The support of virtual parallelism isimportant because it allows the complexitymeasurements of the parallel algorithms to be scaledto real implementations, where the size of theprocessor array can be smaller than the problem size.We demonstrate that: 1) the RPAs that allow theestablishment of an arbitrary shapetwo-dimensional bus do not support virtualparallelism and 2) the Polymorphic Processor Array,with its connection power to one-dimensional buses,supports virutal paralellism. 000672Hierarchical Node Clustering in Polymorphic Processor ArraysMassimo Maresa, Hungwen Litr-91-042199107__000006Massively parallel computers are implemented bymeans of modules at different packaging levels. Thispaper discusses a hierarchical node clusteringscheme (HNC) for packaging a class of reconfigurableprocessor arrays called Polymorphic ProcessorArrays (PPA) which use circuit-switching-basedrouters at each node to deliver a different topologyat every instruction. The PPA family suffers from anunknown signal delay between two arbitrary nodesconnected by the circuit- switched paths. Thiseither forces the hardware clock to compromise to theworst signal or makes the software dependent on thesystem size. The use of the HNC scheme allows one toobtain communication speed-up and automaticcontrol, at the compiler lever, over signalpropagation delay. 000673Efficiency of Asynchronous Transfer Mode Networks in Transporting Wide-AreaData TrafficRamon Cacerestr-91-043199107__000006For performance and economic reasons, ATM networksmust efficiently support the Internet family ofprotocols. We calculate the transmissionefficiency achieved by a range of ATM-relatedprotocols when transporting TCP and UDP wide-areatraffic. We also compare the efficiency efforts ofseveral non-standard compression techniques. Toassure an accurate workload characterization, wedrive these calculations with millions of wide-areapacket lenghts measured on the current Internet.

Wefind that networks using standard ATM procedures aredismally inefficient in carrying traditional datatraffic -- depending on the protocols used,efficiency as seen by an application program rangesbetween 40 and 53 percent. Moreover, due tointeraction between TCP- IP datagram lengths and ATMcell padding, efficiency responds abruptly tochanges in certain protocol parameters -- forexample, a 4-byte increase in ATM cell payload sizecan yield a 10 percent increase in efficiency. Usingone compression technique in isolation can improveefficiency by 12 percent, and simultaneously usingthree techniques can improve it by 34 percent. Theseissues should be considered when designing futureATM networks. 000674VC Dimension and Sampling Complexity of Learning Sparse Polynomials andRational FunctionsMarek Karpinski, Thorsten Werthertr-91-044199108__000006This paper presents the recent results on the VCdimension and the sampling complexity of learningsparse polynomials and rational functions. Some ofthe direct applications of these results have alsobeen presented. 000675The Automatic Worst Case Analysis of Parallel Programs: Single ParallelSorting and Algorithms on GraphsWolf Zimmermantr-91-045199108__000006No Abstract. 000676A Characterization of Space Complexity Cases and Subexponential Time Classesas Limiting Polynomially Decidable SetsGiorgio Ausiello, Marco Protasi, Michele Angelacciotr-91-046199108__000006The concept of limiting approximation, originallyintroduced by Gold for recursive functions, has beenpreviously adapted by the authors to the polynomiallevel of complexity in order to study complexityclasses of sets polynomially computable in thelimit. In this paper new results concerning thecharacterization of space complexity classes (fromPSPACE to Grzegorczyk classes) as classes of setspolynomially decidable in the limit are presented.Besides tight trade-offs between the rate ofconvergence of the approximating sequences and theconstants of their polynomially running time areshown. Finally the limiting polynomialapproximation for classes of sets between P andPSPACE is investigated under the hypothesis that P isdifferent from PSPACE. 000677CLOS, Eiffel, and Sather: A ComparisonHeinz W. Schmidt, Stephen M. Omohundrotr-91-047199109__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1991/tr-91-047.pdfThe Common Lisp Object System defines a powerful andflexible type system which builds on more than 15years of experience with object-orientedprogramming. Most current implementations includea comfortable suite of Lisp support tools includingan Emacs lisp editor, an interpreter, an incrementalcompiler, a debugger, and an inspector whichtogether promote rapid prototyping and design. Whatelse might one want from a system? We argue that statictyping yields earlier error detection, greaterrobustness, and higher efficiency and that greatersimplicity and more orthogonality in the languageconstructs leads to a shorter learning curve and moreintuitive programming. These elements can be foundin Eiffel and a new object-oriented language,Sather, that we are developing at ICSI. Languagesimplicity and static typing are not for free,though. Programmers have to pay with loss ofpolymorphism and flexibility in prototyping. Wegive a short comparison of CLOS, Eiffel and Sather,addressing both language and environment issues.

The different approaches taken by the languagesdescribed in this paper have evolved to fulfilldifferent needs. While we have only touched on theessential differences, we hope that this discussionwill be helpful in understanding the advantages anddisadvantages of each language. 000678ICSIM: An Object-Oriented Connectionist SimulatorHeinz W. Schmidt, Benedict Gomestr-91-048199111__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1991/tr-91-048.pdfICSIM is a connectionist net simulator underdevelopment at ICSI and written in Sather. It isobject-oriented to meet the requirements forflexibility and reuse of homogeneous and structuredconnectionist nets and to allow the user toencapsulate efficient customized implementationsperhaps running on dedicated hardware. Nets arecomposed by combining off-the-shelf libraryclasses and, if necessary, by specializing some oftheir behaviour. General user interface classesallow a uniform or customized graphic presentationof the nets being modeled.

The report gives anoverview of the simulator. Its main concepts, theclass structure of its library and some of the designdecisions are sketched and a number of example netsare used to illustrate how net structure,interconnection and behavior are defined. 000679VISIT: An Efficient Computational Model Of Human Visual AttentionSubutai Ahmadtr-91-049199109__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1991/tr-91-049.pdfThesis One of the challenges for models of cognitivephenomena is the development of efficient andflexible interfaces between low level sensoryinformation and high level processes. For visualprocessing, researchers have long argued that anattentional mechanism is required to perform many ofthe tasks required by high level vision. This thesispresents VISIT, a connectionist model of covertvisual attention that has been used as a vehicle forstudying this interface. The model is efficient,flexible, and is biologically plausible. Thecomplexity of the network is linear in the number ofpixels. Effective parallel strategies are used tominimize the number of iterations required. Theresulting system is able to efficiently solve twotasks that are particularly difficult for standardbottom-up models of vision: computing spatialrelations and visual search. Simulations show thatthe network's behavior matches much of the knownpsychophysical data on human visual attention. Thegeneral architecture of the model also closelymatches the known physiological data on the humanattention system. Various extensions to VISIT arediscussed, including methods for learning thecomponent modules. 000680Learning Spatial Concepts Using a Partially-Structured ConnectionistArchitectureTerry Regiertr-91-050199110__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1991/tr-91-050.pdfThis paper reports on the learning of spatialconcepts in the L0 project. The challenge ofdesigning an architecture capable of learningspatial concepts from any of the world's languages isfirst highlighted by reviewing the spatial systemsof a number of languages which differ strikingly fromEnglish in this regard. A partially structuredconnectionist architecture is presented which hassuccessfully learned concepts from the languagesoutlined. In this architecture, highly structuredsubnetworks, specialized for the spatial conceptlearning task, feed into an unstructured,fully-connected upper subnetwork. The system'ssuccess at the learning task is attributed on the onehand to the constrained search space which resultsfrom structuring, and on the other hand to theflexibility afforded by the unstructured uppersubnetwork. 000681Evaluation of Overflow Probabilities in Resource ManagementDinesh Chandra Verma, Domenico Ferraritr-91-051199110__000006In a number of network and database managementapplications, we need to evaluate an overflowprobability, which is an upper bound on theprobability that the capacity of a server will beexceeded. The problem can be essentially reduced toevaluating the probability that the sum of Nindependent random variables exceed a giventhreshold. Evaluation of this probability bybrute-force enumeration requires exponentialtime, so attempts have been made to approximate theoverflow probability by using Chernoff bounds. Thispaper presents a simple scheme that can be used toevaluate the overflow probability with a higherdegree of accuracy and lower computational effortsthan the Chernoff bound approach. 000682CHCL--A Connectionist Inference SystemSteffen Hoelldobler, Franz Kurfesstr-91-052199110__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1991/tr-91-052.pdfCHCL is a "c"onnectionist inference system for"H"orn logic which is based on the "c"onnectionmethod and uses "l"imited resources. This papergives an overview of the system and itsimplementation. 000683Unification with ICSIMFranz Kurfesstr-91-053199108__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1991/tr-91-053.pdfThis document describes the implementation of adistributed unification algorithm using theconnectionist simulator ICSIM. The algorithm isbased on S. Hoelldobler's work, as described in[Hoelldobler, 1990b]. Unification problems arespecified according to a simple language,describing the terms, functions, variables andconstants occurring in such a problem; the terms to beunified are represented as less than term_1 = term_2is greater than (e.g., less than f(x, x, x) = f(g(a), y,g(z)) is greater than).

A parser extracts relevantinformation and creates intermediate datastructures needed for the construction of theconnectionist network. Essential data structuresdescribe the symbols occurring in the terms, thehierarchical structure of the terms (functions andtheir arguments), and the occurrences of the symbolsin the terms. The connectionist unification networkis constructed based on these intermediatestructures. It is hierarchically organized, its toplevel NET consisting of POSITIONS, which correspondto the nodes in the term structure. A POSITIONconsists of a SYMBOL, either of type VARIABLE orCONSTANT. Symbols comprise a TERM UNIT and a number ofUNIFICATION UNITS, depending on the number ofpositions in the terms to be unified. Initially, TERMUNITS are set according to the occurrences of theirsymbols in the term structure; based on the linkswithin the network and the activation of UNIFICATIONUNITS, more TERM UNITS are activated as required bythe unification algorithm. The final set of activeTERM UNITS is used to construct the most generalunifier for the terms to be unified. The network can beeasily extended to detect inconsistencies in theterm structure or to perform an occur check. 000684Knowledge Selection with ANNsDimitris Karagiannis, Franz Kurfess, Heinz-Wilhelm Schmidttr-91-054199110__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1991/tr-91-054.pdf(32 Pages) The access to information contained inpossibly large knowledge bases is a crucial factor inthe usability of such a knowledge base. In this paper,we present a method to select information relevantfor a query in knowledge bases where the informationis represented in a rule-based way. An approach basedon artificial neural networks is used to pre-selectthe set of relevant rules, thus facilitating the taskof the inference mechanism by restricting the searchspace to be traversed considerably. In addition tothe information contained in the query itself, dataderived from the environment in which the query issituated is used to further trim down the searchspace. Sources for this derivation process are dataabout the task under investigation as well as thehistory of user interactions.

We refer to the firstway of diminishing the search space via the query as"identification"; the second one is referred to as"adaptation", since the selection process isadapted to the current task. The third one, takinginto account the history of interactions betweenuser and knowledge base, is called "prediction",aiming at a possible prediction of the next query, or asubset of rules relevant for the next query.

Animplementation of the artificial neural networksused for these tasks is based on ICSIM, aconnectionist simulator developed at ICSI. 000685Potentiality of Parallelism in LogicFranz Kurfesstr-91-055199110__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1991/tr-91-055.pdfThe processing of knowledge is becoming a major areaof applications for computer systems. In contrast todata processing, the current stronghold of computeruse, where well-structured data are manipulatedthrough well-defined algorithms, the treatment ofknowledge requires more intricate representationschemes as well as refined methods to manipulate therepresented information. Among the many candidatesproposed for representing and processingknowledge, logic has a number of importantadvantages, although it also suffers from somedrawbacks. One of the advantages is the availabilityof a strong formal background with a large assortmentof techniques for dealing with the representationand processing of knowledge. A considerabledisadvantage so far is the amount and complexity ofcomputation required to perform even simple tasks inthe area of logic. One promising approach to overcomethis problem is the use of parallel processingtechniques, enabling an ensemble of processingelements to cooperate in the solution of a problem.The goal of this paper is to investigate thecombination of parallelism and logic. 000686Distributed Delay Jitter Control in Packet-Switching InternetworksDomenico Ferraritr-91-056199110__000006Delay jitter is the variation of the delays with whichpackets travelling on a network connection reachtheir destination. For good quality of reception,continuous-media (video, audio, image) streamsrequire that jitter be kept below a sufficientlysmall upper bound. This paper proposes a distributedmechanism for controlling delay jitter in apacket-switching network. The mechanism can beapplied to an internetwork that satisfies theconditions detailed in the paper, and can coexistwith other schemes (including the absence of anyscheme) for jitter control within the same network,the same node, and even the same real-time channel.The mechanism can guarantee small jitter bounds evenwhen the clocks of the host systems and the gatewaysalong a channel's route are only looselysynchronized; furthermore, it makes thedistribution of buffer space requirements moreuniform over the channel's route, and reduces by anon-neglible amount the total buffer space needed bya channel. The paper argues that, if these advantagesare sufficient to justify the higher costs of thedistributed jitter control mechanism with respectto a non-distributed one, it would be useful to offerto the network's users a jitter control service basedon the mechanism proposed here. 000687A Method for Obtaining Randomized Algorithms with Small Tail ProbabilitiesH. Alt, L. Guibas, K. Mehlhorn, R. Karp, A. Wigdersontr-91-057199109__000006We study strategies for converting randomizedalgorithms of the Las Vegas type into randomizedalgorithms with small tail probabilities. 000688Detecting Skewed SymmetriesStefan Poschtr-91-058199110__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1991/tr-91-058.pdfMany surfaces of objects in our world are bounded byplanar bilaterally symmetric figures. When thesefigures are imaged under orthographic projection askewed symmetric contour results. In this paper a newfast, local method to recover skewed symmetries fromcurve segments is proposed. It can be applied tocomplete as well as to occluded contours.Furthermore, the skewed symmetry property isemployed to overcome fragmentation of a contourduring segmentation. 000689Line Labeling Using Markov Random FieldsTerry Regiertr-91-059199110__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1991/tr-91-059.pdfThe task of obtaining a line labeling from a greyscaleimage of trihedral objects presents difficultiesnot found in the classical line labeling problem. Asoriginally formulated, the line labeling problemassumed that each junction was correctlypre-classified as being of a particular junctiontype (e.g. T, Y, arrow); the success of the algorithmsproposed have depended critically upon getting thisinitial junction classification correct. In realimages, however, junctions of different types mayactually look quite similar, and thispre-classification is often difficult to achieve.This issue is addressed by recasting the linelabeling problem in terms of a coupled probabilisticsystem which labels both lines and junctions. Thisresults in a robust system, in which prior knowledgeof acceptable configurations can serve to overcomethe problem of misleading or ambiguous evidence. 000690Oracle Computations in Parallel Numerical Linear AlgebraB. Codenotti, M. Leoncini, G. Restatr-91-060199110__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1991/tr-91-060.pdfWe analyze the relative complexity of severalnumerical linear algebra problems, when errors inthe computation occur. We show that the simpleparallel complexity classes of the exact case do notseem to preserve under approximation. 000691Combinatory Differential Fields: An Algebraic Approach to ApproximateComputation and Constructive AnalysisKarl Aberertr-91-061199110__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1991/tr-91-061.pdfThe algebraic structure of combinatorydifferential fields is constructed to provide asemantics for computations in analysis. In thissetting programs, approximations, limits andoperations of analysis are represented as algebraicterms. Analytic algorithms can be derived byalgebraic methods. The main tool in thisconstruction are combinatory models which are inneralgebras of Engeler graph models. As an universaldomain of denotational semantics the latticestructure of the graph models allows to give astriking simple semantics for computations withapproximations. As models of combinatory algebrathey provide all essential computationalconstructs, including recursion. Combinatorymodels are constructed as extensions of first ordertheories. The classical first order theory todescribe analysis is the theory of differentialfields. It turns out that two types of computationalconstructs, namely composition and piecewisedefinition of functions, are preferably introducedas extensions of the differential fields theory.Combinatory differential fields are then thecombinatory models of these enriched differentialfields. We show for basic algorithms ofcomputational analysis how their combinatorycounterparts are derived in the algebraic setting.We illustrate how these algorithms are suitable to beimplemented in a computer algebra environment likemathematica. 000692Self-Testing/Correcting with Applications to Numerical Problems (RevisedVersion)Manuel Blum, Michael Luby, Ronitt Rubinfeldtr-91-062199111__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1991/tr-91-062.pdfSuppose someone gives us an extremely fast program$P$ that we can call as a black box to compute a function$f$. Should we trust that $P$ works correctly? A {\emself-testing/correcting pair} for $f$ allows us to:(1) estimate the probability that $P(x) \not= f(x)$when $x$ is randomly chosen; (2) on {\em any} input$x$, compute $f(x)$ correctly as long as $P$ is not toofaulty on average. Furthermore, both (1) and (2) taketime only slightly more than the original runningtime of $P$.

We present general techniques forconstructing simple to programself-testing/\-correcting pairs for a variety ofnumerical functions, including integermultiplication, modular multiplication, matrixmultiplication, inverting matrices, computing thedeterminant of a matrix, computing the rank of amatrix, integer division, modular exponentiationand polynomial multiplication. 000693How to Solve Interval Constraint Networks: The Definitive Answer - ProbablyPeter Ladkin, Alexander Reinefeldtr-91-063199111__000006We implemented and tested an algorithm for solvinginterval constraint problems which returnedsolutions in less than or equal to 0.5 seconds on theaverage, with the hardest problem taking less than orequal to 0.5 minutes on a RISC workstation. This is asurprising result considering the problem is knownto be NP-complete. We conclude that our algorithmsuffices for solving random interval constraintproblems in practice.

Other conclusions are thatpath-consistency is an excellent pruning techniquefor solution search, which becomes almost a linearselection of atomic labels; also thatpath-consistency by itself is an excellentconsistency heuristic for networks with fewer thansix or greater than 15 nodes. We tested the algorithmon over two million randomly generated intervalnetworks of various sizes, hence our title. 000694Distortion Accumulation in Image Transform Coding/Decoding CascadesMichael Gilgetr-91-064199112__000006With an increasing number of applications thatemploy transform coding algorithms for datareduction, the effect of distortion accumulationcaused by multiple coding needs to be investigated.Multiple coding occurs when more than one codingsystem is connected in a cascade. From the secondstage on, the coding algorithm operates on data thathas been previously coded/decoded. First a genericimage communication system is being modelled andsituations that can lead to distortion accumulationare analyzed. These results show two main reasons fordistortion accumulation, which are separately andjointly investigated using a JPEG-type compressionalgorithm. The first situation involves geometricoperations between the decoding and next codingstep. Measurements show however that these spatialmanipulations are the main contributors todistortion accumulation. The second reason fordistortion accumulation is a misalignment of theblock segmentation reference point in subsequenttransform operations. A block raster detectionalgorithm is derived that can find the position of theblock raster that was introduced in a previous codingstep. If this information is used in the blocksegmentation of the following coding step,distortion accumulation can be avoided. Simulationresults are given for an extended algorithm thatregisters regions of homogeneous block raster inimages consisting of several subimages. 000695Motion Video Coding for Packet-Switching Networks -- An Integrated ApproachMichael Gilge, Riccardo Gusellatr-91-065199112__000006NOTE: This postscript file will preview just fine,but on most postscript printers it will refuse toprint past page 4. Hence the .BAD tag. This file isofferred AS-IS, and will likely not ever be fixed. Theadvantages of packet video, constant image quality,service integration and statistical multiplexing,are overshadowed by packet loss, delay and jitter. Byintegrating network-control into the image datacompression algorithm, the strong interactionsbetween the coder and the network can be exploited andthe available network bandwidth can be used best. Inorder to enable video transmission over today'snetworks without reservation or priorities and inthe presence of high packet loss rates, congestionavoidance techniques need to be employed. This isachieved through rate and flow control, wherefeedback from the network is used to adapt codingparameters and vary the output rate. From the codingpoint of view the network is seen as data buffer.Analogously to constant bit rate applications,where a controller measures buffer fullness, weattempt to avoid network congestion (eq. bufferoverflow) by monitoring the network and adapting thecoding parameters in real-time. 000696A Graph-Theoretic Game and its Application to the k-Server ProblemNoga Alon, Richard M. Karp, David Peleg, Douglas Westtr-91-066199112__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1991/tr-91-066.pdfThis paper investigates a zero-sum game played on aweighted connected graph G between two players,the tree player and the edge player. Ateach play, the tree player chooses a spanning tree Tand the edge player chooses an edge e. The payoff tothe edge player is cost(T,e), defined as follows:If e lies in the tree T then cost(T,e)=0;if e does not lie in the tree then cost(T,e) =cycle(T,e)/w(e), where w(e) is the weight of edgee and cycle(T,e) is the weight of the unique cycleformed when edge e is added to the tree T. Our mainresult is that the value of the game on any n-vertexgraph is bounded above by \exp(O(\sqrt{\log n\log\log n})).

The game arises in connection withthe k-server problem on a road network; i.e.,a metric space that can be represented as a multigraphG in which each edge e represents a road of lengthw(e). We show that, if the value of the game on G isVal(G,w), then there is a randomized strategythat achieves a competitive ratio of k(1 +Val(G,w)) against any oblivious adversary. Thus,on any n-vertex road network, there is a randomizedalgorithm for the k-server problem that isk\cdot\exp(O(\sqrt{\log n \log\logn}))-competitive against oblivious adversaries.

At the heart of our analysis of the game is an algorithmthat, for any n-vertex weighted, connectedmultigraph, constructs a spanning tree T such thatthe average, over all edges e, of cost(T,e) isless than or equal to \exp(O(\sqrt{\log n \log\logn})). This result has potential application to thedesign of communication networks.

[The on-line copy of this technical report was created from a laterversion (1992). A revised and expanded version of the paper appeared inthe SIAM J. on Computing, Volume 24, (1995), pages 78-100.] 000697Probabilistic Recurrence Relations for Parallel Divide-and-ConquerAlgorithmsMarek Karpinski, Wolf Zimmermanntr-91-067199112__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1991/tr-91-067.pdfWe study two probabilistic recurrence relationsthat arise frequently in the analysis of parallel andsequential divide-and-conquer algorithms (cf.[Karp 91]). Suppose a problem of size x has to besolved. In order to solve it we divide it intosubproblems of size h_1(x), ... ,h_k(x) and thesesubproblems are solved recursively. We assume thatsize(h_i(z)) are random variables. This occurs ifeither the break up step is randomized or theinstances to be solved are drawn from a probabilitydistribution. The running time T(z) of a parallelalgorithm is therefore determined by the maximum ofthe running times T(h_i(z)) of the subproblems whilethe sequential algorithm is determined by the sum ofthe running times of the subproblems. We give a methodfor estimating tight upper bounds on the probabilitydistribution of T(x) for these two kinds ofrecurrence relations, answering the open questionsin [Karp 91].

Keywords: Probabilistic Recurrence Relations, Devide-and-Conquer Algorithms, Parallel Algorithms, Upper Bounds on Probability Distribution. 000698Construction of a pseudo-random generator from any one-way functionJohan Hastad, Russell Impagliazzo, Leonid A. Levin, Michael Lubytr-91-068199112__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1991/tr-91-068.pdfWe show how to construct a pseudo-random generatorfrom any one-way function. In contrast, previousworks have constructed pseudo-random generatorsonly from one-way functions with special structuralproperties. Our overall approach is different inspirit from previous work; we concentrate onextracting and smoothing entropy from a singleiteration of the one-way function using universalhash functions. 000699RASTA-PLP Speech AnalysisHynek Hermansky, Nelson Morgan, Aruna Bayya, Phil Kohntr-91-069199112__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1991/tr-91-069.pdfMost speech parameter estimation techniques areeasily influenced by the frequency response of thecommunication channel. We have developed atechnique that is more robust to such steady-statespectral factors in speech. The approach isconceptually simple and computationallyefficient. The new method is described, andexperimental results are reported, showing asignificant advantage for the proposed method. 000700Connectionist Speech Recognition: Status and ProspectsSteve Renals, Nelson Morgan, Herve Bourlard, Michael Cohen, Horacio Franco, Chuck Wooters, Phil Kohntr-91-070199112__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1991/tr-91-070.pdfWe report on recent advances in the ICSIconnectionist speech recognition project.Highlights include:

1. Experimental resultsshowing that connectionist methods can improve theperformance of a context independent maximumlikelihood trained HMM system, resulting in aperformance close to that achieved using state of theart context dependent HMM systems of much highercomplexity;

2. Mixing (context independent)connectionist probability estimates with maximumlikelihood trained context dependent models toimprove the performance of a state of the art system;

3. The development of a network decomposition methodthat allows connectionist modelling of contextdependent phones efficiently and parsimoniously,with no statistical independence assumptions.

000701GDNN: A Gender-Dependent Neural Network for Continuous Speech RecognitionYochai Konig, Nelson Morgan, Claudia Chandratr-91-071199112__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1991/tr-91-071.pdfConventional speaker-independent speechrecognition systems do not considerspeaker-dependent parameters in the probabilityestimation of phonemes. These recognition systemsare instead tuned to the ensemble statistics overmany speakers. Most parametric representations ofspeech, however, are highly speaker dependent, andprobability distributions suitable for a certainspeaker may not perform as well for other speakers. Itwould be desirable to incorporate constraints onanalysis that rely on the same speaker producing allthe frames in an utterance. Our experiments take afirst step towards this speaker consistencymodeling by using a classification network to helpgenerate gender-dependent phonetic probabilitiesfor a statistical recognition system. Our resultsshow a good classification rate for the genderclassification net. Simple use of such a model toaugment an existing larger network that estimatesphonetic probabilities does not help speechrecognition performance. However, when the new netis properly integrated in an HMM recognizer, itprovides significant improvement in word accuracy. 000702SPERT: A VLIW/SIMD Microprocessor for Artificial Neural Network ComputationsKrste Asanovic, James Beck, Brian E. D. Kingsbury, Phil Kohn, Nelson Morgan, John Wawrzynektr-91-072199112__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1991/tr-91-072.pdfSPERT (Synthetic PERceptron Testbed) is a fullyprogrammable single chip microprocessor designedfor efficient execution of artificial neuralnetwork algorithms. The first implementation willbe in a 1.2 micron CMOS technology with a 50MHz clockrate, and a prototype system is being designed tooccupy a double SBus slot within a Sun Sparcstation.

SPERT will sustain over 300 million connections persecond during pattern classification, and around100 million connection updates per second whilerunning the popular error backpropagation trainingalgorithm. This represents a speedup of around twoorders of magnitude over a Sparcstation-2 foralgorithms of interest. An earlier system producedby our group, the Ring Array Processor (RAP), usedcommercial DSP chips. Compared with a RAPmultiprocessor of similar performance, SPERTrepresents over an order of magnitude reduction incost for problems where fixed-point arithmetic issatisfactory.

This report describes the currentarchitecture, and gives the results of detailedsimulations. The report also makes a shortcomparison to other high-performance digitalneurocomputing chips. 000703Connectionist Layered Object-Oriented Network Simulator (CLONES): User'sManualPhil Kohntr-91-073199112__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1991/tr-91-073.pdfCLONES is an object-oriented library for constructing, training andutilizing layered connectionist networks. The CLONES library containsall the object classes needed to write a simulator with a smallamount of added source code (examples are included). The size ofexperimental ANN programs is greatly reduced by using an object-oriented library; at the same time these programs are easier to read,write and evolve. The library includes database, network behaviorand training procedures that can be customized by the user. It isdesigned to run efficiently on data parallel computers (such as theRAP [6] and SPERT [1]) as well as uniprocessor workstations. Whileefficiency and portability to parallel computers are the primarygoals, there are several secondary design goals:



1. minimize the learning curve for using CLONES,

2. minimize the additional code required for new experiments,

3. allow heterogeneous algorithms and training procedures to beinterconnected and trained together.



Within these constraints we attempt to maximize the variety ofartificial neural network algorithms that can be supported. 000704Recent Work in VLSI Elements for Digital Implementations of ArtificialNeural NetworksBrian E. D. Kingsbury, Bertrand Irissou, Krste Asanovic, John Wawrzynek, Nelson Morgantr-91-074199112__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1991/tr-91-074.pdfA family of high-performance, area-efficient VLSIelements is being developed to simplify the design ofartificial neural network processors. Thelibraries are designed around the MOSIS ScalableCMOS design rules, giving users the option offabricating designs in 2.0um or 1.2um n-wellprocesses, and greatly simplifying migration of thelibraries to new MOSIS technologies. To date,libraries and generators have been created forsaturating and nonsaturating adders, atwo's-complement multiplier, and a triple-portedregister file. The SPERT processor currently beingdesigned at ICSI will be based upon these libraries,and is expected to run at 50 MHz when realized in a 1.2umCMOS technology. 000705Incomplete Factorizations for Certain Toeplitz matricesC. Bernini, B. Codenotti, M. Leoncini, G. Restatr-91-075199112__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1991/tr-91-075.pdfWe propose some incomplete factorizations for banded Toeplitz matrices andwe show their application to the direct and iterative solution of severalspecial Toeplitz linear systems. 000706Real-Time Communication in an Internetwork;Domenico Ferraritr-92-001199201__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-001.pdfCan end-to-end communication performance beguaranteed by a packet-switching internetwork?This paper addresses the question by examining thefeasibility of extending to an internetwork theTenet approach to real-time communication servicedesign. The conditions to be satisfied by aninternetwork so that the approach can be extended toit are investigated. These include conditions forthe scheduling discipline to be used in the nodes ofthe internetwork.

The original Tenet approach toreal-time communication applies to a networkconsisting of hosts, homogeneous nodes (orswitches), and physical links connecting nodes andhosts in an arbitrary topology. The nodes arestore-and-forward, and are scheduled by amulti-class version of the Earliest Due Datedeadline-based policy.

The discussion presented inthis paper results in extendibility conditions thatare quite broad; hence, the Tenet approach may be usedto establish and run real-time channels in a vastclass of internetworks. A case study is alsodiscussed, involving a simple network, whose nodesare scheduled by FCFS-based disciplines, and theconnection of such a network to an internetwork withdeadline-based and hierarchical round robinscheduling. 000707Constraint Relaxation and Nonmonotonic ReasoningGerhard Brewka, Hans Werner Guesgen, Joachim Hertzbergtr-92-002199201__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-002.pdfThe purpose of this paper is to bring together the twoAI areas of constraint-based and nonmonotonicreasoning. In particular, we analyze the relationbetween different forms of constraint relaxationand a particular approach to nonmonotonicreasoning, namely, preferred subtheories. Ineffect, we provide formal semantics for therespective forms of constraint relaxation. 000708Rate-Controlled Static Priority QueueingHui Zhang, Domenico Ferrraritr-92-003199201__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-003.pdfWe propose a new service discipline, called theRate-Controlled Static-Priority (RCSP) queueingdiscipline, that can provide throughput, delay,delay jitter, and loss free guarantees in aconnection-oriented packet-switching network.The proposed RCSP queueing discipline avoidsproblems in previous proposed solutions. Itachieves flexibility in the allocation of delay andbandwidth, as well as simplicity of implementation.The key idea is to separate rate-control anddelay-control functions in the design of the server.Applying this separation of functions will result ina class of service disciplines, of which RCSP is aninstance. 000709Best-First Model Merging for Dynamic Learning and RecognitionStephen M. Omohundrotr-92-004199201__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-004.pdf"Best-first model merging" is a general techniquefor dynamically choosing the structure of a neural orrelated architecture while avoiding overfitting.It is applicable to both learning and recognitiontasks and often generalizes significantly betterthan fixed structures. We demonstrate the approachapplied to the tasks of choosing radial basisfunctions for function learning, choosing localaffine models for curve and constraint surfacemodelling, and choosing the structure of a balltreeor bumptree to maximize efficiency of access. 000710New algorithmic results for lines-in-3-space problemsLeonidas J. Guibas and Marco Pellegrinitr-92-005199202__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-005.pdfIn the first part of the report we consider someincidence and ordering problems for lines in3-space. We solve the problem of detectingefficiently if a query simplex is collision-freeamong polyhedral obstacles. In order to solve thisproblem we develop new on-line data structures todetect intersections of query halfplanes with setsof lines and segments.

Then, we consider thenearest-neighbor problems for lines. Given a setof$n$ lines in 3-space, the shortest verticalsegment between any pair of lines is found inrandomized expected time $O(n^{8/5+\epsilon})$for every $\eps>0$. The longest connecting verticalsegment is found in time $O(n^{4/3+\eps})$. Theshortest connecting segment is found in time$O(n^{5/3 + \epsilon})$.

Problems involving lines,points and spheres in 3-space have importantapplications in graphics, CAD and optimization. Inthe second part of the report we consider severalproblems of this kind. We give subquaraticalgorithms to count the number of incidences betweena set of lines and a set of spheres, and to find theminimum distance between a set of lines and a set ofpoints. We show that the sphere of minimum radiusintersecting every line in a set of $n$ lines can befound in optimal expected time $O(n)$. Given $m$possibly intersecting spheres we solveray-shooting queries in $O(\log^2 m)$ time using adata structure of size $O(m^{5+\eps})$.

Thistechnical report collects part of the secondauthor's work at ICSI from September 1991 toJanuary 1992. 000711The LOGIDATA+ Object AlgebraUmberto Nanni, Silvio Salza, Mario Terranovatr-92-006199202__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-006.pdfIn this paper we present the LOGIDATA+ Object Algebra(LOA), an algebra for complex objects which has beendeveloped within the LOGIDATA project funded by theItalian National Research Council (CNR). LOGIDATA+is intended to provide a rule based language on a datamodel with structured data types, object identityand sharing. LOA is a set-oriented manipulationlanguage which was conceived as an internal languagefor a prototype system supporting such a richenvironment. The algebra refers to a data model thatincludes structured data types and object identity,thus allowing both classes of objects andvalue-based relations.

LOA must deal with a rulebased language with possible recursive programswith limited forms of negation. LOA programsexplicitly include a "fixpoint" operator over a setof algebraic equations. Figures are omitted in theftp-able version of the paper. A complete version isavailable from ICSI. 000712The LOGIDATA+ Prototype SystemUmberto Nanni, Silvio Salza, Mario Terranovatr-92-007199202__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-007.pdfIn this paper we present a prototype system developedwithin LOGIDATA+, a national project funded by theItalian National Research Council (CNR). Theprototype supports a rule based language on a datamodel with structured data types, object identityand sharing. The system has an interactive userinterface, with a unit of interaction consisting of aLOGIDATA+ program, to extract information from theknowledge base and/or modify the schema. A programconsists of a set of rules, and of additionaldirectives to handle the data output and/or theupdates to the schema. The prototype handles atemporary (user) environment where updates areperformed and a permanent one, updated on request.The system uses LOA (LOGIDATA+ Object Algebra) as anintermediate internal language (see ICSItr-92-006.ps.gz). User programs are translatedinto LOA programs, i.e. sequences of fixpointsystems of algebraic equations. The prototype isbuilt on the top of a relational DBMS, that handles SQLtransactions and provides the basic support for thepermanent storage of data as well as for concurrencycontrol and recovery. A main memory database has beenincluded in the architecture, to improve theperformance in the evaluation of the fixpointsystems, by keeping in main memory the intermediateresults. Figures are omitted in the ftp-able versionof the paper. A complete version is available fromICSI. 000713Linear Time Algorithms for Liveness and Boundedness in Conflict-free PetriNetsPaola Alimonti, Esteban Feuerstain, Umberto Nannitr-92-008199202__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-008.pdfIn this paper we consider the problems of deciding theset of potentially firable transitions, theliveness and boundedness for the class ofConflict-Free Petri Nets. For these problems wepropose algorithms which are linear in the size of thedescription of the net, dramatically improving thebest previous known results for these problems.Moreover the algorithm for the first problem isincremental: it is possible to perform an arbitrarysequence of updates, introducing new transitionsand increasing the initial marking of the net, andqueries, asking whether any transition is firable orany place reachable. Queries are answered inconstant time, and the total cost for all themodifications is still linear in the size of the finalnet. Our approach is based on a representation ofconflict-free Petri nets by means of directedhypergraphs. Figures are omitted in the ftp-ableversion of the paper. A complete version is availablefrom ICSI. 000714Fish in Schools or Fish in Cans Evolutionary Thinking and FormalizationDirk Siefkestr-92-009199202__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-009.pdfGregory Bateson maintains that individualdevelopment and natural evolution follow the sameprinciples --he parallels learning and evolution. Itry to establish the precise mechanism of humanlearning by attributing the role of genes toconcepts. We develop our thoughts conceptuallythrough selection, in the same way that living beingsdevelop genetically. Thus, thoughts evolve in ourmind like fish in a cove, thoughts yielding conceptsas the genetic material from which new thoughtsarise. 000715A New Algorithm for Counting Circular Arc IntersectionsMarco Pellegrinitr-92-010199202__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-010.pdfWe discuss the following problem: given a collection$\Gamma$ of $n$ circular arcs in the plane, count allintersections between arcs of $\Gamma$. We presentan algorithm whose expected running time is$O(n^{3/2+\eps})$, for every $\eps >0$. If the arcshave all the same radius the expected time bound is$O(n^{4/3+\eps})$, for every $\eps>0$. Bothresults improve on the time bounds of previouslyknown asymptotically fastest algorithms. Thetechnique we use is quite general and it is applicableto other counting problems. 000716The Weighted List Update Problem and the Lazy AdversaryFabrizio d'Amore, Alberto Marchetti-Spaccamela, Umberto Nannitr-92-011199202__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-011.pdfThe "List Update Problem" consists in maintaining a dictionary as anunsorted linear list. Any request specifies an item to be found bysequential scanning through the list. After an item has been found,the list may be rearranged in order to reduce the cost of processinga "sequence" of requests.

Several kinds of adversaries can be considered to analyze thebehavior of heuristics for this problem. The "Move-to-Front" (MTF)heuristic is 2-competitive against a "strong" adversary, matchingthe deterministic lower bound for this problem [21].

But, for this problem, moving elements does not help the adversary.A "lazy" adversary has the limitation that he can use only a staticarrangement of the list to process (off-line) the sequence ofrequests: still, no algorithm can be better than 2-competitiveagainst the lazy adversary [3].

In this paper we consider the "Weighted List Update Problem"(WLUP) where the cost of accessing an item depends on the itemitself. It is shown that MTF is not competitive by any consentfactor for this problem against a lazy adversary. Two heuristics,based on the MTF strategy, are presented for WLUP: "Random Move-to-Front" is randomized and uses biased coins; "Counting Move-to-Front" is deterministic, and replaces coins by counters. Both areshown to be 2-competitive against a lazy adversary. This isoptimal for the deterministic case.

We apply this approach for searching items in a tree, proving thatany "c"-competitive heuristic for the weighted list updateproblem provides a "c"-competitive heuristic for the "Tree UpdateProblem". 000717Towards a Complexity Theory for ApproximationKarl Aberer, Bruno Codenottitr-92-012199202__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-012.pdfThis paper presents a novel approach to the analysisof numerical problems, which is closely related tothe actual nature of numerical algorithms. In fact,models of computation are introduced which take intoaccount such issues as adaptivity and error.Moreover, complexity vs error bounds and examplesregarding the role of adaptivity are provided.Finally, it is shown that the overall approach fitsnaturally into an algebraic framework. 000718Competitive On-line Algorithms for Paging and Graph ColoringSandy Iranitr-92-013199201__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-013.pdfWe analyze the competitiveness of on-linealgorithms for two problems: paging and on-linegraph coloring. In the first problem, we develop arefinement of competitive analysis for pagingalgorithms which addresses some of the areas wheretraditional competitive analysis fails torepresent what is observed in practice. For example,traditional competitive analysis is unable todiscern between LRU and FIFO, although in practiceLRU performs much better than FIFO. In addition, thetheoretical competitiveness of LRU is much morepessimistic than what is observed in practice. Wealso address the following important question:given some knowledge of a program's referencepattern, can we use it to improve paging performanceon that program?

We address these concerns byintroducing an important practical element thatunderlies the philosophy behind paging: locality ofreference. We devise a graph-theoretical model, theaccess graph, for studying locality of reference.

The second problem that we consider is on-line graphcoloring. In the spirit of competitiveness, weevaluate on-line graph coloring algorithms by theirperformance ratio which measures the number ofcolors the algorithm uses in comparison to thechromatic number of the graph. We consider the classof d-inductive graphs. A graph G is d-inductive if thevertices of G can be numbered so that each vertex has atmost d edges to higher numbered vertices. We analyzethe greedy algorithm and show that if G is d-inductivethen FF uses O( d log n) colors on G. We show that thisbound is tight. Since planar graphs are 5-inductive,and chordal graphs are c(G)-inductive, (where c(G)is the chromatic number of the graph G), our resultsyield bounds on the performance ratio of greedy onthese important classes of graphs. We also examineon-line graph coloring with lookahead. An algorithmis on-line with lookahead l, if it must color vertex iafter examining only the first l+i vertices. We showthat for l < (n / log n) no on-line algorithm withlookahead l can perform better than First Fit ond-inductive graphs.

Keywords: on-line algorithms, competitive analysis, paging, locality of reference, on-line graph coloring, lookahead. 000719Backwards Analysis of Randomized Geometric AlgorithmsRaimund Seideltr-92-014199202__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-014.pdfThe theme of this paper is a rather simple method thathas proved very potent in the analysis of the expectedperformance of various randomized algorithms anddata structures in computational geometry. Themethod can be described as ``analyze a randomizedalgorithm as if it were running backwards in time,from output to input.'' We apply this type of analysisto a variety of algorithms, old and new, and obtainsolutions with optimal or near optimal expectedperformance for a plethora of problems incomputational geometry, such as computing Delaunaytriangulations of convex polygons, computingconvex hulls of point sets in the plane or in higherdimensions, sorting, intersecting line segments,linear programming with a fixed number ofvariables,and others. 000720Queueing Delays in Rate Controlled NetworksAnindo Banerjea, Srinivasan Keshavtr-92-015199203__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-015.pdfThis paper addresses the problem of finding the worstcase end-to-end delay and buffer occupancy bounds innetworks of rate-controlled, non-work conservingservers.

The calculations are based on a simple fluidmodel, but care is taken so that the computed delay andbuffer occupancy values are upper bounds on actualvalues. A simple algorithm is presented to performthese calculations in linear time.

Simulationresults compare the computed worst case delays withthe actual delays obtained on some simple networktopologies. The algorithm is found to predict nodedelays well for bursty input traffic, but poorly forsmooth input traffic. Buffer requirements arepredicted well in both cases. 000721A Framework for the Study of Pricing in Integrated NetworksColin J. Parris, Srinivasan Keshav, Domenico Ferraritr-92-016199203__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-016.pdfIntegrated networks of the near future are expectedto provide a wide variety of services, which couldconsume widely differing resources. We present aframework for pricing services in integratednetworks, and study the effect of pricing on userbehavior and network performance. We first describea network model that is simple, yet models detailssuch as the wealth distribution in society,different classes of service, peak and off-peaktraffic and call blocking due to budgetaryconstraints.

We then perform experiments to studythe effect of setup, per packet, and peak load priceson the blocking probability of two classes of callspassing through a single node enforcing admissioncontrol. Some selected results are that a)increasing prices first increases the net revenue toa provider, then causes a decrease b) peak-loadpricing spreads network utilization more evenly,raising revenue while simultaneously reducing callblocking probability.

Finally, we introduce a novelmetric for comparing pricing schemes, and prove thatfor the most part, a pricing scheme involving setupprices is better than a pricing scheme with no setupcost. 000722The Sather Language and LibrariesStephen Omohundro, Chu-Cheow Limtr-92-017199203__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-017.pdfSather is an object-oriented language derived fromEiffel which is particularly well suited for theneeds of scientific research groups. It is designedto be very efficient and simple while supportingstrong typing, garbage collection,object-oriented dispatch, multiple inheritance,parameterized types, and a clean syntax. It compilesinto portable C code and easily links with existing Ccode. The compiler, debugger and several hundredlibrary classes are freely available by anonymousFTP. This paper describes aspects of the languagedesign, implementation and libraries. 000723A Resource Based Pricing Policy for Real-Time Channels in a Packet-SwitchingNetworkColin J. Parris, Domenico Ferraritr-92-018199203__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-018.pdfIn the packet switching networks of the future theneed for guaranteed performance on a wide variety oftraffic characteristics will be of paramountimportance. The generation of revenue, to recovercosts and provide profit, and the multiple type ofservices offered will require that new pricingpolicies be implemented.

This paper presents aresource based pricing policy for real-timechannels ( ie., channels with guaranteedperformance ) in a packet switching network. Thepolicy is based on a set of specific criteria, and thecharges for any channel are based on the resourcesreserved for use by the channel. This reservationcharge is based on the type of service requested, thetime of day during which the channel exists, and thelifetime of the channel. We argue that thetraditional resources are not sufficient todetermine a fair reservation charge for a channeloffering guaranteed delay bounds, and we introducethe notion of a delay resource in our chargingformula. The type of service requested is thuscharacterized by the amount of the bandwidth, bufferspace, CPU, and delay resources reserved. Theanalysis of this pricing policy is reduced to theanalysis of a single node of the network, assuming ahomogeneous network. This single-nodecharacteristic increases the scalability andflexibility of the policy. An example of animplementation of this policy is provided. 000724Design of a Continuous Media Data Transport Service and ProtocolMark Moran, Bernd Wolfingertr-92-019199204__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-019.pdfApplications with real-time data transportrequirements fall into two categories: those whichrequire transmission of data units at regularintervals, which we call continuous media (CM)clients, e.g. video conferencing, voicecommunication, high-quality digital sound; andthose which generate data for transmission atrelatively arbitrary times, which we call real-timemessage-oriented clients. Because CM clients arebetter able to characterize their future behaviorthan message-oriented clients, a data transportservice dedicated for CM clients can use this a prioriknowledge to more accurately predict their futureresource demands. Therefore, a separate transportservice can potentially provide a morecost-effective service along with additionalfunctionality to support CM clients. The design ofsuch a data transport service for CM clients and itsunderlying protocol (within the BLANCA gigabittestbed project) will be presented in this document.This service provides unreliable, in-sequencetransfer (simplex, periodic) of so-called streamdata units (STDUs) between a sending and a receivingclient, with performance guarantees on loss, delay,and throughput. 000725Read-Once Threshold Formulas, Justifying Assignments, and GenericTranformationsNader H. Bshouty, Thomas R. Hancock, Lisa Hellerstein, Marek Karpinskitr-92-020199203__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-020.pdfWe present a membership query (i.e. interpolation)algorithm for exactly identifying the class ofread-once formulas over the basis of booleanthreshold functions. Using a generictransformation from [Angluin, Hellerstein,Karpinski 89], this gives an algorithm usingmembership and equivalence queries for exactlyidentifying the class of read-once formulas over thebasis of boolean threshold functions and negation.We also present a series of generic transfor- mationsthat can be used to convert an algorithm in onelearning model into an algorithm in a differentmodel.

Keywords: Learning Algorithms, Queries, Read-Once Formulas, Threshold Functions. 000726Local Properties of Some NP-Complete ProblemsBruno Codenotti, Luciano Margaratr-92-021199204__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-021.pdfIt has been shown that certain NP-complete problems,i.e. TSP, min cut, and graph partitioning, withspecific notions of neighborhood, satisfy a simpledifference equation. In this paper, we extend theseresults by proving that TSP with 2-change,2+3-new-change, and 3-new-change notions ofneighborhood satisfy such a difference equation,and we derive some properties of local search whenperformed with the above definitions ofneighborhood. 000727Petri Net Based Software Validation: Prospects and LimitationsMonika Heinertr-92-022199203__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-022.pdfPetri net based software validation to check thesynchronization structure against some data orcontrol flow anomalies (like unboundednesss ornon-liveness) has been a well-known and widely usedapproach for about ten years. To decrease thecomplexity problem and because the simpler themodel, the more efficient the analysis, thevalidation is usually tried with the help of placetransition Petri nets. However, the modelling withthis Petri net class involves two importantabstractions of actual software properties -- thetime consumption of any action and the datadependencies among conflict decisions. Basically,this paper discusses some problems resulting fromthese abstractions in the models analyzed which arevery often neglected and have therefore not been wellunderstood up to now. Furthermore, discussing thepros and cons of the Petri net approach is done byoffering a rough overview of the given background ofdependable distributed software engineering.Suggestions for a related workstation supportingdifferent net-based methods are outlined. 000728Quality-of-Service Negotiation in a Real-Time Communication NetworkJean Ramaekers, Giorgio Ventretr-92-023199204__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-023.pdfIn the recent years new protocols and algorithms havebeen proposed to guarantee performance andreliability in exchanging data in real-timecommunication networks, and new services have beenpresented to allow cooperative office work,distributed conferencing, etc. Less attention hasbeen paid to how applications and, more generally,clients of real-time communication services caninteract with the network in order to specify andnegotiate the quality-of-service of a connection.We believe that this problem is going to become a keyissue for the success of future distributed systems,since it affects both client and networkperformances. In this paper we present a newmechanism for the establishment of real-timeconnections in a quality-of-service networkdeveloped for the Tenet real-time protocol suite. Byimproving the information exchanged between thenetwork and the clients, the model allows to reducethe complexity and the time required to establish areal-time connection, and increases the networkutilization. Additionally, we introduced a newclass of real-time communication service to supportadaptive quality-of-service, in order to enhancethe possibilities of the network to face congestionsituations. 000729Communicating with Low-Diffraction Lasers and MirrorsRichard Beigeltr-92-024199204__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-024.pdfOptical interconnection networks, in which eachprocessor contains a set of lasers for communicationwith other processors, have long been studied. In the``regular optics'' model of Murdocca a boundednumber of planar mirrors are used to redirect lightbeams, and each processor has a bounded number oflasers directed at a fixed set of angles, independentof the processor.

It is theoretically interesting toignore diffraction, and assume that lasers beamstravel in a straight line. In the regular opticalmodel, we present elegant layouts for processornetworks including the shuffle, grids, andMargulis' expander graph. We also disprove theexistence of a certain kind of 3-dimensional layoutfor shuffles.

Using slightly more complicatedoptical devices, such as beam splitters, we design a``light guide,'' which allows simultaneousbroadcasts, subject only to the limitations of lightsensors. In particular, the light guide can performsingle broadcasts. Given accurate enough clocks, itcan perform arbitrary permutations. 000730Tree Matching with Recursive Distributed RepresentationsAndreas Stolcke, Dekai Wutr-92-025199204__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-025.pdfWe present an approach to the structure unificationproblem using distributed representations ofhierarchical objects. Binary trees are encodedusing the recursive auto-association method(RAAM), and a unification network is trained toperform the tree matching operation on the RAAMrepresentations. It turns out that this restrictedform of unification can be learned without hiddenlayers and producing good generalization if we allowthe error signal from the unification task to modifyboth the unification network and the RAAMrepresentations themselves. 000731On the Power of Discontinous Approximate ComputationsKarl Aberer, Bruno Codenottitr-92-026199204__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-026.pdfThe set of operations S_1={+,-,*,/,>} is used inalgebraic computations to avoid degeneracies(e.g., division by zero), but is also used innumerical computations to avoid huge roundofferrors (e.g., division by a small quantity). On theother hand, the classes of algorithms usingoperations from the set S_2={+,-,*,/} or from the setS_3={+,-,*} are the most studied in complexitytheory, and are used, e.g., to obtain fast parallelalgorithms for numerical problems. In this paper, westudy, by using a simulation argument, the relativepower of the sets S_1, S_2, and S_3 for computing withapproximations. We prove that S_2 does veryefficiently simulate S_1, while S_3 does not; thisfact shows and measures the crucial role of divisionin computations introducing roundoff errors. Wealso show how to construct algorithms usingoperations {+,-,*,/} which achieve for most inputsthe same error bounds as algorithms using operations{+,-,*,/,>}. To develop our simulation strategy wecombine notions imported from approximation theoryand topology with complexity and error bounds. Moreprecisely, to find conditions under which thissimulation can take place, we quantitativelydescribe the interplay between algebraic,approximation, topological, and complexitynotions and we provide lower and upper bounds on thecost of simulation. 000732The Quality of Separation Between NP and Exponential Time; Reducing theCasesGerhard Lischketr-92-027199204__000006We consider three aspects of quality of separationbetween complexity classes: inclusion, immunityand sparseness in the differences. These aspects arediscussed in general and investigated especiallyfor the relationship between NP and deterministicexponential linear time, where we can reduce thenumber of possible cases from 24 to 8. Seven of the 8cases are realizable in appropriate relativizedworlds; one case remains open. Also, we found an errorin former papers on this subject. 000733Proposal of an External Processor Scheduling in Micro-Kernel Based OperatingSystemsWinfried Kalfatr-92-028199205__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-028.pdfUntil now, the management of resources was a task ofthe operating systems kernel. The applicationsrunning on the operating system were in general,similiar to each other. Thus the limited policy of theresource manager could satisfy the demands ofapplications. With the advent of computer systemscapable handling new applications such asmulti-media and of new operating systems based onmicro-kernels and supporting object paradigm in adistributed environment, an external resourcemanager became important for both traditionaloperating systems like UNIX with new applicationsand new object oriented and micro- kernel basedoperating systems. In this paper an approach to anexternal scheduling on the basis of the operatingsystem BirliX is given. The proposal is based on ascheduler implemented in the user space. Problems ofthe implementation are described by means of theoperating system BirliX as an example. Because theoperating system is a distributed object orientedopera- ting system, our proposal deals with local anddistributed managers. Coming from a system model ofthe BirliX, a resource mode, and a process model, thescheduling model is developed.

Keywords: Distributed Operating Systems, External Processor Scheduler, Micro-Kernel, BirliX 000734Efficient Computation of Spatial JoinsOliver Günthertr-92-029199205__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-029.pdfSpatial joins are join operations that involvespatial data types and operators. Due to some basicproperties of spatial data, many conventional joinprocessing strategies suffer serious performancepenalties or are not applicable at all in this case. Inthis paper we explore which of the join strategiesknown from conventional databases can be applied tospatial joins as well, and how some of thesetechniques can be modified to be more efficient in thecontext of spatial data. Furthermore, we describe aclass of tree structures, called generalizationtrees, that can be applied efficiently to computespatial joins in a hierarchical manner. Finally, wemodel the performance of the most promisingstrategies analytically and conduct a comparativestudy. 000735Checking Approximate Computations over the RealsSigal Ar, Manuel Blum, Bruno Codenotti, Pete Gemmelltr-92-030199205__000006This paper provides the first systematicinvestigation of checking approximate numericalcomputations, over subsets of the reals. In mostcases, approximate checking is more challengingthan exact checking. Problem conditioning, i.e.,the measure of sensitivity of the output to slightchanges in the input, and the presence ofapproximation parameters foil the directtransformation of many exact checkers to theapproximate setting. We can extend exact checkersonly if they have a very smooth dependence on thesensitivity of the problem. Furthermore,approximate checking over the reals is complicatedby the lack of nice finite field properties such as theexistence of a samplable distribution which isinvariant under addition or multiplication by ascalar. We overcome the above problems by using suchtechniques as testing and checking over similar butdistinct distributions, using functions' randomand downward self-reducibility properties, andtaking advantage of the small variance of the sum ofindependent identically distributed randomvariables. 000736Decision Procedures for Flat Set-Theorectical Syllogistics.I. General Union,Powerset and Singleton OperatorsDomenico Cantone, Vincenzo Cutellotr-92-031199205__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-031.pdf(Pages 30) In this paper we show that a class ofunquantified multi-sorted set-theoretic formulaeinvolving the notions of powerset, general union,and singleton has a solvable satisfiabilityproblem. We exhibit a normalization procedure thatgiven a model for a formula in our theory, it produces asimpler and "a priori" bounded model whosecardinality depends solely on the size of the givenformula. 000737A Model for Amalgamation in Group Decision MakingVincenzo Cutello, Javier Monterotr-92-032199205__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-032.pdf(Pages 14) In this paper we present a generalizationof the model proposed by Montero in [Mon87a, Mon87b,Mon92], by allowing non complete fuzzy binaryrelations for individuals. A degree ofunsatisfaction can be defined in this case,suggesting that any democratic aggregation ruleshould take into account not only ethical conditionsor some degree of rationality in the amalgamatingprocedure, but also a minimum support for the set ofalternatives subject to the group analysis. 000738A Characterization of Rational Amalgamation OperationsVincenzo Cutello, Javier Monterotr-92-033199205__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-033.pdf(Pages 24) This paper deals with amalgamation offuzzy opinions when a fixed number of individuals isfaced with an unknown number of alternatives. Theaggregation rule is defined by means of intensityaggregation operations that verify certain ethicalconditions, and assuming fuzzy rationality asdefined in [6, 7]. A necessary and sufficientcondition for non-irrationality is presented,along with comments on the importance of the number ofalternatives. 000739Ambiguities in Object Specifications in View of Data TestingDieter Richtertr-92-034199206__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-034.pdfChecking data only relying on their specification isof importance when using neutral or standardizedobject models. Ambiguities arise during the testsbecause of specifications leaving a certain degreeof freedom to the implementation. Based on anexperimental background the observations andreflections about the reasons are systematicallypresented. It turns out that the transition (ormapping) from a specification of an object to aphysical instance (or data set) has to take intoconsideration when defining neutral models. Thistransition which often has been seen as a technicalquestion of the implementation or as the internal(hided) feature of a system appears as a particularpoint of the concept besides the specification of thesemantics.

One crucial point is the instancehandling with respect to assign and comparisonoperations. The mapping from a specification into adatabase can be realized in various manners whichleads to interpretation defects when testingindependently. Another point is the weak scopedefinition in specifications. Several ambiguitiesare caused by it. A very frequent reason ofmisunderstandings is the imprecise or wrongunderstanding of the different relations betweenobjects, logical and physical instances. There areapproaches for more clear specifications. The lastpoint is the representation of failures or moregenerally of the state of instances. A concept basedon multiple inheritance seems to increase theabstraction level of state specifications on thesame level as the used specification language is of. 000740Experiments with Noise Reduction Neural Networks for Robust SpeechRecognitionMichael Trompftr-92-035199205__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-035.pdfSpeech recognition systems with small and mediumvocabularies are used as natural human interface in avariety of real world applications. Though they workwell in a laboratory environment, a significant lossin recognition performance can be observed in thepresence of background noise. In order to make such asystem more robust, the development of a neuralnetwork based noise reduction module is described inthis paper. Based on function approximationtechniques using multilayer feedforward networks(Hornik et al. 1990), this approach offers inherentnonlinear capabilities as well as easy training frompairs of corresponding noisy and noise-free signalsegments. For the development of a robustnonadaptive system, information about thecharacteristics of the noise and speech componentsof the input signal and its past and future context istaken into account. Evaluation of each step is done bya word recognition task and includes experimentswith changing signal parameters and sources to testthe robustness of this neural network basedapproach. 000741Efficient Clustering Techniques for the Geometric Traveling Salesman ProblemBruno Codenotti, Luciano Margaratr-92-036199206__000006This paper presents some direct and iterativeheuristic methods for the geometric TravelingSalesman Problem (TSP). All these methods are basedon a particular notion of mass density, which can beused to construct a tour for the geometric TSP in anincremental fashion. In the iterative method, thistechnique is combined with the Lin-Kernighan method(LK), and this allows us to obtain better tours thanthose found by using LK itself. More precisely, thetour length we get is only 1.1% off the optimum. Thedirect method finds a solution passing through asequence of subsolutions over progressively largersets of points. These points are the relative maximaof the mass density obtained by using differentparameter settings. The method has O(n^3) worst caserunning time and finds tours whose length is 9.2% offthe optimal one. 000742Measuring the Latency Time of Real-Time Unix-like Operating SystemsNewton Fallertr-92-037199206__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-037.pdfWith the advent of continuous-media applications,real-time operating systems, once confined toprocess control and other specializedapplications, are coming to the desktop. Thepopularity of UNIX made this operating system thefirst choice for use with such real-time desktopapplications. However, since UNIX kernel does notprovide real-time responsiveness, some softwaredevelopers have been trying to adapt it to respond tothis new requirements, while others have beenproposing its total redesign. Though the evaluationof the performance of a real-time operating systemdepends on many factors, a predictable small latencytime in responding to external events is alwaysessential. In this paper, after a discussion aboutthe probable sources of latency, it is presented amethod for collecting information aboutcontext-switching and interrupt-acknowledgetimes in UNIX-like operating systems withoutrequiring external measuring tools. It is alsoproposed, a form of presentation of these data aimedat facilitating the comparison with previouslycollected data obtained from the same or from othersystems. The paper is illustrated with actualresults obtained by the application of the method toTROPIX, a real-time UNIX-like operating system,running on a Motorola 68010-based computer. Theimpact of kernel preemption and some practicalmeasurement interference considerations due todynamic memory refresh, DMA operation and diskmultiblock access are also discussed. 000743Fuzzy Evolutionary AlgorithmsHans-Michael Voigttr-92-038199206__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-038.pdfEvolutionary algorithms (EA) combine different approaches for solvingcomplex problems based on principles, models, and mechanisms ofnatural evolution. Typical representatives of such algorithms areGenetic Algorithms (GA) and Evolution Strategies (ES), which areclosely related in principle but show different emphasis on therepresentational and operational level. The basic ideas and conceptsfor GAs and ESs dates back to the early sixties. Central concepts ofthese approaches include the replication, recombination, mutation,selection, isolation-migration, and diffusion of individuals withinor between populations or subpopulations, respectively. Thesealgorithms do not take into account the development of an individualor organism from the gene level to the mature phenotype level. Thisdevelopment is a multistage decision process influenced by theenvironment and by interspecific as well as intraspecific competitionand cooperation such that usually no inferences can be drawn fromphenotype to genotype. The goal of this paper is to introduce a fuzzyrepresentation and fuzzy operations to model the developmental processbased on fuzzy decisions. Some first conclusions with respect tooptimization will be stated.

The appendices include an up-to-date software survey for EvolutionaryAlgorithms and the description of "The Evolution Machine". 000744Boot AlgebrasD. Schuett, U. Eckhardt, P. Sudatr-92-039199206__000006The paper surveys our recent work in the field ofBoolean algebra. It begins with an introduction intothe theory of Boolean algebras and discussesproblems related to the separation of an algebra intoa family of factors so that the Cartesian product ofthe family is isomorphic to the given algebra. Such aproduct is called a "Boo"lean "t"uple algebra or forshort a Boot algebra if each factor is completelycontained in the original algebra. Some examples aretaken from the field of digital circuit design andimage processing. They demonstrate how Bootalgebras can be applied. 000745Robot Shaping: Developing Situated Agents through LearningMarco Colombetti, Marco Dorigotr-92-040199208__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-040.pdf1993] Learning plays a vital role in the developmentof situated agents. In this paper, we explore the useof reinforcement learning to "shape" a robot toperform a predefined target behavior. We connectboth simulated and real robots to Alecsys, a parallelimplementation of a learning classifier system withan extended genetic algorithm. After classifyingdifferent kinds of Animat-like behaviors, weexplore the effects on learning of different types ofagent's architecture (monolithic, flat andhierarchical) and of training strategies. Inparticular, hierarchical architecture requiresthe agent to learn how to coordinate basic learnedresponses. We show that the best results are achievedwhen both the agent's architecture and the trainingstrategy match the structure of the behavior patternto be learned. We report the results of a number ofexperiments carried out both in simulated and in realenvironments, and show that the results ofsimulations carry smoothly to real robots. Whilemost of our experiments deal with simple reactivebehavior, in one of them we demonstrate the use of asimple and general memory mechanism. As a whole, ourexperimental activity demonstrates thatclassifier systems with genetic algorithms can bepractically employed to develop autonomous agents.

Keywords: machine learning, adaptive systems, genetic algorithms, learning classifier systems, behavior-based robotics. 000746The NC Equivalence of Integer Linear Programming and Euclidean GCDVictor Pantr-92-041199212__000006We show NC-reduction of integer linear programmingwith two variables to the evaluation of the remaindersequence arising in the application of the Euclideanalgorithm to two positive integers. Due to theprevious result of Deng, this impliesNC-equivalence of both of these problems, whosemembership in NC, as well as P-completeness, remainunresolved open problems. 000747A Framework for Cumulative Default LogicsGehard Brewkatr-92-042199207__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-042.pdfWe present a framework for default reasoning whichhas its roots in Reiter's Default Logic. Contrary toReiter, however, we do not consider defaults asinference rules used to generate extensions of aclassical set of facts. In our approach defaults areelements of the logical language, and we will defineinference rules on defaults. This has severaladvantages. First of all, we can reason aboutdefaults, not just with defaults. This makes it easyto include different intuitions about the rightbehaviour of a default logic in an explicit form.Secondly, we can show how some of the problems ofReiter's logic and of some recent proposals to solvethem can be handled adequately by exploiting thedependency information contained in deriveddefaults. 000748A Symbolic Complexity Analysis of Connectionist Algorithms forDistributed-Memory MachinesJonathan Bachrachtr-92-043199207__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-043.pdfThis paper attempts to rigorously determine thecomputation and communication requirements ofconnectionist algorithms running on adistributed-memory machine. The strategy involves(1) specifying key connectionist algorithms in ahigh-level object-oriented language, (2)extracting their running times as polynomials, and(3) analyzing these polynomials to determine thealgorithms' space and time complexity. Results arepresented for various implementations of theback-propagationalgorithm~\cite{Rumelhart-Hinton-Williams}. 000749On-Line Algorithms Versus Off-Line Algorithms: How Much is it Worth to Knowthe Future?Richard M. Karptr-92-044199207__000006An "on-line algorithm" is one that receives asequence of requests and performs an immediateaction in response to each request. On-linealgorithms arise in any situation where decisionsmust be made and resources allocated withoutknowledge of the future. The effectiveness of anon-line algorithm may be measured by its"competitive ratio", defined as the worst-caseratio between its cost and that of a hypotheticaloff-line algorithm which knows the entire sequenceof requests in advance and chooses its actionsoptimally. In a variety of settings, we discusstechniques for proving upper and lower bounds on thecompetitive ratios achievable by on-linealgorithms. In particular, we discuss theadvantages of randomized on-line algorithms overdeterministic ones. 000750Persistence in the Object-Oriented Database Programming Language VMLWolfgang Klas, Volker Turautr-92-045199207__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-045.pdfIn this paper the principles of handling persistentobjects in the object-oriented databaseprogramming language VML is presented. The maindesign criteria of VML with respect to persistencewere: persistence independent programming, datatype completeness and operations manipulating theextension of a class. After defining the abovementioned concepts an example is used to compare themodelling and computational power of VML with thedatabase programming languages Adaplex, PS-algol,and Galileo. The distinction of types and classes isthe basis for defining persistence in VML. Instancesof classes are always persistent and those of datatypes are always transient. All instances arereferenced by object identifiers, values ofdatatypes are referenced independently of the factwhether they are attached to persistent objects (andare therefore persistent itself) or whether they are"stand alone". 000751An Object-Oriented Approach to the Design of Graphical User InterfaceSystemsFabio Paternotr-92-046199208__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-046.pdfIn this paper the problems concerning the design ofgraphical user interface systems composed of a set ofinteraction objects allowing users to interact withstructured graphics are presented. Here we want topoint out the problems and the requirements that areraised in performing such a design in anobject-oriented environment. For this purpose theimportance of task-oriented design of interactionobjects in order to make the traslation from the usertask to the system functions easier is addressed. Thedesign of a hierarchy of interaction objectsfollowing this approach is proposed. This contrastwith the current window systems toolkits designbecause it is mainly driven by the semantics of theinteraction object rather than their appearance.Finally an example of common graphical interfaceperformed by the proposed approach is presented. 000752An Adaptive Classification Scheme to Approximate Decision Boundaries UsingLocal Bayes Criteria - The "Melting Octree" NetworkL. Miguel Encarnacao, Markus H. Grosstr-92-047199207__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-047.pdfThe following paper describes a new method toapproximate the minimum error decision boundary forany supervised classification problem by means of alinear neural network consisting of simple neuronsthat use a local Bayes criterium and a next neighbordecision rule. The neurons can be interpreted ascentroids in feature space or as a set of particlesmoving towards the classification boundary duringtraining. In contrary to existing LVQ methods and RCEnetworks each neuron has a receptive field of anadjustable width e and the goal of the supervisedtraining method is completely different.Furthermore, the network is able to grow in the senseof generating new entities in order to decrease theclassification error after learning.

For thispurpose we initialize the network via amultidimensional octree representation of thetraining data set. The neurons generated duringinitialization only depend on the maximum number ofdata in a single octree cell. The learning methodintroduced ensures that all neurons move towards theclass boundaries by checking the local Bayescriterium in their receptive field. For this processcan also be interpreted as a melting away of theinitial octree, we called the network "The MeltingOctree" network.

This report first describes thealgorithms used for initialization, training aswell as for growing of the net. The classificationperformance of the algorithm is then illustrated bysome examples and compared with those of a Kohonenfeature Map (LVQ) and of a backpropagatedmultilayered perceptron.

Note: The charts are page39 of the techreport. I stored them undertr-92-047.charts.ps.Z. They're not absolutelynecessary for the report; just to complete it. 000753A Study of Perceptually Grounded Polysemy in a Spatial MicrodomainJordan Zlatevtr-92-048199208__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-048.pdfThis paper attempts to exemplify the advantages of perceptuallygrounded semantics with respect to traditional formalist approaches inelucidating the nature of the controversial notion of linguisticpolysemy, or multiplicity of meaning. It is also suggested how someaspects of language typically associated with compositionality couldbe modeled, without there being a strictly ``compositionalsemantics''.

This is done through a series of experiments, using modifications ofTerry Regier's connectionist system for learning spatial relationswhich constitutes a part of the L subscript 0 project concerned withassociating descriptions in an arbitrary language with an analogenvironment, (sequences of) pictures of simple two-dimensional scenes.

The emphasis is above all on the English preposition `over', famousfor its polysemy, and analyzed in detail by [Brugman, 1981] and[Lakoff, 1987], but some modeling has also been done of the meaningof `under', as well as some rudimentary semantics for simple verbssuch as `be', `go' and `fly' that combine with the two prepositions.

Three kinds of connectionist architectures have been used in trying tocapture what might be called a `polysemous over'. It is suggestedthat the first seems to treat polysemy like what has traditionallybeen regarded as generality, where distinctions are neutralized andsenses are not distinct, while the second reduces polysemy tohomonomy where they are distinct but not related. It is the thirdtype of (structured) connectionist architecture that managed best inboth learning different senses and reflecting the polysemousstructure of the lexical item in analyses of the relevant hiddenlayers. In this architecture polysemy emerges as an effect of thecombinatorics of words and their pairing with the environment.

The main theoretical claim is that polysemy is best regarded as acontextual rather than a purely lexical phenomenon. This on itspart suggests support for the claim made in [Geeraerts, 1992] thatthe distinction between polysemy and generality is unstable, and fora semantics that is radically anti-reificational. The results fromthis study suggest that such a semantics can account for thegenerativity and systematicity of language, despite claims to thecontrary made by formalists.

Keywords: computational linguistics, polysemy, perceptually grounded semantics, neural networks, partially structured connectionism. 000754An Abductive Framework for Generalized Logic Programs: Preliminary ReportGerhard Brewkatr-92-049199207__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-049.pdfWe present an abductive semantics for generalizedpropositional logic programs which defines themeaning of a logic program in terms of its extensions.This approach extends the stable model semantics fornormal logic programs in a natural way. The newsemantics is equivalent to stable semantics for alogic program $P$ whenever $P$ is normal and has astable model. The existence of extensions isguaranteed for all normal programs. The semanticscan be applied without further modification togeneralized logic programs where disjunctions andnegation signs may appear in the head of rules. Ourapproach is based on an idea recently proposed byKonolige for causal reasoning. Instead ofmaximizing in abduction the set of used hypothesesalone we maximize the union of the used and refutedhypotheses. 000755The Degrees of Discontinuity of some Translators between Representations ofthe Real NumbersKlaus Weihrauchtr-92-050199207__000006Representations like decimal representation areused for defining computability on the set of realnumbers. Translatability between differentrepresentations has been studied in the past byseveral authors. Most of the not computably solvabletranslation problems are not even continuouslysolvable. In this paper the degrees of discontinuityof translations between a number of commonrepresentations are compared and characterized.Mainly three degrees are considered: the first onewith translations between the standardrepresentation and the weak cut representations,the second one contains among others thetranslations between ``m''-adic and ``n''-adicrepresentations, and the third one containstranslations concerning proper cutrepresentations and the iterated fractionrepresentation. 000756Improved Parallel Polynomial Division and Its ExtensionsDario Bini, Victor Pantr-92-051199208__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-051.pdfWe compute the first N coefficients of the reciprocalr(x) of a given polynomial p(x), (r(x)p(x)=1 mod x^N,p(0) not equal to 0), by using, under the PRAMarithmetic models, O(h log N) time-steps andO((N/h)(1+2^{-h}log^{(h)}N)), processors, forany h, h=1, 2,... log^*N, provided that O(log m) stepsand m processors suffice to perform DFT on m points andthat log^{(0), N=N, log^{(h)}N = log_2 log^{(h-1)}N, h=1,...,log^*N, log^*N = max{h:\log^{(h)} N > 0. The same complexity estimates applyto some other computations, such as the division witha remainder of two polynomials of degrees O(N) and theinversion of an N times N triangular Toeplitz matrix.This improves the known estimates of Reif-Tate andGeorgiev. We also show how to extend our techniques toparallel implementation of other recursiveprocesses, such as the evaluation modulo x^N of them^th root, p(x)^{1/m, of p(x) (for any fixed naturalm), for which we need 0(log N log log N) time-steps andO(N/log log N) processors. The paper demonstratessome new techniques of supereffective slowdown ofparallel algebraic computations, which we combinewith a technique of stream contraction. 000757Improved Parallel Computations with Toeplitz-like and Hankel-like MatricesDario Bini, Victor Pantr-92-052199208__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-052.pdfThe known parallel algorithms for computations withgeneral Toeplitz, Hankel, Toeplitz-like, andHankel-like matrices are inherently sequential. Wedevelop some new techniques in order to devise fastparallel algorithms for such computations,including the evaluation of Krylov sequences forsuch matrices, traces of their power sums,characteristic polynomials and generalizedinverses. This has further extensions to computingthe solution or a least-squares solution to a linearsystem of equations with such a matrix and to severalpolynomial evaluations (such as computing gcd, lcm,Pade approximation and extended Euclidean schemefor two polynomials), as well as to computing theminimum span of a linear recurrence sequence. Thealgorithms can be applied over any field ofconstants, with the resulting dvantages of usingmodular arithmetic. The algorithms consist of implecomputational blocks (mostly reduced to fastFourier transforms, FFT's) and have potentialpractical value. We also develop the techniques forextending all our results to the case of matricesrepresentable as the sums of Toeplitz-like andHankel-like matrices and in addition show some moreminor innovations, such as an improvement of thetransition to the solution to a Toeplitz linearsystem Tx=b from two computed columns of T^-1. 000758A Mechanism for Dynamic Re-routing of Real-time ChannelsColin Parris, Hui Zhang, Domenico Ferraritr-92-053199208__000006Various solutions have been proposed to providereal-time services (i.e., services with guaranteedperformance requirements) in packet-switchednetworks. These solutions usually require fixedrouting and resource reservation for eachconversation. The routing and reservationdecisions, combined with load fluctuations,introduce the problems of network unavailabilityand loss of network management flexibility. Webelieve that these problems can be alleviated byproperly balancing the network load. In this paper,we present a mechanism that dynamically reroutes areal-time channel without disruption of service tothe clients. This mechanism is one component in aframework to investigate load balancing in areal-time internetwork. We show that the mechanismcan be incorporated into the Tenet real-timeprotocol suite with minimal changes and overhead. 000759Process Grammar Processor: An Architecture for a Parallel ParserMassimo Marinotr-92-054199208__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-054.pdfA parallel architecture of a parser for NaturalLanguage is described. A serial architecture hasbeen already realized and is currently used in asystem for the design and testing of Natural Languagegrammars and the generation of the correspondingparsers. This system works using a Process GrammarProcessor running a model of grammar suited for thegeneration of Natural Language applications. Thegrammar model, named Process Grammar (PG), is anextension of an augmented context-freephrase-structure grammar, and the parser isdesigned to use such a grammar model. A PG is a set ofrules that are treated by the processor asdescriptors of processes that are scheduled andapplied if the conditions for their execution hold:from this the name Process Grammar. In this report thePG model is extended in order to allow a morestructured and modular construction of grammars,even of big dimensions, keeping separated parsingcontrol, and syntactic and semanticspecifications, partitioning a PG in clusters ofrules, completely independent one from each other,carrying on their own dedicated recognition ofspecific parts of speech. The parallel architectureis composed by parallel processes cooperating andcommunicating by means of a message passingprotocol. This allows the realization of someparsing strategies and the implementation ofparsing mechanisms extending the recognitioncapacity of the parser that could not be possible in astandard and serial context-free parsingenvironment. Both serial and parallel versions ofthe parser are introduced and described, looking ingreater detail the mechanisms of process schedulingand how they can be used and extended for implementingvarious cases of parsing strategies. 000760A New Approach to Fast Polynomial Interpolation and Multipoint EvaluationVictor Pantr-92-055199208__000006The fastest known algorithms for the problems ofpolynomial evaluation and multipointinterpolation are devastatingly unstablenumerically because of their recursive use ofpolynomial divisions. We apply a completelydistinct approach to compute approximatesollutions to both problems equally fast but withimproved numerical stability. Our approach relieson new techniques, so far not used in this area: wereduce the problems to Vandermonde matrixcomputations and then exploit some recent methodsfor improving computations with structuredmatrices. 000761On-line Graph Algorithms for Incremental CompilationAlberto Marchetti-Spaccamela, Umberto Nanni, Hans Rohnerttr-92-056199208__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-056.pdfCompilers usually construct various datastructures which often vary only slightly fromcompilation run to compilation run. This paper givesvarious solutions to the problems of quicklyupdating these data structures instead of buildingthem from scratch each time. All problems we found canbe reduced to graph problems. Specifically, we givealgorithms for updating data structures for theproblems of topological order, loop detection, andreachability from the start routine. 000762Describing and Recognizing Shape through Size FunctionsClaudio Uras, Alessandro Verritr-92-057199209__000006According to a recent mathematical theory theintuitive concept of shape can be formalized throughfunctions, named "size functions", which conveyinformation on both the topological and metricproperties of the viewed shape. In this paper the mainconcepts and results of the theory are first reviewedin a somewhat intuitive fashion. Then, an algorithmfor the computation of discrete size functions ispresented. Finally, by introducing a suitabledistance function, it is shown that size functionscan be successfully used for both shape descriptionand recognition from real images. 000763Planar Passive Navigation: One Dimension is Better than TwoEnrico De Micheli, Alessandro Verritr-92-058199211__000006This paper is based on the observation that if aviewing camera is appropriately mounted on a vehiclewhich moves on a planar surface, i.e. the image planeof the camera is orthogonal to the planar surface andthe optical axis parallel to the instantaneousdirection of translation, then the angular velocityis the only motion parameter to be computed.Consequently, the problem of motion and structurerecovery from optical flow becomes linear and, inprinciple, can be solved locally. Elementary erroranalysis shows that the angular velocity can berobustly estimated by averaging the horizontalcomponent of the optical flow along the vertical linethrough the center of the image. Once the angularvelocity has been recovered, depth can be computedfrom one component only of the optical flow. It isshown that the accuracy in the estimation of depthfrom the vertical component is more accurate,increases with the distance from the horizontalliine through the center of the image, and is almostindependent of the angular velocity. From thereported experiments on synthetic data and realimages it can be concluded that in applications likeautonomous robot navigation the computation of thetwo-dimensional (2D) optical flow over the entire 2Dimage plane can be probably avoided. 000764Learning Topology-Preserving Maps Using Self-Supervised Backpropagation on aParallel MachineArnfried Ossentr-92-059199209__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-059.pdfSelf-supervised backpropagation is anunsupervised learning procedure for feedforwardnetworks, where the desired output vector isidentical with the input vector. Forbackpropagation, we are able to use powerfulsimulators running on parallel machines.Topology-preserving maps, on the other hand, can bedeveloped by a variant of the competitive learningprocedure. However, in a degenerate case,self-supervised backpropagation is a version ofcompetitive learning. A simple extension of the costfunction of backpropagation leads to a competitiveversion of self-supervised backpropagation, whichcan be used to produce topographic maps. Wedemonstrate the approach applied to the TravelingSalesman Problem (TSP). The algorithm wasimplemented using the backpropagation simulator(CLONES) on a parallel machine (RAP). 000765Ring Array Processor: Programmer's Guide to the RAP LibrariesMichael C. Greenspontr-92-060199209__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-060.pdfThe RAP machine is a high performance DSP-baseddistributed memory parallel processor developed atICSI as described in previous technical reports.This report documents the application programinterfaces to the high-level computationalroutines provided by the RAP class librariescorresponding to software release 1.0. It isintended as both an introductory guide and standardlibrary reference for C++ and C programmersundertaking software development for the RAPmachine. The RAP library classes and methodsdocumented in this report transparently implementdata-parallel operations on distributed memoryobjects. Thus client programs written to theseinterfaces automatically achieve scalabilityacross different sized RAP machines. Additionally,the high-level interfaces provide a degree ofgeneral hardware independence, increasing thelikelihood that client code will port easily tofuture parallel platforms under development atICSI. This report also provides an introduction tothe internals of the distributed objectimplementation with tips and examples forprogrammers wishing to extend the libraries in astructured fashion. 000766Can we Utilize the Cancellation of the Most Significant Digits?Victor Pantr-92-061199212__000006If the sum of several positive and negative numbershas a small magnitude, relative to the magnitudes ofthe summands, then we show how to decrease theprecision of the computation of this sum (withoutaffecting the output precision). Furthermore, ifthe magnitude of the inner product of two vectors issmall and if one of them is filled with "short" binarynumbers, each represented with only a few bits, thenwe decrease the precision of the computation of suchan inner product (without affecting the outputprecision), and we extend this result to theiterative improvement algorithm for a linear systemof equations, whose coefficients are represented by"short" binary numbers. We achieve this bytruncating both the least and the most significantdigits of the operands, according to our new scheme of"backward binary segmentation". 000767The Acquisition of Lexical Semantics for Spatial Terms: A ConnectionistModel of Perceptual CategorizationTerry Regiertr-92-062199209__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-062.pdfThis thesis describes a connectionist model whichlearns to perceive spatial events and relations insimple movies of 2-dimensional objects, so as to namethe events and relations as a speaker of a particularnatural language would. Thus, the model learnsperceptually grounded semantics for naturallanguage spatial terms. The design and constructionof this system have resulted in several technicalcontributions. The first is a very simple buteffective means of learning without explicitnegative evidence. This thesis also presents thenotion of partially-structured connectionism, amarriage of structured and unstructured networkdesign techniques capturing the best of eachparadigm. Finally, the idea of learning withinhighly specialized structural devices isintroduced. Scientifically, the primary result ofthe work described here is a computational model ofthe acquisition of visually grounded semantics.This model successfully learns terms for spatialevents and relations from a range of languages withwidely differing spatial systems, includingEnglish, Mixtec (a Mexican Indian language),German, Bengali, and Russian. And perhaps mostimportantly, the model does more than justrecapitulate the data; it also generates a number offalsifiable linguistic predictions regarding thesorts of semantic features, and combinations offeatures, one might expect to find in lexemes forspatial events and relations in the world's naturallanguages. 000768Block Korkin-Zolotarev Bases and Successive MinimaC. P. Schnorrtr-92-063199209__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-063.pdfUsing block Korkin--Zolotarev bases we improveBabai's construction of a nearby lattice point.Given a block Korkin--Zolotarev basis with blocksize beta of the lattice L and given a point x in the spanof L, a lattice point v can be found in timebeta^{O(beta)} satisfying |x-v|^2 less then orequal to m gamma^{2m/{beta-1}_beta min_u epsilon L}|x-u|. These results also bear improvements for themethod of solving integer programming problems viabasis reduction. 000769Competitive Analysis of Financial GamesR. El-Yaniv, A. Fiat, R. Karp, G. Turpintr-92-064199209__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-064.pdfIn the unidirectional conversion problem an on-lineplayer is given the task of converting dollars to yenover some period of time. Each day, a new exchange rateis announced, and the player must decide how manydollars to convert. His goal is to minimize thecompetitive ratio, defined as sup_EP_OPT(E)\P_{X}(E), where E ranges over exchangerate sequences, P_OPT(E) is the number of yenobtained by an optimal off-line algorithm, andP_{X}(E) is the number of yen obtained by the on-linealgorithm X. We also consider a continuous version ofthe problem, in which the exchange rate varies over acontinuous time interval. The on-line player's apriori information about the fluctuation ofexchange rates distinguishes different variants ofthe problem. For three variants we show that a simplethreat-based strategy is optimal for the on-lineplayer and determine its competitive ratio. We alsoderive and analyze an optimal policy for the on-lineplayer when he knows the probability distribution ofthe maximum value that the exchange rate will reach.Finally, we consider a bidirectional conversionproblem, in which the player may trade dollars for yenor yen for dollars. 000770The Impact of Multimedia Data on Database Management SystemsKarl Aberer, Wolfgang Klastr-92-065199209__000006NOTE: Many have reported problems printing thisfile. Thus we have renamed it with a .BAD tag. We offerthis techreport "as-is" and cannot offer helpprinting it. This paper analyzes the impact ofmultimedia data on database management systems andproposes some solutions which allow for a high degreeof integrated handling of multimedia data by amultimedia database system. We first give acharacterization of multimedia data with respect toissues like time dependency and amount of data. Thenwe derive major requirements which need to besatisfied in order to provide the integration. Theserequirements include e.g., dynamic datamanagement, non-transparent parallelism,scheduling, several kinds of abstractions,resource distribution transparency, and advancedinteraction models satisfying real timeconstraints. We show how some of the requirements canbe met by exploiting concepts from theobject-oriented paradigm and database systems.Then we discuss extensions needed with respect todata integration, scheduling, parallelism, andreal time streams. 000771Physical Mapping of Chromosomes: A Combinatorial Problem in MolecularBiologyFarid Alizadeh, Richard M. Karp, Lee A. Newberg, Deborah K. Weissertr-92-066199309__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-92-066.pdfA fundamental tool for exploring the structure of along DNA sequence is to construct a ``library''consisting of many cloned fragments of the sequence.Each fragment can be replicated indefinitely andthen ``fingerprinted'' to obtain partialinformation about its structure. A common type offingerprinting is restriction fingerprinting, inwhich an enzyme called a restriction nucleasecleaves the fragment wherever a particular shortsequence of nucleotides (letters `A', `G', `C', and`T') occurs, and the lengths of the resulting piecesare measured. An important combinatorial problem isto determine, from such fingerprint information,the most probable arrangement of the clonedfragments along the overall sequence. However, for agiven arrangement, even the likelihood functioninvolves a complicated multifold integral andtherefore difficult to compute. We propose anapproximation to the likelihood function anddevelop local search algorithms based on thisapproximate objective function. Our local searchtechniques are extensions of similar strategies forthe travelling salesman problem. We provide somecomputational results which support our choice ofobjective function. We also briefly studyalternative approaches based on pairwiseprobabilities that two fragments overlap. 000772Integrating a Relational Database System into VODAK using its MetaclassConceptW. Klas, G. Fischer, K. Aberertr-92-067199208__000006This paper presents a specific approach ofintegrating a relational database system into afederated database system. The underlying databaseintegration process consists of three steps: first,the external database systems have to be connected tothe integrated database system environment and theexternal data models have to be mapped into acanonical data model. This step is often calledsyntactic transformation including structuralenrichment and leads to component schemas for eachexternal DBMS. Second, the resulting schemas fromthe first step are used to construct export schemaswhich are then integrated into global, individualschemas or views. In this paper we focus on the firststep for relational databases, i.e., the connectionof a relational database system and the mapping of therelational model into a canonical data model. We takePOSTGRES as the relational database system and theobject-oriented federated database system VODAK asthe integration platform which provides the open,object-oriented data model as the canonical datamodel for the integration. We show differentvariations of mapping the relational model. Byexploiting the metaclass concept provided by VML weshow how to tailor VML such that the canonical datamodel meets the requirements of integratingPOSTGRES into the global database system VODAK in anefficient way. 000773Public Randomness in CryptographyAmir Herzberg, Michael Lubytr-92-068199210__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-068.pdfThe main contribution of this paper is theintroduction of a formal notion of public randomnessin the context of cryptography. We show how thisnotino affects the definition of the security of acryptoigraphic primitive and the definition of howmuch security is preserved when one cryptographicprimitive is reduced to another. Previous worksconsidered the public random bits as a part of theinput, and security was parameterized in terms of thetotal length of the input. We parameterize securitysolely in terms of the length of the private input, andtreat the public random bits as a separate resource.This separation allows us to independently addressthe important issues of how much security ispreserved by a reduction and how many public randombits are used in the reduction.

To exemplify these newdefinitions, we present reductions from weakone-way permutations to one-way permutations withstrong security preserving properties that aresimpler than previously known reductions. 000774Inductive learning of compact rule sets by using effcient hypothesesreductionThomas Kochtr-92-069199209__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-069.pdfA method is described which reduces the hypothesesspace with an efficient and easily interpretablereduction criteria called a - reduction. A learningalgorithm is described based on a - reduction andanalyzed by using probability approximate correctlearning results. The results are obtained byreducing a rule set to an equivalent set of kDNFformulas. The goal of the learning algorithm is toinduce a compact rule set describing the basicdependencies within a set of data. The reduction isbased on criterion which is very flexible and gives asemantic interpretation of the rules which fulfillthe criteria. Comparison with syntacticalhypotheses reduction show that the a - reductionimproves search and has a smaller probability ofmissclassification. 000775On Randomized Algebraic Test ComplexityPeter Buergisser, Marek Karpinski, Thomas Lickteigtr-92-070199210__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-070.pdfWe investigate the impact of randomization on thecomplexity of deciding membership in a(semi-)algebraic subset $X \subset \rr^m$.Examples are exhibited where allowing for a certainerror probability $\epsilon$ in the answer of thealgorithms the complexity of decision problemsdecreases. A randomized$(\Omega^k,\{=,\leq\})$-decision tree ($k\subseteq\rr$ a subfield) over $m$ will be defined asa pair $(T,\mu)$ where $\mu$ a probability measure onsome $\rr^n$ and $T$ is a $(\Omega^k,\{=,\leq\})$-decision tree over $m+n$. We prove a general lowerbound on the average decision complexity for testingmembership in an irreducible algebraic subset $X\subset \rr^m$ and apply it to $k$-generic completeintersection of polynomials of the same degree,extending results in [4, 6]. We also giveapplications to nongeneric cases, such as graphs ofelementary symmetric functions,$\mbox{SL}(m,\rr)$, and determinant varieties,extending results in \cite{Li:90}. 000776An Efficient Parallel Algorithm for Computing a Maximal Independent Set in aHypergraph of Dimension 3Elias Dahlhaus, Marek Karpinski, Peter Kelsentr-92-071199210__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-071.pdfThe paper considers the problem of computing amaximal independent set in a hypergraph (see\cite{BL} and \cite{KR}). We present an efficientdeterministic NC algorithm for finding a maximalindependent set in a hypergraph of dimension $3$: thealgorithm runs in time $O(\log^4 n)$ time on $n+m$processors of an EREW PRAM and is optimal up to apolylogarithmic factor. Our algorithm adapts thetechnique of Goldberg and Spencer (\cite{GS}) forfinding a maximal independent set in a graph (orhypergraph of dimension $2$). It is the firstefficient NC algorithm for finding a maximalindependent set in a hypergraph of dimension greaterthan 2. 000777Network Support For Multimedia: A Discussion of the Tenet ApproachDomenico Ferrari, Anindo Banerjea, Hui Zhangtr-92-072199210__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-072.pdfMultimedia communication can be supported in anintegrated-services network in the generalframework of realtime communication. The TenetGroup has devised an approach that provides someinitial solutions to the realtime communicationproblem. This paper attempts to identify theprinciples behind these solutions. We also describea suite of protocols, and their implementations inseveral environments, that embody theseprinciples, and work in progress that will leadtowards more complete solutions. 000778Optimal Traversal of Directed HypergraphsGiorgio Ausiello, Giuseppe F. Italiano, Umberto Nannitr-92-073199209__000006A ``directed hypergraph'' is defined by a set of nodesand a set of ``hyperarcs'', each of which connects aset of ``source'' nodes to a single ``target'' node.Directed hypergraphs are used in several contexts tomodel different combinatorial structures, such asfunctional dependencies [20], Horn clauses inpropositional calculus [6], AND-OR graphs [17],Petri nets [18]. A ``hyperpath'', similarly to theanalogous notion of path in directed graphs,consists of a connection among nodes usinghyperarcs. Unlike paths in graphs, hyperpaths aresuitable of different definitions of measure,corresponding to different concepts arising invarious applications.

In this paper we consider theproblem of finding optimal hyperpaths according toseveral optimization criteria. We show that some ofthese problems are NP-hard but, if the measurefunction on hyperpaths matches certain conditions(namely if it is ``value-based''), the problem turnsout to be tractable. We describe efficientalgorithms and data structures to find optimalhyperpaths which can be used with any value-basedmeasure function, since it appears in parametricform. The achieved time bound is O(|H| + n log n) for ahypergraph with n nodes and an overall description ofsize |H|. Dynamic maintenance of optimal hyperpathsis also considered, and the proposed solutionsupports insertions of hyperarcs. 000779When is the Assignment Bound Tight for the Asymmetric Traveling-SalesmanProblem?Alan Frieze, Richard Karp, Bruce Reedtr-92-074199211__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-074.pdfWe consider the probabilistic relationship betweenthe value of a random asymmetric traveling salesmanproblem ATSP(M) and the value of its assignmentrelaxation AP(M). We assume here that the costs aregiven by an n\times n matrix M whose entries areindependently and identically distributed. Wefocus on the relationship betweenPr(ATSP(M)=AP(M)) and the probability p_n that anyparticular entry is zero. If np_n\rightarrow \inftywith n then we prove that ATSP(M)=AP(M) withprobability 1-o(1). This is shown to be best possiblein the sense that if np(n)\rightarrow c, c>0 andconstant, then Pr(ATSP(M)=AP(M))<1-\phi(c) forsome positive function phi. Finally, ifnp_n\rightarrow 0 thenPr(ATSP(M)=AP(M))\rightarrow 0. 000780Genetic and Non Genetic Operators in Alecsys - Revised VersionMarco Dorigotr-92-075199212__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-075.pdfIt is well known that standard learning classifiersystems, when applied to many different domains,exhibit a number of problems: payoff oscillation,difficult to regulate interplay between the rewardsystem and the background genetic algorithm (GA),rule chains instability, default hierarchiesinstability, are only a few. Alecsys is a parallelversion of a standard learning classifier system(CS), and as such suffers of these same problems. Inthis paper we propose some innovative solutions tosome of these problems. We introduce the followingoriginal features. Mutespec, a new genetic operatorused to specialize potentially useful classifiers.Energy, a quantity introduced to measure globalconvergence in order to apply the genetic algorithmonly when the system is close to a steady state.Dynamical adjustment of the classifiers setcardinality, in order to speed up the performancephase of the algorithm. We present simulationresults of experiments run in a simulatedtwo-dimensional world in which a simple agent learnsto follow a light source.

Keywords: learning classifier systems, genetic algorithms, robotics. 000781Approximate Evaluation of a Polynomial on a Set of Real PointsVictor Pantr-92-076199211__000006The previous best algorithm for approximateevaluation of a polynomial on a real set was due toRokhlin and required the order of mu + (nu superscript3) infinite precision arithmetic operations toapproximate [on a fixed bounded set X(m) of m + 1 realpoints] a degree n polynomial p(x) = (sum(superscript n) (subscript i=0)) (p subscript i) (xsuperscript i) within the error bound (2 superscript-u) (sum (superscript n) (subscript i=0)) |psubscript i|. We develop an approximationalgorithm, which decreases Rokhlin's recordestimate to O(m (log superscript 2) u + n min (u, logn)). For log u = o(log n), this result may also befavorably compared with the record bound O((m+n)(log superscript 2) n) on the complexity of the exactmultipoint polynomial evaluation. The newalgorithm can be performed in the fields (or rings)generated by the input values, which enables us todecrease the precision of the computations [by usingmodular (residue) arithmetic] and to simplify ourcomputations further in the case where u = O(log n).Our algorithm allows NC and simultaneouslyprocessor efficient parallel implementation.Because of the fundamental nature of the multipointpolynomial evaluation, our results have furtherapplications to numerical and algebraiccomputational problems. By passing, we also show asubstantial improvement in the Chinese remainderalgorithm for integers based on incorporatingKaminski's fast residue computation. 000782Polynomial Uniform Convergence and Polynomial-Sample LearnabilityAlberto Beroni, Paola Campadelli, Anna Morpurgo, Sandra Panizzatr-92-077199211__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-077.pdfIn the PAC model, polynomial-sample learnability inthe distribution dependent framework has beencharacterized in terms of minimun cardinality of$\epsilon$-covers. In this paper we propose anotherapproach to the problem by investigating therelationship between polynomial-samplelearnability and uniform convergence, in analogy towhat was done for the distribution free setting.First of all, we introduce the notion of polynomialuniform convergence, giving a characterization forit in terms of an entropic measure, then we study itsrelationship with polynomial- samplelearnability. We show that, contrarily to whathappens in the distribution independent setting,polynomial uniform convergence is a sufficient butnot necessary condition for polynomial-samplelearnability. 000783On Randomized Versus Deterministic ComputationMarek Karpinski, Rutger Verbeektr-92-078199211__000006In contrast to deterministic or nondeterministiccomputation, it is a fundamental open problem inrandomized computation how to separate differentrandomized time classes (at this point we do not evenknow how to separate linear randomized time from O(nsuperscript (log n)) randomized time) or how tocompare them relative to correspondingdeterministic time classes. In another words we arefar from understanding the power of ``random cointosses'' in the computation, and the possible ways ofsimulating them deterministically.

In this paper westudy the relative power of linear and polynomialrandomized time compared with exponentialdeterministic time. Surprisingly, we are able toconstruct an oracle A such that exponential time(with or without the oracle A) is simulated by lineartime Las Vegas algorithms using the oracle A. We arealso able to prove, for the first time, that in somesituations the randomized reductions areexponentially more powerful than deterministicones (cf. [Adleman, Manders, 1977]).

Furthermore, aset B is constructed such that Monte Carlo polynomialtime (BPP) under the oracle B is exponentially morepowerful than deterministic time withnondeterministic oracles. This strengthensconsiderably a result of Stockmeyer [St85] about thepolynomial time hierarchy that for some decidableoracle B, (BPP superscript B) (not subseteq)(Deltasubscript 2)(P superscript B). Under our oracle BPPsuperscript B is exponentially more powerful than(Delta subscript 2)(P superscript B), and B does notadd any power to (Delta subscript 2)(EXPTIME). 000784Computation of the Additive Complexity of Algebraic Circuits with RootExtractingMarek Karpinski, Rutger Verbeektr-92-079199211__000006We design an algorithm for computing the generalized(algebraic circuits with root extraction)``additive complexity'' of any rational function.It is the first computability result of this sort onthe additive complexity of algebraic circuits (cf.[SW80]). 000785Simulating Threshold Circuits by Majority CircuitsMikael Goldmann, Marek Karpinskitr-92-080199212__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-080.pdfWe prove that a single threshold gate can be simulatedby an explicit polynomial size depth 2 majoritycircuit. In general we show that a depth d thresholdcircuit can be simulated uniformly by a majoritycircuit of depth d+1. Goldmann, Hastad and Razborovdemonstrated that a non-uniform simulation exists.Our construction answers two open questions posed intheir work: We give an explicit construction whereasGoldmann, Hastad and Razborov use a randomizedexistence argument, and we show that such asimulation is possible even if the depth d grows withthe number of variables n (the simulation in theirwork gives polynomial size circuits only when d isconstant). 000786Connectionist Probability Estimation in HMM Speech RecognitionSteve Renals, Nelson Morgantr-92-081199212__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-081.pdfThis report is concerned with integratingconnectionist networks into a hidden Markov model(HMM) speech recognition system. This is achievedthrough a statistical understanding ofconnectionist networks as probability estimators,first elucidated by Herve- Bourlard. We review thebasis of HMM speech recognition, and point out thepossible benefits of incorporating connectionistnetworks. We discuss some issues necessary to theconstruction of a connectionist HMM recognitionsystem, and describe the performance of such asystem, including evaluations on the DARPAdatabase, in collaboration with Mike Cohen andHoracio Franco of SRI International. In conclusion,we show that a connectionist component improves astate of the art HMM system. 000787Perfect Zero-Knowledge Arguments for NP Can Be Based on General ComplexityAssumptionsMoni Naor, Rafail Ostrovskytr-92-082199212__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-082.pdf"Zero-knowledge arguments" is a fundamental cryptographic primitivewhich allows one polynomial-time player to convince anotherpolynomial-time player of the validity of an NP statement, withoutrevealing any additional information in the information-theoreticsense. Despite their practical and theoretical importance, it wasonly known how to implement zero-knowledge arguments based on specificalgebraic assumptions; basing them on a general complexity assumptionwas open since their introduction in 1986 [BCC, BC, CH]. In thispaper, we finally show a general construction, which can be based onany one-way permutation.

We stress that our scheme is "efficient": both players can executeonly polynomial-time programs during the protocol. Moreover, thesecurity achieved is "on-line": in order to cheat and validate afalse theorem, the prover must break a cryptographic assumptionon-line "during the conversation", while the verifier can not find(ever!) any information unconditionally (in the informationtheoretic sense). 000788Invariant Signatures and Non-Interactive Zero-Knowledge Proofs areEquivalentShafi Goldwasser, Rafail Ostrovskytr-92-083199212__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1992/tr-92-083.pdfThe standard definition of digital signaturesallows a document to have many valid signatures. Inthis paper, we consider a subclass of digitalsignatures, called invariant signatures, in whichall legal signatures of a document must be identicalaccording to some polynomial-time computablefunction (of a signature) which is hard to predictgiven an unsigned document. We formalize this notionand show its equivalence to non-interactivezero-knowledge proofs. 000789Implicit Parallelism in Genetic AlgorithmsAlberto Bertoni, Marco Dorigotr-93-001199301__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-001.pdfThis paper is related to Holland's result on implicitparallelism. Roughly speaking, Holland showed a lowerbound of the order of (n^3)/(c1*sqrt(l)) to the numberof schemata usefully processed by the genetic algorithmin a population of n=c1*2^l binary strings, with c1 a smallinteger. We analyze the case of population of n = 2*beta*lbinary strings where beta is a positive parameter (Holland'sresult is related to the case beta=1). In the main result,for all beta>0 we state a lower bound on the expected numberof processed schemata; moreover, we prove that this bound istight up to a constant for all beta>=1 and, in this case, westrengthen in probability the previous result.

Keywords: genetic algorithms, implicit parallelism. 000790Optimization Problems: Expressibility, Approximation Properties and ExpectedAsymptotic Growth of Optimal SolutionsT. Behrendt, K. Compton, E. Graedeltr-93-002199301__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-002.pdfWe extend the recent approach of Papadimitrou andYannakakis that relates the approximationproperties of optimization problems to theirlogical representation.

Our work builds on resultsby Kolaitis and Thakur who sytematically studied theexpressibility classes MS_n and MP_n ofmaximization problems and showed that they form ashort hierarchy of four levels. The two lowestlevels, MS_0 and MS_1 coincide with the classes MaxSnp and Max Np of Papadimitriou and Yannakakis; theycontain on ly problems that are approximable inpolynomial time up to a constant factor and thusprovide a logical criterion for approximability.However, there are computationally very easymaximization problems, such as Maximum ConnectedComponent (MCC) that fail to satisfy this criterion.

We modify these classes by allowing the formulae tocontain predicates that are definable in leastfixpoint logic. In addition, we maximize not onlyover relations but also over constants. We call theextended classes MSF_i and MPF_i. The proof ofPapadimitriou and Yannakakis can be extended toMSF_1 to show that all problems in this class areapproximable. Some problems, such as MCC, descendfrom the highest level in the original hierarchy tothe lowest level MSF_0 in the new hierarchy. Thus ourextended class MSF_1 provides a more powerfulsufficient criterion for approximability than theoriginal class MS_1.

We separate the extendedclasses and prove that a number of important problemsdo not belong to MSF_1. These include Max Clique, MaxIndependent Set, V-C Dimension and Max CommonInduced Subgraph.

To do this we introduce a new methodthat characterizes rates of growth of aveage optimalsolution sizes. For instance, it is known that theexpected size of a maximal clique in a random graphgrows logarithmically with respect to thecardinality of the graph. We show that no problem inMSF_1 can have this property, thus proving that MaxClique is not in MSF_1. This technique is related tolimit laws for various logics and to theprobabilistic method from combinatorics. Webelieve that this method may be of independentinterest.

In contrast to the recent results on thenon-approximability of many maximizationproblems, among them Max Clique, our results do notdepend on any unproved hypothesis from complexitytheory, such as P does not equal NP. 000791Simple Multivariate Polynomial MultiplicationVictor Pantr-93-003199308__000006We observe that polynomial evaluation andinterpolation can be performed fast over amultidimensional grid (lattice), and we apply thisobservation in order to obtain the bounds M(c,m) isgreater than or equal to (c superscript m)(1 + m + 1.5m +2 (log subscript 2) c) over the fields of constantssupporting FFT on c points, c being a power of 2, and M(c, m) = 0[N log N log log c], over any field, where N = (csuperscript m), and M (c, m) denotes the number ofarithmetic operations required in order to multiply(over any field F) a pair of m-variate polynomialswhose product has degree at most c - 1 in each variable,so that M (c, m) = 0(N log N) if c=0(1), m right arrowinfinity (over any field F), versus the known bound ofO (N log N log log N). 000792Mixture Models and the EM Algorithm for Object Recognition withinCompositional Hierarchies. Part 1: RecognitionJoachim Utanstr-93-004199301__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-004.pdfWe apply the Expectation Maximization (EM)algorithm to an assignment problem where in additionto binary assignment variables analog parametersmust be estimated. As an example, we use the problem ofpart labelling in the context of model based objectrecognition where models are stored in from of acompositional hierarchy. This problem has beenformulated previously as a graph matching problemand stated in terms of minimizing an objectivefunction that a recurrent neural network solves.Mjolsness has introduced a "stochastic visualgrammar" as a model for this problem; there thematching problem arises from an index renumberingoperation via a permutation matrix. Theoptimization problem w.r.t the match variables isdifficult and Mean Field Annealing techniques areused to solve it. Here we propose to model the partlabelling problem in terms of a mixture ofdistributions, each describing the parameters of apart. Under this model, the match variablescorrespond to the a posteriori estimates of themixture coefficients. The parts in the input imageare unlabelled, this problem can be stated as missingdata problem and the EM algorithm can be used torecover the labels and estimate parameters. Theresulting update equations are identical to theElastic Net equations; however, the update dynamicsdiffer.

Keywords: EM algorithm, object recognition, compositional hierarchy, elastic matching, mean field annealing. 000793A Dynamic Connection Management Scheme for Guaranteed Performance Servicesin Packet-Switching Integrated Services NetworksColin Parristr-93-005199301__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-005.pdfWith the demand for multimedia and computationalscience applications, guaranteed performancecommunication services have become a necessaryfeature of future high-speed networks. Thesecommunications services should possess a high levelof sophistication so that they can easily adapt thenetwork to the wide variety of applications soon to beseen, thereby allowing the network to increase itsavailability and flexibility. Availability is theability of the network to accommodate as manyreal-time clients as possible without violating anyclient's performance guarantees, whileflexibility is the ability to adapt to changingnetwork state and client demands in order to maintainthe performance guarantees and quality of servicepromised to the client. Flexibility also refers tothe ability of the network to easily increase thevariety of real-time services that it offers. It isour contention that availability and flexibilitycan be enhanced in a network by providing the networkwith the ability to modify the performanceparameters and/or the route of any guaranteedperformance connection in the network withoutviolating the previously made performancecontracts.

In this paper, we present a scheme fordynamically managing guaranteed performanceservice connections and experimental results toverify the correctness and usefulness of the scheme.The motivation for this scheme, Dynamic ConnectionManagement (DCM), is discussed, and detaileddescriptions of the DCM modification contracts andalgorithms are provided. A survey of guaranteedperformance services protocols, architectures,and routing algorithms are presented together withtheir relevance to this work. A simulator has beenbuilt, and preliminary experiments and analyseswere done on the scheme. The paper concludes with asummary and some topics for future work. 000794A Characterization of Multi-Party Interactive Multimedia ApplicationsClemens Szyperski, Giorgio Ventretr-93-006199301__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-006.pdfThis document tries to define and characterize aclass of applications called Multi-PartyInteractive Multimedia (MIM), for which manyexamples are given. This class includesapplications such as CSCW, teleconferencing, andremote education; its consideration in this reportis based on the observation that MIM applications areboth important and representative for the area ofhigh-performance real-time communication. Purelyfunctional criteria are used to capture the MIMclass, {i.e.} ones that are not related to anyparticular way of implementation. Thus, futuredirections are sketched that give some indicationson what a network architecture will need to provide,in order to effectively support such applications. 000795On Removing Randomness from a Parallel Algorithm for Minimum CutsMichael Luby, Joseph Naor, Moni Naortr-93-007199302__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-007.pdfThe weighted minimum cut problem in a graph is afundamental problem in combinatorialoptimization. Recently, Karger suggested arandomized parallel algorithm for this problem. Weshow that a similar algorithm can be implementedusing only $O(\log^2 n)$ random bits. We also showthat our result holds for computing minimum weightk-cuts, where k is fixed. 000796Galileo: a Tool for Simulation and Analysis of Real-Time NetworksEdward W. Knightly, Giorgio Ventretr-93-008199303__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-008.pdfGalileo is a flexible tool for simulation ofheterogeneous real-time communication networksand for development and verification of networkprotocols. Galileo provides several uniquefeatures that make it particularly suitable for thesimulation and analysis of networks that providequality-of-service guarantees. First, itsobject-oriented programming environment providesthe means for a modular, hierarchical,heterogeneous description of networks. Second, itsmultimedia device interface provides the tools for aqualitative analysis of network protocols.Finally, Galileo's network interface providesinteraction with actual networks to access real dataand simulate realistic multimedia scenarios. 000797On Deterministic Approximation of DNFMichael Luby, Boban Velickovictr-93-009199303__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-009.pdfWe develop efficient deterministic algorithms forapproximating the fraction of truth assignmentsthat satisfy a disjunctive normal form formula.Although the algorithms themselves aredeterministic, their analysis is probabilistic anduses the notion of limited independence betweenrandom variables. 000798Optimal Speedup of Las Vegas AlgorithmsMichael Luby, Alistair Sinclair, David Zuckermantr-93-010199303__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-010.pdfLet A be a Las Vegas algorithm, i.e., A is a randomizedalgorithm that always produces the correct answerwhen it stops but whose running time is a randomvariable. We consider the problem of minimizing theexpected time required to obtain an answer from~Ausing strategies which simulate A as follows: run Afor a fixed amount of time t_1, then run Aindependently for a fixed amount of time t_2, etc. Thesimulation stops if A completes its execution duringany of the runs. Let S=(t_1,t_2,...) be a strategy,and let \ell_A=inf_{S}T(A,S), where T(A,S) is theexpected value of the running time of the simulationof A under strategy S.

We describe a simple universalstrategy Sopt, with the property that, for anyalgorithm A, T(A,S^univ)=O(\ell_A log(\ell_A)).Furthermore, we show that this is the bestperformance that can be achieved, up to a constantfactor, by any universal strategy. 000799Graceful Adaptation of Guaranteed Performance Service ConnectionsColin Parris, Giorgio Ventre, Hui Zhangtr-93-011199303__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-011.pdfMost of the solutions proposed to support real-timecommunication services in a packet-switchingnetwork adopt a connection-oriented andreservation-oriented approach. In this approach,the resource allocation and route selectiondecisions are made before the start of theapplication on the basis of resource availabilityand real-time network load at that time, and areusually kept for the duration of the application.However, such an approach shows two majorlimitations: first, the communication serviceprovided is usually fixed, with no or limitedcapability of adaptation to dynamic changes in theclients' requirements; second, a low utilization ofthe network may be observed. In this paper, we presenta flexible management scheme that allows gracefuladaptation of guaranteed performance serviceconnections. Mechanisms have been devised to allowchanging of the traffic and performance parametersof a real-time communication during its lifetime.These mechanisms, together with an adaption policy,can make more efficient use of the network resourcesby performing cooperative, consenting, high-levelmultiplexing. We distinguish between two types ofadaptation: client initiated adaptation andnetwork initiated adaptation. We give examples forboth types and we also present results fromsimulation experiments to verify the correctnessour proposal. 000800Estimation of noise spectrum and its application to SNR-estimation andspeech enhancementHans-Günter Hirschtr-93-012199303__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-012.pdfOne possible solution to improve recognition ofnoisy speech is the application of noise reductiontechniques. Spectral subtraction is one well knowntechnique to reduce stationary background noise incase of recording with a single microphone. Anestimation of the noise spectrum is necessary toapply this method. The determination of segmentscontaining just noise is usually a difficult task.This report describes a method to estimate the noisespectrum without the need of distinguishing betweensegments of noisy speech and segments of pure noise.The estimation of noise power inside one subband isbased on an analysis of the histogram of a certainnumber of past short-term energy values inside thissubband. This technique for estimating the noisespectrum can be used to estimate the actualsignal-to-noise ratio (SNR). Another applicationis the integration inside a spectral subtractiontechnique for speech enhancement. 000801Optimal Stochastic Quadrature Formulas For Convex FunctionsErich Novak, Knut Petrastr-93-013199303__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-013.pdfWe study optimal stochastic (or Monte Carlo)quadrature formulas for convex functions. Whilenonadaptive Monte Carlo methods are not better thandeterministic methods we prove that adaptive MonteCarlo methods are much better. 000802Optimal Recovery and n-Widths For Convex Classes of FunctionsErich Novaktr-93-014199303__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-014.pdfWe study the problem of optimal recovery in the case ofa nonsymmetric convex class of functions. Inparticular we show that adaptive methods may be muchbetter than nonadaptive methods. We define certainGelfand-type widths that are useful fornonsymmetric classes and prove relations to optimalerror bounds for adaptive and nonadaptive methods,respectively. 000803Channel Groups: A Unifying Abstraction for Specifying Inter-stream RelationshipsAmit Gupta, Mark Morantr-93-015199303__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-015.pdfA single distributed application typicallyrequires setting up a number of real-timeconnections, or channels. Current schemes usuallyassume that different channels are independent,when in reality, important relationships oftenexist between them. We introduce a new abstractioncalled channel groups that allows network clients todescribe these relationships explicitly to thenetwork service provider. For example, bydescribing sharing relationships betweenchannels, the network client enables the network toshare resource allocations among relatedchannels-lowering the cost and improving thescalability of communication. In addition,specification of other relationships, such asinter-stream synchronization, disjoint-pathrouting, relative dropping priorities, andsimultaneous establishment provide a richer, moreefficient service. Channel groups provide aunifying abstraction and an easily-extensibleinterface for specifying these and otherrelationships. This report presents a generaldescription of the channel group abstraction anddemonstrates its usefulness in describing severaltypes of inter-stream relationships. 000804Accelerated Solution of the Tridiagonal Symmetric Eigenvalue ProblemVictor Pantr-93-016199303__000006We present new algorithms that accelerate thebisection method for the symmetric eigenvalueproblem. The algorithms rely on some new techniques,which include acceleration of Newton's iterationand can also be further applied to acceleration ofsome other iterative processes, in particular, ofiterative algorithms for approximating polynomialzeros. 000805Efficient Multicasting for Interactive Multimedia ApplicationsClemens Szyperski, Giorgio Ventretr-93-017199303__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-017.pdfA specific class of multimedia applications isexpected to be of importance for futurecommunication networks: Multi-Party InteractiveMultimedia (MIM). Based on the isolation andcharacterization of MIM applications, concretenetwork support requirements are derived in thispaper. The varying degree of connectivity, thevastly different sizes in terms of participants andthe reliance on a guaranteed Quality of Service makeMIM support a difficult problem. Starting with thedefinition of multimedia communicationabstractions, principles of solutions aresketched. For an important subclass of applicationsa particularly efficient and practicablealternative implementation based on half-duplexchannels is introduced. Finally, interfaces at boththe transport and network layers are considered. 000806Navigation Without Perception of Coordinates and DistancesArmin Hemmerlingtr-93-018199303__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-018.pdfWe consider the target--reaching problem in planescenes for a point robot which has a tactile sensor andcan locate the target ray. It might have a compass,too, but it is not able to perceive the coordinates ofits position nor to measure distances. Thecomplexity of an algorithm is measured by the numberof straight moves until reaching the target, as afunction of the number of vertices of the (polygonal)scene.

It is shown how the target point can be reachedby exhaustive search without using a compass, withthe complexity exp(O(n^{2})). Using a compass,there is a target--reaching algorithm, based onrotation counting, with the complexity O(n^{2}).

The decision problem, to recognize if the targetcannot be reached because it belongs to an obstacle,cannot be solved by our type of robot. If the behaviourof a robot without compass is periodic in ahomogeneous environment, it cannot solve thetarget--reaching problem.

Keywords: motion planning, on-line algorithms, labyrinth problems, exhaustive search, rotation counting, trap constructions, power of compass. 000807Matchings in Lattice Graphs (Preliminary Version)Claire Kenyon, Dana Randall, Alistair Sinclairtr-93-019199303__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-019.pdfWe study the problem of counting the number ofmatchings of given cardinality in a d-dimensionalrectangular lattice. This problem arises in severalmodels in statistical physics, includingmonomer-dimer systems and cell-cluster theory. Aclassical algorithms due to Fisher, Kasteleyn andTemperley counts perfect matchings exactly in twodimensions, but is not applicable in higherdimensions and does not allow one to count matchingsof arbitrary cardinality. In this paper, we presentthe first efficient approximation algorithms forcounting matchings of arbitrary cardinality in(i)~d-dimensional ``periodic'' lattices (i.e.,with wrap-around edges) in any fixed dimension~d;and (ii)~two-dimensional lattices with ``fixedboundary conditions'' (i.e., no wrap-aroundedges). Our technique generalizes to approximatelycounting matchings in any bipartite graph that is theCayley graph of some finite group. 000808Design Principles of Parallel Operating Systems: ---A PEACE Case Study---Wolfgang Schröder-Preikschattr-93-020199304__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-020.pdfForthcoming massively parallel systems aredistributed memory architectures. They consist ofseveral hundreds to thousands of autonomousprocessing nodes interconnected by a high-speednetwork. A major challenge in operating systemdesign for massively parallel architectures is todesign a structure that reduces system bootstraptime, avoids bottlenecks in serving system calls,promotes fault tolerance, is dynamicallyalterable, and application-oriented. In additionto that, system-wide message passing is demanded tobe of very low latency and very high efficiency. Stateof the art parallel operating systems design mustobey the maxim not to punish an application byunneeded system functions. This requires to design aparallel operating system as a family of programmodules, with parallel applications being anintegral part of that family, and motivates objectorientation to achieve an efficientimplementation.

Keywords: MIMD systems, parallel operating systems, microkernel family, object orientation 000809CNS-1 Architecture Specification: A Connectionist Network SupercomputerKrste Asanovic, James Beck, Tim Callahan, Jerry Feldman, Bertrand Irissou, Brian Kingsbury, Phil Kohn, John Lazzaro, Nelson Morgan, David Stoutamire, John Wawrzynektr-93-021199304__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-021.pdfA Collaboration of the University of California,Berkeley and the International Computer ScienceInstitute. The Connectionist NetworkSupercomputer, or CNS-1, is a multi-year effort toincorporate recent advances in VLSI design andapplication-specific computer architecture forthe realization of a massively parallel machine.Application targets for the CNS-1 includeconnectionist networks in the areas of speechrecognition, language modeling, vision, andhardware simulation for VLSI. This technical reportpresents the background and motivation forhigh-level design decisions, along withdescriptions of several hardware and softwareelements. The document represents a "snapshot" ofthe design, which is expected to be operational in1995.

Keywords: connectionist networks, VLSI, computer architecture, Torrent, Hydrant, application-specific, massively parallel. 000810A Multivalued Evolutionary AlgorithmHans-Michael Voigt, Joachim Born, Ivan Santibanez-Koreftr-93-022199304__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-022.pdfWith this paper we present a MultivaluedEvolutionary Algorithm (MEA) which is inspired byfuzzy set theory. The genetic representation andencoding is done in such a way that no inferences can bedrawn from phenotype to genotype. Thisrepresentation influences the used geneticoperators. The basic operators of the algorithm willbe explained and comparisons for globaloptimization problems with recently publishedresults will be presented. 000811Training Agents to Perform Sequential BehaviorMarco Colombetti, Marco Dorigotr-93-023199309__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-023.pdfThis paper is concerned with training an agent toperform sequential behavior. In previous work wehave been applying reinforcement learningtechniques to control a reactive robot. Obviously, apure reactive system is limited in the kind ofinteractions it can learn. In particular, it can onlylearn what we call pseudo-sequences, that issequences of actions in which the transition signalis generated by the appearance of a sensorialstimulus. We discuss the difference betweenpseudo-sequences and proper sequences, and theimplication that these differences have on trainingprocedures. A result of our research is that, in caseof proper sequences, for learning to be successfulthe agent must have some kind of memory; moreover it isoften necessary to let the trainer and the learnercommunicate. We study therefore the influence ofcommunication on the learning process. First weconsider trainer-to-learner communicationintroducing the concept of reinforcement sensor,which let the learning robot explicitly know whetherthe last reinforcement was a reward or a punishment;we also show how the use of this sensor induces thecreation of a set of error recovery rules. Then weintroduce learner-to-trainer communication,which is used to disambiguate indeterminatetraining situations, that is situations in whichobservation alone of the learner behavior does notprovide the trainer with enough information todecide if the learner is performing a right or a wrongmove. All the design choices we make are discussed andcompared by means of experiments in a simulatedworld.

Keywords: machine learning, adaptive systems, genetic algorithms, learning classifier systems, behavior-based robotics, reinforcement learning. 000812Generalized Vandermonde Determinants over the Chebyshev BasisThorsten Werthertr-93-024199304__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-024.pdfThe recent developments in the area of interpolationand learnability of sparse polynomials over thereals are based on the nonsingularity of thegeneralized Vandermonde matrix. In this paper westudy real polynomials that admit sparserepresentations in the Chebyshev basis. The mainresult of the paper states the analogy of Michell'stheorem for the Chebyshev case, i.e. the determinantof the generalized Vandermonde matrix build over theChebyshev basis can be represented in this basis asthe product of the standard Vandermonde determinantand a polynomial with nonnegative integercoefficients. An immediate consequence of thisresult is the nonsingularity of Vandermondematrices over the Chebyshev basis provided that theindeterminates take distinct values greater 1.

As anapplication, we investigate the relationshipbetween the number of real roots of a polynomial andits sparsity with respect to the Chebyshev basis. Weprove that the number of real zeros of a polynomial,either to the left or to the right of the interval oforthogonality, does not exceed its sparsity withrespect to the Chebyshev basis. The bound on thenumber of real roots is used to prove finiteness of theVapnik- Chervonenkis dimension (and therebyuniform learnability) of the class of polynomials ofbounded sparsity over the Chebyshev basis. 000813Kohonen Feature Maps and Growing Cell Structures --a Performance ComparisonBernd Fritzketr-93-025199305__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-025.pdfA performance comparison of two self-organizingnetworks, the Kohonen Feature Map and the recentlyproposed Growing Cell Struc- tures is made. For thispurpose several performance criteria forself-organizing networks are proposed andmotivated. The models are tested with three exampleproblems of increasing difficulty. The KohonenFeature Map demonstrates slightly superior resultsonly for the simplest problem. For the other moredifficult and also more realistic problems theGrowing Cell Structures exhibit significantlybetter performance by every criterion. Additionaladvantages of the new model are that all parametersare constant over time and that size as well asstructure of the network are determinedautomatically.

Keywords: feature map, incremental learning, Kohonen

* Presented at NIPS 5 in Denver 000814Growing Cell Structures - a self-organizing network for unsupervised andsupervised learningBernd Fritzketr-93-026199305__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-026.pdfWe present a new self-organizing neural networkmodel having two variants. The first variantperforms unsupervised learning and can be used fordata visualization, clustering, and vectorquantization. The main advantage over existing ap-proaches, e.g., the Kohonen feature map, is theability of the model to automatically find a suitablenetwork structure and size. This is achieved througha controlled growth process which also includesoccasional removal of units. The second variant ofthe model is a supervised learning method whichresults from the combination of the abovementionedself-organizing network with the radial basisfunction (RBF) approach. In this model it is possible- in con- trast to earlier approaches - to perform thepositioning of the RBF units and the supervisedtraining of the weights in parallel. Therefore, thecurrent classification error can be used todetermine where to insert new RBF units. This leads tosmall networks which generalize very well. Resultson the two- spirals benchmark and a vowelclassification problem are present- ed which arebetter than any results previously published.

Keywords: Self-organization, incremental learning, radial basis function, clustering, data visualization, pattern classification, two spiral problem, feature map 000815A Stochastic Model of Actions and Plans for Anytime Planning underUncertaintySylvie Thiebaux, Joachim Hertzberg, William Shoaff, Moti Schneidertr-93-027199305__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-027.pdfBuilding planning systems that operate in realdomains requires coping with both uncertainty andtime pressure. This paper describes a model ofreaction plans, which are generated using aformalization of actions and of state descriptionsin probabilistic logic, as a basis for anytimeplanning under uncertainty.

The model has thefollowing main features. At the action level, wehandle incomplete and ambiguous domaininformation, and reason about alternative actioneffects whose probabilities are given. On thisbasis, we generate reaction plans that specifydifferent courses of action, reflecting the domainuncertainty and alternative action effects; ifgeneration time was insufficient, these plans may beleft unfinished, but they can be reused,incrementally improved, and finished later. At theplanning level, we develop a framework for measuringthe quality of plans that takes domain uncertaintyand probabilistic information into account usingMarkov chain theory; based on this framework, one candesign anytime algorithms focusing on those parts ofan unfinished plan first, whose completion promisesthe most ``gain''. Finally, the plan quality can beupdated during execution, according to additionalinformation acquired, and can therefore be used foron-line planning. 000816pSather: Layered Extensions to an Object-Oriented Language for EfficientParallel ComputationStephan Murer, Jerome A. Feldman, Chu-Cheow Lim, Martina-Maria Seideltr-93-028199306__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-028.pdfpSather is a parallel extension of the existing object-oriented languageSather. It offers a shared-memory programming model which integrates bothcontrol- and data-parallel extensions. This integration increases theflexibility of the language to express different algorithms and datastructures, especially on distributed-memory machines (e.g.\ CM-5). Thisreport describes our design objectives and the programming language pSatherin detail. 000817Labeling RAAMAlessandro Sperdutitr-93-029199305__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-029.pdfIn this report we propose an extension of theRecursive Auto-Associative Memory (RAAM) byPollack. This extension, the Labeling RAAM (LRAAM),is able to encode labeled graphs with cycles byrepresenting pointers explicitly. A theoreticalanalysis of the constraints imposed on the weights bythe learning task under the hypothesis of perfectlearning and linear output units is presented.Cycles and confluent pointers result to beparticularly effective in imposing constraints onthe weights. Some technical problems encountered inthe RAAM, such as the termination problem in thelearning and decoding processes, are solved morenaturally in the LRAAM framework. Therepresentations developed for the pointers seem tobe robust to recurrent decoding along a cycle. Dataencoded in a LRAAM can be accessed by pointer as well asby content. The direct access by content can beachieved by transforming the encoder network of theLRAAM in a Bidirectional Associative Memory (BAM).Different access procedures can be definedaccording to the access key. The access proceduresare not wholly reliable, however they seem to have ahigh likelihood of success. A geometricinterpretation of the decoding process is given andthe representations developed in the pointer spaceof a two hidden units LRAAM are presented anddiscussed. In particular, the pointer space resultsto be partitioned in a fractal-like fashion. Someeffects on the representations induced by theHopfield-like dynamics of the pointer decodingprocess are discussed and an encoding scheme able toretain the richness of representation devised by thedecoding function is outlined. The application ofthe LRAAM model to the control of the dynamics ofrecurrent high-order networks is briefly sketchedas well. 000818Sensitivity of Boolean Functions, Harmonic Analysis, and Circuit ComplexityAnna Bernasconi, Bruno Codenottitr-93-030199306__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-030.pdfWe exploit the notion of sensitivity of Booleanfunctions to find complexity results. We firstanalyze the distribution of the average sensitivityover the set of all the Boolean functions, and showsome applications of this analysis. We then useharmonic analysis on the cube to study how the averagesensitivity of a Boolean function propagates if thefunction corresponds, e.g., to an oracle availableto compute another function. We use this relation toprove that symmetric functions in $AC^0$ haveexponentially decreasing average sensitivity. 000819On Some Stability Properties of the LRAAM ModelAlessandro Sperdutitr-93-031199306__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-031.pdfIn this report we discuss some mathematicalproperties of the LRAAM model. The LRAAM model is anextension of the RAAM model by Pollack. It allows oneto obtain distributed reduced representations oflabeled graphs. In particular, we give sufficientconditions on the asymptotical stability of thedecoding process along a cycle of the encodedstructure. Data encoded in an LRAAM can also beaccessed by content by transforming the LRAAM in ananalog Hopfield network with hidden units andasymmetric connection matrix (CA network.)Different access procedures can be definedaccording to the access key. Each access procedurecorresponds to a particular constrained version ofthe CA network. We give sufficient conditions underwhich the property of asymptotical stability of afixed point in one particular constrained version ofthe CA network can be extended to related fixed pointsof different constrained versions of the CA network.An example of encoding of a labeled graph on which thetheoretical results are applied is given as well. 000820Repetitive Hidden-Surface-Removal for PolyhedraMarco Pellegrinitr-93-032199307__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-032.pdfThe repetitive hidden-surface-removal problem canbe rephrased as the problem of finding the mostcompact representation of all views of a polyhedralscene that allows efficient on-line retrieval of asingle view. In this paper we present a novel approachto this problem. We assume that a polyhedral scene in3-space is given in advance and is preprocessedoff-line into a data structure. Afterwards, the datastructure is accessed repeatedly with view-pointsgiven on-line and the portions of the polyhedravisible from each view-point are produced on-line.This mode of operation is close to that of realinteractive display systems. The main difficulty isto preprocess the scene without knowing the queryview-points.

Let $n$ be the number total of edges,vertices and faces of the polyhedral objects and let$k$ be the number of vertices and edges of the image.The main result of this paper is that, using anoff-line data structure of size $m$ with$n^{1+\epsilon} \leq m \leq n^{2+\epsilon}$, it ispossible to answer on-line hidden-surface-removalqueries in time $O(k\log n + \min\{n\log n,kn^{1+\epsilon}/m^{1/2}\})$, when the scene iscomposed of $c$-oriented polyhedra. This datastructure accommodates dynamic insertion anddeletion of polyhedral objects. The polyhedra mayintersect and may have cycles in the dominancerelation. We also improve worst-case time/storagebounds for the repetitive hidden surface removalproblem when the polyhedral scene is composed ofunrestricted polyhedra.

Preliminary version ofthis work is in the Proceedings of the 1993 Workshop onAlgorithms and Data Structures. 000821Turning an Action Formalism Into a Planner---A Case StudyJoachim Hertzberg, Sylvie Thiebauxtr-93-033199307__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-033.pdfThe paper describes a case study that explores theidea of building a planner with a neat semantics of theplans it produces, by choosing some action formalismthat is ``ideal'' for the planning application andbuilding the planner accordingly. In general---andparticularly so for the action formalism used in thisstudy, which is quite expressive---this strategy isunlikely to yield fast and efficient planners if theformalism is used naively. Therefore, we adopt theidea that the planner approximates thetheoretically ideal plans, where the approximationgets closer, the more run time the planner is allowed.As the particular formalism underlying our studyallows a significant degree of uncertainty to bemodeled and copes with the ramification problem, weend up in a planner that is functionally comparable tomodern anytime uncertainty planners, yet is based ona neat formal semantics. 000822On Lines Missing Polyhedral Sets in 3-SpaceMarco Pellegrinitr-93-034199307__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-034.pdfWe show some combinatorial and algorithmic resultsconcerning sets of lines and polyhedral objects in3-space. Our main results include:

(1) An$O(n^32^{c\sqrt{\log n}})$ upper bound on the worstcase complexity of the set of lines missing astar-shaped compact polyhedron with $n$ edges,where $c$ is a suitable constant.

(2) An $O(n^32^{c\sqrt{\log n}})$ upper bound on the worst casecomplexity of the set of lines that can be moved toinfinity without intersecting a set of $n$ givenlines, where $c$ is a suitable constant. This bound isalmost tight.

(3) An $O(n^{1.5+\eps})$ randomizedexpected time algorithm that tests whether adirection $v$ exists along which a set of $n$ red linescan be translated away from a set of $n$ blue lineswithout collisions.

(4) Computing the intersectionof two polyhedral terrains in 3-space with $n$ totaledges in time $O(n^{4/3+\eps} + k^{1/3}n^{1+\eps} +k\log^2 n)$, where $k$ is the size of the output, and$\epsilon >0$ an arbitrary small but fixed constant.This algorithm improves on the best previous resultof Chazelle at al.

The tools used to obtain theseresults include Plucker coordinates of lines,random sampling and polarity transformations in3-space.

A preliminary version of this work appearedin the Proceedings of the 9th ACM Symposium onComputational Geometry. 000823Perturbation: An Efficient Technique for the Solution of Very LargeInstances of the Euclidean TSPB. Codenotti, G. Manzini, L. Margara, G. Restatr-93-035199307__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-035.pdfIn this paper we introduce a technique for buildingefficient iterated local search procedures. Thistechnique, that we call perturbation, uses globalinformation on TSP instances to speed-up and improvethe quality of the tours found by heuristic methods.The experimental results done on up to 100,000cities, show that our techniques outperform theknown methods for iterating local search for verylarge instances.

Keywords: TSP, sensitivity, perturbation, heuristics, experimental evaluation. 000824Sparse Interpolation from Multiple DerivativesThorsten Werthertr-93-036199307__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-036.pdfIn this note, we consider the problem ofinterpolating a sparse function from the values ofits multiple derivatives at some given point. We giveefficient algorithms for reconstructing sparseFourier series and sparse polynomials overSturm-Liouville bases. In both cases, the number ofevaluations is linear in the sparsity. 000825An Algorithm to Learn Read-Once Threshold Formulas, and some genericTransformations between Learning Models (Revised Version)Nader H. Bshouty, Thomas R. Hancock, Lisa Hellerstein, Marek Karpinskitr-93-037199307__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-037.pdfWe present a membership query (i.e. black boxinterpolation) algorithm for exactly identifyingthe class of read-once formulas over the basis ofboolean threshold functions. We also present acatalogue of generic transformations that can beused to convert an algorithm in one learning modelinto an algorithm in a different model. 000826Exploitation of Structured Gating Connections for the Normalization of aVisual PatternAlessandro Sperdutitr-93-038199307__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-038.pdfStructured gating connections can be useful toreduce the complexity of networks with a high numberof inputs. An example of their application to thenormalization of a visual pattern with respect toscale and position is presented. The use of gatingconnections allows us to have a linear number ofconnections in the number of pixels. The connectionsare also very localized. 000827Building convex space partitions induced by pairwise interior-disjointsimplicesMarco Pellegrinitr-93-039199308__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-039.pdfGiven a set $S$ of $n$ pairwise interior-disjoint$(d-1)$-simplices in $d$-space, for $d \geq 3$, aConvex Space Partition induced by $S$ (denoted$CSP(S)$) is a partition of $d$-space into convexcells such that the interior of each cell does notintersect the interior of any simplex in $S$. In thispaper it is shown that a $CSP(S)$ of size $O(n^{d-1})$can be computed deterministically in time$O(n^{d-1})$. These bounds are worst case optimalfor $d=3$. The results are proved using a variation ofthe efficient hierarchical cuttings of Chazelle. 000828Efficient PRAM Simulation on a Distributed Memory MachineR. Karp, M. Luby, F. Meyer auf der Heidetr-93-040199308__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-040.pdfWe present algorithms for the randomized simulationof a shared memory machine (PRAM) on a DistributedMemory Machine (DMM). In a PRAM, memory conflictsoccur only through concurrent access to the samecell, whereas the memory of a DMM is divided intomodules, one for each processor, and concurrentaccesses to the same module create a conflict. Thedelay of a simulation is the time needed to simulate aparallel memory access of the PRAM. Any generalsimulation of an m processor PRAM on a n processor DMMwill necessarily have delay at least m/n. Arandomized simulation is called time-processoroptimal if the delay is O(m/n) with high probability.Using a novel simulation scheme based on hashing weobtain a time-processor optimal simulation withdelay O(\loglog(n)\logstn). The best previoussimulations use a simpler scheme based on hashing andhave much larger delay. 000829Optimal Parallelization of Las Vegas AlgorithmsMichael Luby, Wolfgang Erteltr-93-041199309__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-041.pdfLet $A$ be a Las Vegas algorithm, i.e., $A$ is arandomized algorithm that always produces thecorrect answer when it stops but whose running time isa random variable. In\lit{LSZ93} a method wasdeveloped for minimizing the expected time requiredto obtain an answer from~$A$ using sequentialstrategies which simulate $A$ as follows: run $A$ fora fixed amount of time $t_1$, then run $A$independently for a fixed amount of time $t_2$, etc.The simulation stops if $A$ completes its executionduring any of the runs.

In this paper, we considerparallel simulation strategies for this sameproblem, i.e., strategies where many sequentialstrategies are executed independently in parallelusing a large number of processors. We present a closeto optimal parallel strategy for the case when thedistribution of $A$ is known. If the number ofprocessors is below a certain threshold, we show thatthis parallel strategy achieves almost linearspeedup over the optimal sequential strategy. Forthe more realistic case where the distribution of $A$is not known, we describe a universal parallelstrategy whose expected running time is only alogarithmic factor worse than that of an optimalparallel strategy. Finally, the application of thedescribed parallel strategies to a randomizedautomated theorem prover confirms the theoreticalresults and shows that in most cases good speedup canbe achieved up to hundreds of processors, even onnetworks of workstations. 000830Lower Bounds on Complexity of Testing Membership to a Polygon for Algebraicand Randomized Decision TreesDima Grigoriev, Marek Karpinskitr-93-042199308__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-042.pdfWe describe a new method for proving lower bounds foralgebraic decision trees. We prove, for the firsttime, that the minimum depth for arbitrary decisiontrees for the problem of testing the membership to apolygon with N nodes is Omega(log N). Moreover, weprove that the corresponding lower bound for therandomized decision trees matches the above bound.Finally, we prove that for the algebraic exp-logdecision trees (cf. [GSY 93]), the minimum depth isOmega(sqrt(log N)). We generalize the last result tothe multidimensional case, showing that if anexp-log decision tree tests a membership to asemialgebraic set with a sum of Betti numbers M, thenthe depth of a tree is at least Omega(sqrt(log M)). 000831Finite Branching Processes and AND/OR Tree EvaluationRichard Karptr-93-043199312__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-043.pdfThis paper studies tail bounds on supercriticalbranching processes with finite distributions ofoffspring. Given a finite supercritical branchingprocess Z_n\_0^{infinity}, we derive upper bounds,decaying exponentially fast as c increases, on theright-tail probability \Pr[Z_n > c E(Z_n)]. Weobtain a similar upper bound on the left-tailprobability \Pr[Z_n < \frac{E(Z_n)}c] under theassumption that each individual generates at leasttwo offspring. As an application, we observe that theevaluation of an AND/OR tree by a canonical algorithmin certain probabilistic models can be viewed as atwo-type supercritical finite branching process,and show that the execution time of this algorithm islikely to concentrate around its expectation. 000832An application of a neural net for fuzzy abductive reasoningMatthias Kaisertr-93-044199308__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-044.pdfThis is a description of a simple system that is able ofperforming abductive reasoning over fuzzy datausing a back-propagation neural net for thehypothesis generation process.

I will first outlineand exemplify the notion of abduction as a process ofbuilding hypotheses on the basis of a given set ofdata, evaluating them to find the best hypothesis andgive explanation for the made selection. I extendthis notion to account for abductive reasoning overfuzzy data. As an example I describe theclassification of objects according to fuzzysensory features into previously learnedcategories that were represented by a set of objectsdescribed by feature-value-pairs from whichprototypes are detected which form the center of acategory.

In the following a brief description of theback-propagation algorithm and a design of ademonstration system that is capable of carrying outabductive reasoning in a small example domain isgiven. The system is able to learn to classify kinds offruit given certain feature-value-pairs and todetect the most prototypicalfeature-value-pair-clusters within a category.The trained neural net is used for the hypothesisgeneration process. It also provides very criticalinformation for the evaluation and explanation ofhypotheses. I then discuss the implementation of anevaluation and explanation component using thespecific capabilities of the neural net. 000833Sather Iters: Object-Oriented Iteration AbstractionStephan Murer, Steve Omohundro, Clemens Szyperskitr-93-045199308__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-045.pdfSather iters are a powerful new way to encapsulateiteration. We argue that such iterationabstractions belong in a class' interface on an equalfooting with its routines. Sather iters were derivedfrom CLU iterators but are much more flexible andbetter suited for object-oriented programming. Wemotivate and describe the construct along withseveral simple examples. We compare it withiteration based on CLU iterators, cursors, riders,streams, series, generators, coroutines, blocks,closures, and lambda expressions. Finally, wedescribe how to implement them in terms of coroutinesand then show how to transform this implementationinto efficient code. 000834A Performance Analysis of the CNS-1 on Large, Dense Backpropagation NetworksSilvia M. Müllertr-93-046199309__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-046.pdfWe determine in this study the sustained performanceof the CNS-1 during training and evaluation of largemultilayered feedforward neural networks. Using asophisticated coding, the 128-node machine wouldachieve up to 111 GCPS and 22 GCUPS. During recall themachine would archieve 87% of the peakmultiply-accumulate performance. The training oflarge nets is less efficient than the recall but onlyby a factor of 1.5 to 2.

The benchmark is parallelizedand the machine code is optimized before analyzingthe performance. Starting from an optimal parallelalgorithm, CNS specific optimizations still reducethe run time by a factor of 4 for recall and by a factor of3 for training. Our analysis also yields somestrategies for code optimization.

The CNS-1 is stillin design, and therefore we have to model the run timebehavior of the memory system and theinterconnection network. This gives us the option ofchanging some parameters of the CNS-1 system in orderto analyze their performance impact.

Keywords: CNS, performance analysis, run time model, backpropagation, parallelization. 000835Source-to-Source Code Generation Based on Pattern Matching and DynamicProgrammingWeimin Chen, Volker Turautr-93-047199308__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-047.pdfThis paper introduces a new technique forsource-to-source code generation based on patternmatching and dynamic programming. This techniquecan be applied to all source and target-languageswhich satisfy some requirements. The maindifferences to conventional approaches are thecomplexity of the target language, the handling ofside effects caused by function calls and theintroduction of temporaries. Code optimization isachieved by introducing a new cost- model. Thetechnique allows an incremental development basedon improvements of the target library. These requireonly a modification of the rewriting rules sincethose are separated from the pattern matchingalgorithm. Experience of an successful applicationof our technique is given. 000836The Sublogarithmic Space WorldMaciej Liskiewicz, Ruediger Reischuktr-93-048199308__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-048.pdf(Pages: 42) This paper tries to fully characterizethe properties and relationships of space classesdefined by Turing machines that use less thanlogarithmic space -- may they be deterministic,nondeterministic or alternating (DTM, NTM or ATM).We provide several examples of specific languagesand show that such machines are unable to accept theselanguages. The basic proof method is a nontrivialextension of the (1 superscript n) right arrow (1superscript (n+n!)) technique to alternating TMs 000837Precise Average Case Complexity MeasuresRuediger Reischuktr-93-049199308__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-049.pdf(Pages: 36) A new definition is given for the averagegrowth of a function f : (Sigma superscript *) rightarrow (IN) with respect to a probability measure mu on(Sigma superscript *). This allows us to definemeaningful average case distributional complexityclasses for arbitrary time bounds (previously, onecould not guarantee arbitrary good precision). It isshown that basically only the ranking of the inputs bydecreasing probabilities are of importance.

Tocompare the average and worst case complexity ofproblems we study average case complexity classesdefined by a time bound and a bound on the complexity ofpossible distributions. Here, the complexity ismeasured by the time to compute the rank functions ofthe distributions. We obtain tight and optimalseparation results between these average caseclasses. Also the worst case classes can be embeddedinto this hierarchy. They are shown to be identical toaverage case classes with respect to distributionsof exponential complexity. 000838Interior point methods in semidefinite progrmming with applications tocombinatorial optimizationFarid Alizadehtr-93-050199309__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-050.pdfWe study the semidefinite programming problem(SDP), i.e the optimization problem of a linearfunction of a symmetric matrix subject to linearequality constraints and the additional conditionthat the matrix be positive semidefinite. First wereview the classical cone duality as specialized toSDP. Next we present aninterior point algorithmwhich converges to the optimal solution inpolynomial time. The approach is a direct extensionof Ye's projective method for linear programming. Wealso argue that most known interior point methods forlinear programs can be transformed in a mechanicalway to algorithms for SDP with proofs of convergenceand polynomial time complexity also carrying over ina similar fashion. Finally we study the significanceof these results in a variety of combinatorialoptimization problems including the general 0-1integer programs, the maximum clique and maximumstable set problems in perfect graphs, the maximum$k$-partite subgraph problem in graphs, and variousgraph partitioning and cut problems. As a result, wepresent barrier oracles for certain combinatorialoptimization problems (in particular, clique andstable set problem for perfect graphs) whose linearprogramming formulation requires exponentiallymany inequalities. Existence of such barrieroracles refutes the commonly believed notion that inorder to solve a combinatorial optimization problemwith interior point methods, one needs its linearprogramming formulation explicitly. 000839Dynamic maintenance of approximate solutions of Min-Weighted Node Cover andMin-Weighted Set Cover problemsGiorgio Gambosi, Marco Protasi, Maurizio Talamotr-93-051199309__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-051.pdfIn this paper, we introduce new algorithms for thedynamic maintenance of approximated solutions ofMin-Weighted Node Cover and Min-Weighted Set Cover.For what concerns Min-Weighted Node Cover, for anysequence of edge insertions and deletions, thealgorithms maintain a solution whose approximationratio (that is, the ratio between the approximate andthe optimum value) is equal to the best asymptotic onefor the static case. The algorithms require O(1) timefor edge insertion, while an O(1) amortized time isrequired for edge deletion.

For what regardsMin-Weighted Set Cover, we present dynamicalgorithms whose approximation ratio matches one ofthe two different and incomparable best approx-imate bounds for the static case. The time complexityfor element insertion and its amortized complexityfor element deletion are proportional to the maximumredundancy of an element in the approximatesolution. 000840On a Criterion for Minimum Uncertainty SensingVincenzo Cagliotitr-93-052199310__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-052.pdfThis is an invited article for theStructural Complexity Column, edited by JurisHartmanis, which will appear in the Bulletin EATCS inOctober 1993. The scope of the article is indicated inthe following list of Sections: 1. Overview ofInformation-Based Complexity 2. BreakingIntractability 3. Verification 4. CombinatorialComplexity 5. Similarities and Differences withDiscrete Complexity 6. Brief History 7. Appendix 8.References 000841On a Criterion for Minimum Uncertainty SensingVincenzo Cagliotitr-93-053199309__000006A criterion is presented for the automatic selectionof a sensor detection aimed at observing the state of asystem, which is described both by discrete variableand by continuous ones. The criterion is based on theexpected value of the entropy variation relative tothe transition associated to the sensorobservation. This criterion is then applied toobject recognition and localization tasks, in whichthe observed system is described by object class(i.e., a discrete variable) and by the objectposition (i.e. a vector of continuous parameters).The proposed criterion allows to account for theinformation obtained in the case the observed objectis missed by the measurement. Finally, a simpleexample is discussed concerning an observed systemconstituted by an object. The state of the observedsystem is described in terms of the object identityand the object position and orientation parameters.The sensors used to observe the system are anorientable range finder and a mobile camera. 000842An Investigation into Fault Recovery in Guaranteed Performance ServiceConnectionsColin J. Parris, Anindo Banerjeatr-93-054199310__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-054.pdfAs high speed networks are starting to provideguaranteed performance services, it is imperativethat fault recovery techniques be revised to supportthis new service. In this paper we investigate oneaspect of fault recovery in this context, thererouting of guaranteed performance connectionsaffected by link faults in the network. Recovery isachieved by rerouting the affected connection so asto avoid the failed link while ensuring that thetraffic and performance guarantees made along theprevious route are satisfied along the new route. Thegoal of the rerouting schemes is to reroute as much ofthe affected traffic as quickly and efficiently aspossible. We investigate rerouting along the linesof two orthogonal components: the locus of reroute,which determines the node that does route selectionand the new route selected; and the timing component,which determines when the individual rerouteattempts are initiated. Within each of these twocomponents we examine approaches that span thespectrum of that component. We compare all possiblecombinations of these approaches under across-section of network workloads, using in ourcomparisons a novel metric, the Queuing Delay LoadIndex, that captures both the bandwidth and delayresources required by a connection. Extensivesimulation experiments were conducted on thevarious combinations and their results and analysisare presented in the paper. 000843Testable Algorithms for Self-Avoiding WalksDana Randall, Alistair Sinclairtr-93-055199309__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-055.pdfWe present a polynomial time Monte Carlo algorithmfor almost uniformly generating and approximatelycounting self-avoiding walks in rectangularlattices. These are classical problems that arise,for example, in the study of long polymer chains.While there are a number of Monte Carlo algorithmsused to solve these problems in practice, these areheuristic and their correctness relies on unprovenconjectures. In contrast, our algorithm relies on asingle, widely-believed conjecture that is simplerthan preceding assumptions, and, more importantly,is one which the algorithm itself can test. Thus ouralgorithm is reliable, in the sense that it eitheroutputs answers that are guaranteed, with highprobability, to be correct, or finds acounterexample to the conjecture. 000844Dynamic Join and Leave for Real-Time MulticastWolfgang Effelsberg, Eberhard Müller-Menradtr-93-056199310__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-056.pdfMany new applications in networks require supportfor multicast communication. In addition,continuous data streams such as audio and videorequire real-time performance guarantees as aquality of service. We introduce a model forreal-time mulitcast channels and present a set ofscalable algorithms for the dynamic joining andleaving of destination nodes in this environment. Inparticular we present an algorithm for finding a goodattachment point to the multicast tree. We alsodescribe detailed admission tests that preserve theguarantees given to existing channels. Ouralgorithm for a leaving node specifies in particularthe resources to be released in the network. We alsodiscuss tree reorganization issues.

Keywords: multicast, dynamic, membership, multicast tree 000845Second Order Backpropagation - Efficient Computation of the Hessian Matrixfor Neural NetworksRaul Rojastr-93-057199309__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-057.pdfTraditional learning methods for neural networksuse some kind of gradient descent in order todetermine the network's weights for a given task.Some second order learning algorithms deal with aquadratic approximation of the error functiondetermined from the calculation of the Hessianmatrix, and achieve improved convergence rates inmany cases. We introduce in this paper second orderbackpropagation, a method to calculate efficientlythe Hessian of a linear network of one-dimensionalfunctions. This technique can be used to get explicitsymbolic expressions or numerical approximationsof the Hessian and could be used in parallel computersto improve second order learning algorithms forneural networks. It can be of interest also forcomputer algebra systems.
[A newer version of this method is found in the book: Raul Rojas,Neural Networks, 1996, Springer-Verlag, available in English andGerman.] 000846Towards a cognitively based approach of a a description of spatial deixisMatthias Kaisertr-93-058199311__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-058.pdfIn this presentation an approach towards adescription of spatial deixis based on theperceptual and cognitive abilities of humans isoutlined. After a basic introduction into spaceperception and representation the findings of thispart are taken to form the basis for acharacterization of the phenomenon of deixis as wellas the conceptual components of deictic expressionsin a natural language. For the analysis of deicticexpressions a cross-linguistic view is applied tofind on the one hand universal components of thoseexpressions but also a number of potentiallyinfluencing factors. The goal is to find featuresthat may be components of deictic expressions andthus must be considered in a general model of spatialdeixis which can serve to classify and describe themeaning of spatial deictic expressions in anynatural language. 000847Constructive Deterministic PRAM Simulation on a Mesh-Connected ComputerAndrea Pietracaprina, Geppino Pucci, Jop F. Sibeyntr-93-059199310__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-059.pdfThe PRAM model of computation consists of acollection of sequential RAM machines accessing ashared memory in lock-step fashion. The PRAM is a veryhigh-level abstraction of a parallel computer, andits direct realization in hardware is beyond reach ofthe current (or even foreseeable) technology. Inthis paper we present a deterministic simulationscheme to emulate PRAM computation on amesh-connected computer, a feasible machine whereeach processor has its own memory module and isconnected to at most four other processors viapoint-to-point links. In order to achieve a goodworst-case performance, any deterministicsimulation scheme has to replicate each variable in anumber of copies. Such copies are stored in the localmemory modules according to a Memory OrganizationScheme (MOS), which is known to all the processors. Avariable is then accessed by routing packets to itscopies. All deterministic schemes in the literaturemake use of a MOS whose existence is proved via theprobabilistic method, but that cannot beefficiently constructed. We introduce a newconstructive MOS, and show how to employ it tosimulate an $n$-processor PRAM on an $n$-nodemesh-connected computer. Our simulation achievesalmost optimal slowdown for small memories. This isthe first constructive deterministic PRAMsimulation on a bounded-degree network 000848Improved Band Matrix ComputationsVictor Pantr-93-060199309__000006We solve a band linear system of equations and computethe determinant of a band matrix in NC over the complexfield and its subfields and in RNC over any field. Ouralgorithms support the optimum bound on thepotential work (the product of time and processorbounds); moreover, the algorithms are in NCsuperscript 1 or RNC superscript 1 if the bandwidth isa constant. These results substantially improve theprevious records of [E]. 000849Optimum Parallel Computations with Band MatricesVictor Pantr-93-061199309__000006We devise optimum parallel algorithms for solving aband linear system of equations and for computing thedeterminant of a band matrix, substantiallyimproving the previous record computationalcomplexity estimates of [E]. All our algorithms arein NC or RNC and processor efficient; almost all ofthem reach the optimum bound on the potential work(the product of time and processor bounds).Moreover, these algorithms are in NC superscript 1 orRNC superscript 1 if the bandwidth is a constant. 000850A Formalization of ViewpointsGiuseppe Attardi, Maria Simitr-93-062199310__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-062.pdfWe present a formalisation for the notion of"viewpoint", a construct meant for expressingseveral varieties of relativised truth. Theformalisation consists in a logic which extendsfirst order predicate calculus through anaxiomatization of provability and with the additionof proper reflection rules. The extension is notconservative, but consistency is granted.Viewpoints are defined as set of reified meta-levelsentences. A proof theory for viewponts is developedwhich enables to carry out proofs of sentencesinvolving several viewpoints. A semantic account ofviewpoints is provided, dealing with issues of selfreferential theories and paradoxes, and exploitingthe notion of "contextual entailment". Notions suchas beliefs, knowledge, truth and situations can beuniformly modeled as provability in specialisedviewpoints, obtained by imposing suitableconstraints on viewpoints.

Keywords: meta-level, logics for truth, belief and knowledge, situations, contexts. 000851A Parallel Object-Oriented System for Realizing Reusable and Efficient DataAbstractionsChu-Cheow Limtr-93-063199310__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-063.pdf(319 pages) We examine the use of an object-orientedlanguage to make programming multiprocessorseasier for the general programmer. We choose anobject-oriented paradigm because we believe thatits support for encapsulation and software reuseallows users who are writing general applicationprograms to reuse class libraries designed by expertlibrary writers.

We describe the design,implementation and use of a parallelobject-oriented language: parallel Sather(pSather). PSather has a shared address spaceindependent of the underlying multiprocessorarchitecture, because we believe that thecooperative nature of parallel programs is mosteasily captured by a shared-memory-like model. Toaccount for distributed-memory machines, pSatheruses an abstract model in which processors aregrouped in clusters. Associated with a cluster is apart of the address space with fast access; access toother parts of the address space is $\leq 2$ orders ofmagnitude slower. PSather integrates both controland data-parallel constructs to support a variety ofalgorithmic styles. We have an implementation ofpSather on the CM-5. The prototype shows that even ondistributed-memory machines withouthardware/operating system support for a sharedaddress space, it is still practical and reasonablyefficient for the shared address abstraction to beimplemented in the compiler/runtime. Theexperience also helps us understand the features oflow-level libraries that are necessary for anefficient realization of a high-level language. Forexample, even though low message latency is crucial,the message-passing paradigm (active vs. passive,polling vs. interrupt-driven) is also important indeciding how easy and efficient the languageimplementation will be. We also study certainstraight-forward compiler optimizations. Severalabstractions and applications have been written forthe CM-5 using the shared-address cluster model, andwe have achieved reasonable speedups. In some cases,we can further demonstrate good absoluteperformance for pSather programs (by getting theirspeedups relative to a 1-processor C program). Someof the abstractions are reused in severalapplications, to show how the object-orientedconstructs facilitate code reuse. The workdescribed here supports our optimism that pSather isa practical and efficient parallel object-orientedlanguage. There are, however, still many issues thatneed to be explored in order to provide parallelprogramming environments as powerful as the ones weare accustomed to on sequential environments. In theconclusion, we summarize some of the possible futureresearch directions. 000852Engineering a Programming Language: The Type and Class System of SatherClemens Szypersky, Stephen Omohundro, Stephan Murertr-93-064199311__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-064.pdfSather 1.0 is a programming language whose design hasresulted from the interplay of many criteria. Itattempts to support a powerful object-orientedparadigm without sacrificing either thecomputational performance of traditionalprocedural languages or support for safety andcorrectness checking. Much of the engineeringeffort went into the design of the class and typesystem. This paper describes some of these designdecisions and relates them to approaches taken inother languages. We particularly focus on issuessurrounding inheritance and subtyping and thedecision to explicitly separate them in Sather. 000853An Efficient Probabilistic Context-Free Parsing Algorithm that ComputesPrefix ProbabilitiesAndreas Stolcketr-93-065199311__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-065.pdfWe describe an extension of Earley's parser forstochastic context-free grammars that computes thefollowing quantities given a stochasticcontext-free grammar and an input string: a)probabilities of successive prefixes beinggenerated by the grammar; b) probabilities ofsubstrings being generated by the nonterminals,including the entire string being generated by thegrammar; c) most likely (Viterbi) parse of thestring; d) posterior expected number ofapplications of each grammar production, asrequired for reestimating rule probabilities. (a)and (b) are computed incrementally in a singleleft-to-right pass over the input. Our algorithmcompares favorably to standard bottom-up parsingmethods for SCFGs in that it works efficiently onsparse grammars by making use of Earley's top-downcontrol structure. It can process any context-freerule format without conversion to some normal form,and combines computations for (a) through (d) in asingle algorithm. Finally, the algorithm has simpleextensions for processing partially bracketedinputs, and for finding partial parses and theirlikelihoods on ungrammatical inputs. 000854Recovering Guaranteed Performance Service Connections from Single andMultiple FaultsAnindo Banerjea, Colin Parris, Domenico Ferraritr-93-066199311__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-066.pdfFault recovery techniques must be reexamined in thelight of the new guaranteed performance servicesthat networks will support. We investigate thererouting of guaranteed performance serviceconnections on the occurrence of link faults,focussing on the aspects of route selection andestablishment in the network. In a previousinvestigation, we explored some components ofrerouting in the presence of single link faults in thenetwork. In this paper we study the behavior of ourtechniques in the presence of multiple link faults inthe network, and also examine the technique ofretries to improve the success of rerouting. Ourschemes are simulated on a cross-section of networkworkloads, and compared using the criteria of thefraction of the affected traffic that could bererouted, the time to reroute and the amount ofresources consumed in the network. A novel metric,the Queueing Delay Load Index, which captures boththe bandwidth and delay demands made on the network bya connection, is used to present and analyze theresults. 000855A Software Reuse System for C CodesLe Van Huutr-93-067199312__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-067.pdfThis paper presents PRASSY, a hypertext system forthe storage and retrieval of procedure source codes,on the basis of the semantics of their comments. Theobjective of the system is to provide the programdeveloper with the possibility of retrieving andreusing the source code of C subroutines that havebeen previously built by his colleagues or that arealready present in the system. The approach adoptedby PRASSY is the analysis of the source code commentsand of the specification documents written innatural language, in order to extract indexinginformation. Such information is organized in ahypertext structure and the browsing mechanism isused by the user to select reusable softwarecomponents. The system provides a way for measuringthe semantic similarity between the userrequirements and the candidate node to be selected.The paper describes the system's architecture andfunctionalities. Some examples of the userinterface and the browsing mechanisms are reported.Finally, it describes the algorithm proposed byAragon-Ramirez and Paice and adopted by PRASSY fordefining the semantic similarity among phrasesexpressed in natural language.

Keywords: hypertext, software reuse, semantic phrases similarity 000856Lexical Modeling in a Speaker Independent Speech Understanding SystemCharles Clayton Wooterstr-93-068199311__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-068.pdfThis thesis presents an algorithm for theconstruction of models that attempt to capture thevariation that occurs in the pronunciations of wordsin spontaneous (i.e., non-read) speech. A techniquefor developing alternate pronunciations of wordsand then estimating the probabilities of thealternate pronunciations is presented.Additionally, we describe the development andimplementation of a spoken-language understandingsystem called the Berkeley Restaurant Project(BeRP). Multiple pronunciation word modelsconstructed using the algorithm proposed in thisthesis are evaluated within the context of the BeRPsystem. The results of this evaluation show that theexplicit modeling of variation in the pronunciationof words improves the performance of both the speechrecognition and the speech understandingcomponents of the BeRP system. 000857On the Definition of SpeedupWolfgang Erteltr-93-069199311__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-069.pdfWe propose an alternative definition for the speedupof parallel algorithms. Let A be a sequentialalgorithm and B a parallel algorithm for solving thesame problem. If A and/or B are randomized or if we areinterested in their performance on a probabilitydistribution of problem instances, the runningtimes are described by random variables T_A and T_B.The speedup is usually defined as E[T_A]/E[T_B]where E is the arithmetic mean. This notion of speedupdelivers just a number, i.e. much information aboutthe distribution is lost. For example, there is novariance of the speedup. To define a measure forpossible fluctuations of the speedup, a new notion ofspeedup is required. The basic idea is to definespeedup as M(T_A/ T_B) where the functional form of Mhas to be determined. Also, we argue that in many casesM(T_A/T_B) is more informative than E[T_A]/E[T_B]for a typical user of A and B. We present a set ofintuitive axioms that any speedup functionM(T_A/T_B) must fulfill and prove that the geometricmean is the only solution. As a result, we now have auniquely defined speedup function that will allowthe user of an improved system to talk about theaverage performance improvement as well as about itspossible variations. 000858An Alphabet-Independent Optimal Parallel Search for Three DimensionalPatternsMarek Karpinski, Wojciech Ryttertr-93-070199311__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-070.pdfWe give an alphabet-independent optimal parallelalgorithm for the searching phase of threedimensional pattern- matching. All occurrences of athree dimensional pattern P of shape m x m x m in a text Tof shape n x n x n are to be found. Our algorithm works inlog m time with O(N log m) processors of a CREW PRAM,where N = n^3. The searching phase in three dimensionsexplores classification of two- dimensionalperiodicities of the cubic pattern. Some newprojection techniques are developed to deal withthree dimensions. The periodicities of the patternwith respect to its faces are investigated. Thenonperiodicities imply some sparsenessproperties, while periodicities imply otherspecial useful properties (i.e. monotonicity) ofthe set of occurrences. Both types of properties areuseful in deriving an efficient algorithm.

Thesearch phase is preceeded by the preprocessing phase(computation of the witness table). Our main resultsconcern the searching phase, however we presentshortly a new approach to the second phase also.Usefullness of the dictionaries of basic factors(DBF's), see [CR 91], in the computation of the threedimensional witness table is presented. The DBFapproach gains simplicity at the expense of a smallincrease in time. It gives a (nonoptimal) O(log m)time algorithm using m processors of a CRCW PRAM. Thealphabet-independent optimal preprocessing isvery complex even in the case of two dimensions, see[GP 92]. For large alphabets the DBF's giveassymptotically the same complexity as the(alphabet-dependent) suffix trees approach (butavoids suffix trees and is simpler).

However, thebasic advantage of the DBF approach is its simplicityof dealing with three (or more) dimensions.

Thealgorithm can be easily adjusted to the case ofunequally sided patterns. 000859Lower Bounds on Testing Membership to a Polyhedron by Algebraic DecisionTreesDima Grigoriev, Marek Karpinski, Nicolai Vorobjovtr-93-071199311__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-071.pdfWe describe a new method of proving lower bounds on thedepth of algebraic decision trees and apply it toprove a lower bound Omega(log N) for testingmembership to a convex polyhedron having N facets ofall dimensions. This bound apparently does notfollow from the methods developed by M. Ben-Or, A.Bjoerner, L. Lovasz, A. Yao ([B 83], [BLY 92]) becausethe topological invariants used in these methodsbecome trivial for the convex polyhedra. 000860Software Protection and Simulation on Oblivious RAMsOded Goldreich, Rafail Ostrovskytr-93-072199311__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-072.pdfSoftware protection is one of the most importantissues concerning computer practice. There existmany heuristics and ad-hoc methods for protection,but the problem as a whole has not received thetheoretical treatment it deserves. In this paper weprovide theoretical treatment of softwareprotection. We reduce the problem of softwareprotection to the problem of efficient simulation on{\em oblivious\/} RAM.

A machine is {\emoblivious\/} if the sequence in which it accessesmemory locations is equivalent for any two inputswith the same running time. For example, an obliviousTuring Machine is one for which the movement of theheads on the tapes is identical for each computation.(Thus, it is independent of the actual input.) {\emWhat is the slowdown in the running time of anymachine, if it is required to be oblivious?\/} In 1979Pippenger and Fischer showed how a two-tape {\emoblivious\/} Turing Machine can simulate, on-line,a one-tape Turing Machine, with a logarithmicslowdown in the running time. We show an analogueresult for the random-access machine (RAM) model ofcomputation. In particular, we show how to do anon-line simulation of an arbitrary RAM input by aprobabilistic {\em oblivious\/} RAM with apoly-logarithmic slowdown in the running time. Onthe other hand, we show that a logarithmic slowdown isa lower bound. 000861One-Way Functions are Essential for Non-Trivial Zero-KnowledgeRafail Ostrovsky, Avi Wigdersontr-93-073199311__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-073.pdfIt was known that if one-way functions exist, thenthere are zero-knowledge proofs for every languagein $\PSPACE$. We prove that unless very {\em weak}one-way functions exist, Zero-Knowledge proofs canbe given only for languages in $\BPP$. Foraverage-case definitions of $\BPP$ we prove ananalogous result under the assumption that {\emuniform} one-way functions do not exist.

Thus, veryloosely speaking, zero--knowledge is either {\emuseless} (exists only for ``easy'' languages), or{\em universal} (exists for every provablelanguage). 000862How and When to Be UniqueShay Kutten, Rafail Ostrovsky, Boaz Patt-Shamirtr-93-074199311__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-074.pdfOne of the fundamental problems in distributedcomputing is how identical processors withidentical local memory can choose unique IDsprovided they can flip a coin. The variant consideredin this paper is the asynchronous shared memory model(atomic registers), and the basic correctnessrequirement is that upon termination the processesmust always have unique IDs.

We study this problemfrom several viewpoints. On the positive side, wepresent the first protocol that solves the problemand terminates with probability 1. The protocolterminates in (optimal) $O(\log n)$ expected time,using $O(n)$ shared memory space, where $n$ is thenumber of participating processes. On the negativeside, we show that no protocol can terminate withprobability 1 if $n$ is unknown, and that nofinite-state protocol can terminate withprobability 1 if the schedule is non-oblivious(i.e., may depend on the history of the sharedvariable).

We also discuss the dynamic setting(where processes may join and leave the systemdynamically), and give a deterministic protocol forthe read-modify-write model that needs only 3 sharedbits. 000863Matching nuts and boltsNoga Alon, Manuel Blum, Amos Fiat, Sampath Kannan, Moni Naor, Rafail Ostrovskytr-93-075199311__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-075.pdfWe describe a procedure which may be helpful to anydisorganized carpenter who has a mixed pile of boltsand nuts and wants to find the corresponding pairs ofbolts and nuts. The procedure uses our (and thecarpenter's) ability to construct efficientlyhighly expanding graphs. The problem considered isgiven a collection of $n$ bolts of distinct widths and$n$ nuts such that there is a 1-1 correspondencebetween the nuts and bolts. The goal is to find for eachbolt its corresponding nut by comparing nuts to boltsbut not nuts to nuts or bolts to bolts. Our objective isto minimize the number of operations of this kind (aswell as the total running time). The problem has arandomized algorithm similar to Quicksort. Our mainresult is an $n (\log n)^{O(1)}$-time {\emdeterministic} algorithm, based on expandergraphs, for matching the bolts and the nuts. 000864Any Non-Private Boolean Function Is Complete For Private Multi-PartyComputationsEyal Kushilevitz, Silvio Micali, Rafail Ostrovskytr-93-076199311__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-076.pdfLet $g$ be an $n$-argument boolean function. Supposewe are given a {\em black-box\/} for $g$, to which $n$honest-but-curious players can secretly giveinputs and it broadcasts the result of operating $g$on these inputs to all the players. We say that $g$ is{\em complete \/} (for multi-party privatecomputations) if for {\em every\/} function $f$, the$n$ players can compute the function $f$$n$-privately, given the black-box for $g$. In thispaper, we characterize the boolean functions whichare complete: we show that a boolean function $g$ iscomplete if and only if $g$ itself cannot be computed$n$-privately (when there is no black-boxavailable). Namely, for boolean functions, thenotions of {\bf completeness\/} and {\bf$n$-privacy} are {\em complementary\/}. On theother hand, for non-boolean functions, we show thatthis two notions are {\em not\/} complementary. Ourresult can be viewed as a generalization (formulti-party protocols and for $(n\geq 2)$-argumentfunctions) of the two-party case, where it was knownthat two-argument functions which contain``embedded-OR'' are complete. 000865A Cognitive Model of Sentence Interpretation: the Construction GrammarapproachDaniel Jurafskytr-93-077199312__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-077.pdfThis paper describes a new,psychologically-plausible model of human sentenceinterpretation, based on a new model of linguisticstructure, Construction Grammar. This on-line,parallel, probabilistic interpreter accounts for awide variety of psycholinguistic results on lexicalaccess, idiom processing, parsing preferences, andstudies of gap-filling and other valenceambiguities, including various frequency effects.We show that many of these results derive from thefundamental assumptions of Construction Grammarthat lexical idioms, idioms, and syntacticstructures are uniformly represented asgrammatical constructions, and argue for the use ofprobabilistically-enriched grammars andinterpreters as models of human knowledge of andprocessing of language. 000866An Evaluation of Burst-level Bandwidth Reservation Methods in WANEnvironmentsMakiko Yoshida, Chinatsu Ikeda, Hiroshi Suzukitr-93-078199402__000006This paper shows the evaluation results of fastbandwidth reservation (FRP) methods applied tobursty traffics, e.g. a large file transfer, in ATMnetworks with long propagation delay. Thesetraffics require a large bandwidth for a short time,i.e., bursty characteristics and a strict cell lossquality. For these traffics, an FRP instead of a calllevel bandwidth resservation is effective toutilize network resources under guaranteed QoS.Comparison among FRP methods with peak rate controlsin terms of a transmission completion time is carriedout under a short and long propagation delay. Wecompare two types of FRP and FRPs with three adaptivepeak rate controls.

Evaluation results show thatconfirmed type FRP is preferable to uncomfirmed typeFRP. In addition, we see from the results that FRP ispreferable to unconfirmed type FRP. In addition, wesee from the results that FRP with peak rate controlusing network availability information provids theshortest transmission completion time under lightload conditions. However, the results also show thatFRP with simple peak rate control using ACK/NACKprovides fair transmission completion time underheavy load and long propagation delay conditions. 000867On a Sublinear Time Parallel Construction of Optimal Binary Search TreesMarek Karpinski, Wojciech Ryttertr-93-079199312__000006We design an efficient sublinear time parallelconstruction of optimal binary search trees. Theefficiency of the parallel algorithm corresponds toits total work (the product time X processors). Ouralgorithm works in O(n_1-e) log(n)) time with thetotal work O(n_2+2e), for an aritrarily smallconstant 0< e less than or equal to one half. This isoptimal within factor n_2e with respect to the bestknown sequential algorithm given by Knuth, whichneeds only O(n_2) time due to a monotonicity propertyof optimal binary search trees, see {6}). It isunknown how to explore this property in an efficientNC construction of binary search trees. Here we showthat it can be effectively used in sublinear timeparallel computation. Our improvement also relieson the use (in independently processed smallsubcomputations) of the parallelism present in theKnuth's algorithm. The best known sublinear timealgorithms for the construction of binary searchtrees (as an instance of a more general problem) haveO(n_3) work for time larger than n_3/4, see {3} and{7}. For time square root of (n) these algorithms needn_4 work, while our algorithm needs for this time onlyn_3 work, thus improving the known algorithms by alinear factor. Also if time is O(n_1-e) and e is verysmall our improvement is close to O(n). Suchimprovement is similar to the one implied by themonotonicity property in sequential computations(from n_3 sequential time for a more general dynamicprogramming problem to n_2 time for the special caseof optimal binary search trees). 000868Dynamic Programming in a Generalized Decision ModelUlrich Huckenbecktr-93-080199312__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-080.pdf(Pages: 40) We present two dynamic programmingstrategies for a general class of decisionprocesses. Each of these algorithms includes amongothers the following graph theoretic optimizationalgorithms as special cases:

In our general decision model,we define several structural properties of costmeasures in order to formulate sufficientconditions for the correctness of our algorithms.

Our first algorithm works as fast as the originalFord-Bellman Strategy and the Greedy Method,respectively. Our second algorithm solves a largerclass of optimization problems than our first searchstrategy. 000869On Valve Adjustments that Interrupt all s-t-Paths in a DigraphUlrich Huckenbecktr-93-081199312__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-081.pdf(Pages: 15) When searching a path in a digraph,usually the following situation is given: Every nodev may be entered by an arbitrary incoming arc (u,v),and v may be left by an arbitrary outgoing arc (v,w).

Inthis paper, however, we consider graphs with valvenodes, which cannot arbitrarily be entered and left.More precisely, a movable valve is installed in eachvalve node v. entering v via (u,v) and leaving it via(v,w) is only possible if the current position of thevalve generates a connection between these two arcs;if, however, the current valve adjustmentinterrupts this connection then every path using thearcs (u,v) and (v,w) is interrupted, too.

Weinvestigate the complexity of the followingproblem:



Given a digraph with valve nodes. Let s and tbe two nodes of this graph.

Does there exist a valveadjustment that interrupts all paths from s to t?

Weshow that this problem can be solved in deterministicpolynomial time if all valve nodes belong to aparticular class of valves; otherwise the problem isNP-complete. 000870All-to-all Broadcast on the CNS-1Silvia M. Müllertr-93-082199312__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1993/tr-93-082.pdfThis study deals with the all-to-all broadcast on theCNS-1. We determine a lower bound for the run time andpresent an algorithm meeting this bound. Since thisstudy points out a bottleneck in the networkinterface, we also analyze the performance ofalternative interface designs. Our analyses arebased on a run time model of the network.

Keywords: CNS, all-to-all broadcast, transfer, performance analysis, parallelization. 000871Surface Learning with Applications to Lip-ReadingChristoph Bregler, Stephen Omohundrotr-94-001199401__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1994/tr-94-001.pdfMost connectionist research has focused on learningmappings from one space to another (eg.classification and regression). This paperintroduces the more general task of learningconstraint surfaces. It describes a simple butpowerful architecture for learning andmanipulating nonlinear surfaces from data. Wedemonstrate the technique on low dimensionalsynthetic surfaces and compare it to nearestneighbor approaches. We then show its utility inlearning the space of lip images in a system forimproving speech recognition by lip reading. Thislearned surface is used to improve the visualtracking performance during recognition. 000872"Eigenlips" for Robust Speech RecognitionChristoph Bregler, Yochai Konigtr-94-002199401__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1994/tr-94-002.pdfIn this study we improve the performance of a hybridconnectionist speech recognition system byincorporating visual information about thecorresponding lip movements. Specifically, weinvestigate the benefits of adding visual featuresin the presence of additive noise and crosstalk(cocktail party effect). Our study extends previousexperiments by using a new visual front end, and analternative architecture for combining the visualand acoustic information. Furthermore, we haveextended our recognizer to a multi-speaker,connected letters recognizer. Our results show asignificant improvement for the combinedarchitecture (acoustic and visual information)over just the acoustic system in the presence ofadditive noise and crosstalk. 000873Best-first Model Merging for Hidden Markov Model InductionAndreas Stolcke, Stephen M. Omohundrotr-94-003199401__000006ftp://ftp.icsi.berkeley.edu/pub/techreports/1994/tr-94-003.pdfThis report describes a new technique for inducingthe structure of Hidden Markov Models from data whichis based on the general `model merging' strategy(Omohundro 1992). The process begins with a maximumlikelihood HMM that directly encodes the trainingdata. Successively more general models are producedby merging HMM states. A Bayesian posteriorprobability criterion is used to determine whichstates to merge and when to stop generalizing. Theprocedure may be considered a heuristic search forthe HMM structure with the highest posteriorprobability.

We discuss a variety of possible priorsfor HMMs, as well as a number of approximations whichimprove the computational efficiency of thealgorithm. We studied three applications toevaluate the procedure. The first compares themerging algorithm with the standard Baum-Welchap