"... We introduce a novel algorithm to compute non-negative sparse principal components of positive semidefinite (PSD) matrices. Our algorithm comes with approximation guarantees contingent on the spectral profile of the input matrix A: the sharper the eigenvalue decay, the better the quality of the ap ..."

1 Introduction Problem and Significance. A gapped local similarity between two sequences is a pair of fixed-length substrings, one from each sequence, that align with few mismatches and indels (insertions/deletions). We address the problem of finding all such similarities between two sequences. This is a core problem in bio-sequence similarity search, as it is a variant of the basic local alignment problem [29] with edit distance as the scoring function. Edit distance is simpler than a general scoring function because it treats mismatches and indels via unit costs; nevertheless, it is important and very relevant for comparing genomic DNA sequences, as discussed next.
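
The definition above can be illustrated with a minimal sketch. It assumes the standard dynamic-programming edit distance with unit costs and a brute-force scan over all substring pairs; the names `edit_distance` and `similar_pairs` and the parameters `length` and `k` are hypothetical, and this is not the algorithm proposed in the paper.

```python
def edit_distance(a: str, b: str) -> int:
    """Unit-cost edit distance: each mismatch or indel costs 1."""
    prev = list(range(len(b) + 1))  # distances from "" to prefixes of b
    for i, ca in enumerate(a, start=1):
        curr = [i]  # distance from a[:i] to ""
        for j, cb in enumerate(b, start=1):
            curr.append(min(
                prev[j] + 1,               # delete ca
                curr[j - 1] + 1,           # insert cb
                prev[j - 1] + (ca != cb),  # match (0) or mismatch (1)
            ))
        prev = curr
    return prev[-1]


def similar_pairs(s: str, t: str, length: int, k: int) -> list[tuple[int, int]]:
    """Brute force: start positions (i, j) of fixed-length substrings
    of s and t whose edit distance is at most k (illustration only)."""
    pairs = []
    for i in range(len(s) - length + 1):
        for j in range(len(t) - length + 1):
            if edit_distance(s[i:i + length], t[j:j + length]) <= k:
                pairs.append((i, j))
    return pairs
```

The brute-force scan runs one quadratic-time comparison per pair of start positions, so it is only practical for short sequences.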

, and propose novel estimators for the cases of noisy, missing, and/or dependent data. Many standard approaches to noisy or missing data, such as those using the EM algorithm, lead to optimization problems that are inherently non-convex, and it is difficult to establish theoretical guarantees on practical

License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Building high-confidence regression test suites to validate new system versions is a challenging problem. A model-based approach to building a regression test suite from a given test suite is described. The generated test suite includes every test that will traverse a change made to produce the new version, and consists only of such tests to reduce testing costs. Finite state machines extended with typed variables (EFSMs) are used to model systems, and system changes are mapped to EFSM transition changes: adding, deleting, or replacing EFSM transitions and states. Tests are sequences of input and expected output messages with concrete parameter values over the supported data types. An invariant is formulated to characterize tests whose runtime behavior can be accurately predicted by analyzing their descriptions along with the model. Incremental procedures to efficiently evaluate the invariant and to select tests for regression are developed. Overlaps among the test descriptions are exploited to extend the approach to select multiple tests simultaneously, reducing test selection costs. Effectiveness of the approach is demonstrated by applying it to several protocols, Web services, and model programs extracted from a popular testing benchmark. Our experimental results show that the proposed approach is economical for regression test selection in all these examples, identifying all tests exercising changes more efficiently than brute-force symbolic evaluation.

Abstract. Building high confidence regression test suites to validate the changes performed during system evolution and maintenance is a challenging problem. This paper describes a formal approach that selects every test from a given test suite guaranteed to exercise a given change and discards

"... We consider the problem of embedding one signal (e.g., a digital watermark), within another "host" signal to form a third, "composite" signal. The embedding is designed to achieve efficient tradeoffs among the three conflicting goals of maximizing information-embedding rate, mini ..."

refer to as dither modulation. Using deterministic models to evaluate digital watermarking methods, we show that QIM is "provably good" against arbitrary bounded and fully informed attacks, which arise in several copyright applications, and in particular, it achieves provably better rate

The Cooperative File System (CFS) is a new peer-to-peer read-only storage system that provides provable guarantees for the efficiency, robustness, and load-balance of file storage and retrieval. CFS does this with a completely decentralized architecture that can scale to large systems. CFS servers

"... Models for the processes by which ideas and influence propagate through a social network have been studied in a number of domains, including the diffusion of medical and technological innovations, the sudden and widespread adoption of various strategies in game-theoretic settings, and the effects of ..."

the first provable approximation guarantees for efficient algorithms. Using an analysis framework based on submodular functions, we show that a natural greedy strategy obtains a solution that is provably within 63% of optimal for several classes of models; our framework suggests a general approach
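
The greedy strategy mentioned above can be sketched on a simpler stand-in objective. The sketch assumes set coverage, a standard monotone submodular function, in place of the paper's influence function; `greedy_max_coverage`, `sets`, and `k` are hypothetical names. For objectives of this kind, the classical greedy guarantee is a factor of 1 - 1/e (about 0.632) of the optimum, which matches the 63% figure.

```python
def greedy_max_coverage(sets: dict[str, set[int]], k: int):
    """Pick up to k sets, each time taking the one with the largest
    marginal gain (number of newly covered elements)."""
    chosen: list[str] = []
    covered: set[int] = set()
    for _ in range(min(k, len(sets))):
        candidates = [name for name in sets if name not in chosen]
        # Marginal gain of a candidate = elements it adds beyond `covered`.
        best = max(candidates, key=lambda name: len(sets[name] - covered))
        if not sets[best] - covered:
            break  # no remaining set adds anything new
        chosen.append(best)
        covered |= sets[best]
    return chosen, covered


if __name__ == "__main__":
    example = {"a": {1, 2, 3}, "b": {3, 4}, "c": {4, 5, 6, 7}}
    print(greedy_max_coverage(example, 2))
```

In an influence-maximization setting, the marginal gain would instead be the estimated increase in expected spread from adding a node to the seed set; that estimation step is not shown here.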