Articles
![]() |
08/19/2016--
08/19/2016
Thrill: High-Performance Algorithmic Distributed Batch Data Processing with C++
We present the design and a first performance evaluation of Thrill -- a
prototype of a general purpose big data processing framework with a convenient
data-flow style programming interface. Thrill is somewhat similar to Apache
Spark and Apache Flink with at least two main differences. First, Thrill is
based on C++ which enables performance advantages due to direct native code
compilation, a more cache-friendly memory layout, and explicit memory
management. In particular, Thrill uses template meta-programming to compile
chains of subsequent local operations into a single binary routine without
intermediate buffering and with minimal indirections. Second, Thrill uses
arrays rather than multisets as its primary data structure which enables
additional operations like sorting, prefix sums, window scans, or combining
corresponding fields of several arrays (zipping). We compare Thrill with Apache
Spark and Apache Flink using five kernels from the HiBench suite. Thrill is
consistently faster and often several times faster than the other frameworks.
At the same time, the source codes have a similar level of simplicity and
abstraction
Timo Bingmann
Michael Axtmann
Emanuel Jöbstl
Sebastian Lamm
Huyen Chau Nguyen
Alexander Noe
Sebastian Schlag
Matthias Stumpp
Tobias Sturm
Peter Sanders
01/10/2005--
01/03/2005
Dispersive estimates for Schroedinger operators: A survey
We present some old and new results on dispersive estimates for Schroedinger
equations.
Wilhelm Schlag
01/09/2005--
01/09/2005
Stable manifolds for all monic supercritical NLS in one dimension
We show that all supercritical monic focusing NLS in one space dimension
exhibit asymptotic stability of perturbed standing waves provided the
perturbations are chosen on a small hypersuface in a suitable space.
Joachim Krieger
Wilhelm Schlag
09/04/2005--
09/01/2005
Spectral Theory and Nonlinear PDE: a Survey
This is a slightly modified version of the survey article. Minor changes to
the text and the references have been made.
Wilhelm Schlag
08/15/2007--
08/15/2007
A version of the proof for Peres-Schlag's theorem on lacunary sequences
We present a proof of a multidimensional version of Peres-Schlag's theorem on
Diophantine approximations with lacunary sequences.
Nikolai G. Moshchevitin
02/06/2018--
02/06/2018
Intertwining wave operators, Fourier restriction, and Wiener theorems
We review some of the literature on intertwining wave operators, as far as it
relates to Tosio Kato's stationary method based on the resolvent. Techniques
from Fourier restriction and Wiener algebras are emphasized.
W. Schlag
03/26/2020--
03/26/2020
Advanced Flow-Based Multilevel Hypergraph Partitioning
The balanced hypergraph partitioning problem is to partition a hypergraph
into $k$ disjoint blocks of bounded size such that the sum of the number of
blocks connected by each hyperedge is minimized. We present an improvement to
the flow-based refinement framework of KaHyPar-MF, the current state-of-the-art
multilevel $k$-way hypergraph partitioning algorithm for high-quality
solutions. Our improvement is based on the recently proposed HyperFlowCutter
algorithm for computing bipartitions of unweighted hypergraphs by solving a
sequence of incremental maximum flow problems. Since vertices and hyperedges
are aggregated during the coarsening phase, refinement algorithms employed in
the multilevel setting must be able to handle both weighted hyperedges and
weighted vertices -- even if the initial input hypergraph is unweighted. We
therefore enhance HyperFlowCutter to handle weighted instances and propose a
technique for computing maximum flows directly on weighted hypergraphs.
We compare the performance of two configurations of our new algorithm with
KaHyPar-MF and seven other partitioning algorithms on a comprehensive benchmark
set with instances from application areas such as VLSI design, scientific
computing, and SAT solving. Our first configuration, KaHyPar-HFC, computes
slightly better solutions than KaHyPar-MF using significantly less running
time. The second configuration, KaHyPar-HFC*, computes solutions of
significantly better quality and is still slightly faster than KaHyPar-MF.
Furthermore, in terms of solution quality, both configurations also outperform
all other competing partitioners.
Lars Gottesbüren
Michael Hamann
Sebastian Schlag
Dorothea Wagner
06/16/2021--
06/16/2021
High-Quality Hypergraph Partitioning
This paper considers the balanced hypergraph partitioning problem, which asks
for partitioning the vertices into $k$ disjoint blocks of bounded size while
minimizing an objective function over the hyperedges. Here, we consider the
most commonly used connectivity metric. We describe our open source hypergraph
partitioner KaHyPar which is based on the successful multi-level approach --
driving it to the extreme of one level for (almost) every vertex. Using
carefully designed data structures and dynamic update techniques, this approach
offers a very good time-quality tradeoff. We present two preprocessing
techniques -- pin sparsification using locality sensitive hashing and community
detection based on the Louvain algorithm. The community structure is used to
guide the coarsening process that incrementally contracts vertices.
Portfolio-based partitioning of the contracted hypergraph already achieves good
initial solutions. While reversing the contractions, a combination of
highly-localized direct $k$-way local search and flow-based techniques that
take a more global view, refine the partition to achieve high quality.
Optionally, a memetic algorithm evolves a pool of solution candidates to obtain
even higher quality.
We evaluate KaHyPar on a large set of instances from a wide range of
application domains. With respect to quality, KaHyPar outperforms all
previously considered systems that can handle large hypergraphs such as hMETIS,
PaToH, Mondriaan, or Zoltan. KaHyPar is also faster than most of these systems
except for PaToH which represents a different speed-quality tradeoff. The
results even extend to the special case of graph partitioning, where
specialized systems such as KaHIP should have an advantage.
Sebastian Schlag
Tobias Heuer
Lars Gottesbüren
Yaroslav Akhremtsev
Christian Schulz
Peter Sanders
10/15/2020--
02/06/2020
Multilevel Acyclic Hypergraph Partitioning
A directed acyclic hypergraph is a generalized concept of a directed acyclic
graph, where each hyperedge can contain an arbitrary number of tails and heads.
Directed hypergraphs can be used to model data flow and execution dependencies
in streaming applications. Thus, hypergraph partitioning algorithms can be used
to obtain efficient parallelizations for multiprocessor architectures. However,
an acyclicity constraint on the partition is necessary when mapping streaming
applications to embedded multiprocessors due to resource restrictions on this
type of hardware. The acyclic hypergraph partitioning problem is to partition
the hypernodes of a directed acyclic hypergraph into a given number of blocks
of roughly equal size such that the corresponding quotient graph is acyclic
while minimizing an objective function on the partition.
Here, we contribute the first n-level algorithm for the acyclic hypergraph
partitioning problem. Our focus is on acyclic hypergraphs where hyperedges can
have one head and arbitrary many tails. Based on this, we engineer a memetic
algorithm to further reduce communication cost, as well as to improve
scheduling makespan on embedded multiprocessor architectures. Experiments
indicate that our algorithm outperforms previous algorithms that focus on the
directed acyclic graph case which have previously been employed in the
application domain. Moreover, our experiments indicate that using the directed
hypergraph model for this type of application yields a significantly smaller
makespan.
Merten Popp
Sebastian Schlag
Christian Schulz
Daniel Seemaier
10/21/2020--
10/20/2020
Scalable Shared-Memory Hypergraph Partitioning
Hypergraph partitioning is an important preprocessing step for optimizing
data placement and minimizing communication volumes in high-performance
computing applications. To cope with ever growing problem sizes, it has become
increasingly important to develop fast parallel partitioning algorithms whose
solution quality is competitive with existing sequential algorithms. To this
end, we present Mt-KaHyPar, the first shared-memory multilevel hypergraph
partitioner with parallel implementations of many techniques used by the
sequential, high-quality partitioning systems: a parallel coarsening algorithm
that uses parallel community detection as guidance, initial partitioning via
parallel recursive bipartitioning with work-stealing, a scalable label
propagation refinement algorithm, and the first fully-parallel direct $k$-way
formulation of the classical FM algorithm. Experiments performed on a large
benchmark set of instances from various application domains demonstrate the
scalability and effectiveness of our approach. With 64 cores, we observe
self-relative speedups of up to 51 and a harmonic mean speedup of 23.5. In
terms of solution quality, we outperform the distributed hypergraph partitioner
Zoltan on 95% of the instances while also being a factor of 2.1 faster. With
just four cores,Mt-KaHyPar is also slightly faster than the fastest sequential
multilevel partitioner PaToH while producing better solutions on 83% of all
instances. The sequential high-quality partitioner KaHyPar still finds better
solutions than our parallel approach, especially when using max-flow-based
refinement. This, however, comes at the cost of considerably longer running
times.
Lars Gottesbüren
Tobias Heuer
Peter Sanders
Sebastian Schlag
|
|