Articles
![]() |
11/30/2018--
11/30/2018
HEP Software Foundation Community White Paper Working Group -- Data Organization, Management and Access (DOMA)
Without significant changes to data organization, management, and access
(DOMA), HEP experiments will find scientific output limited by how fast data
can be accessed and digested by computational resources. In this white paper we
discuss challenges in DOMA that HEP experiments, such as the HL-LHC, will face
as well as potential ways to address them. A research and development timeline
to assess these changes is also proposed.
Dario Berzano
Riccardo Maria Bianchi
Ian Bird
Brian Bockelman
Simone Campana
Kaushik De
Dirk Duellmann
Peter Elmer
Robert Gardner
Vincent Garonne
Claudio Grandi
Oliver Gutsche
Andrew Hanushevsky
Burt Holzman
Bodhitha Jayatilaka
Ivo Jimenez
Michel Jouvin
Oliver Keeble
Alexei Klimentov
Valentin Kuznetsov
Eric Lancon
Mario Lassnig
Miron Livny
Carlos Maltzahn
Shawn McKee
Dario Menasce
Andrew Norman
Jim Pivarski
Benedikt Riedel
Markus Schulz
Horst Severini
Michael Sevilla
Elizabeth Sexton-Kennedy
Eric Vaandering
Ilija Vukotic
Noah Watkins
Torre Wenaus
Frank Wuerthwein
05/07/1999--
11/05/1997
Cos(M)ological Solutions in M- and String Theory
We consider solutions to the cosmological equations of motion in 11
dimensions with and without 4-form charges. We show explicitly the
correspondence between some of these solutions and known solutions in 10
dimensional string gravity. New solutions involving combinations of 4-form
charges are explored. We also speculate on the possibility of removing
curvature singularities present in 10D theories by oxidizing to 11D.
10/06/2008--
09/18/2008
Beauty photoproduction using decays into electrons at HERA
Photoproduction of beauty quarks in events with two jets and an electron
associated with one of the jets has been studied with the ZEUS detector at HERA
using an integrated luminosity of 120 pb-1. The fractions of events containing
b quarks, and also of events containing c quarks, were extracted from a
likelihood fit using variables sensitive to electron identification as well as
to semileptonic decays. Total and differential cross sections for beauty and
charm production were measured and compared with next-to-leading-order QCD
calculations and Monte Carlo models.
05/08/2019--
11/02/2018
Beyond Equal-Length Snippets: How Long is Sufficient to Recognize an Audio Scene?
Due to the variability in characteristics of audio scenes, some scenes can
naturally be recognized earlier than others. In this work, rather than using
equal-length snippets for all scene categories, as is common in the literature,
we study to which temporal extent an audio scene can be reliably recognized
given state-of-the-art models. Moreover, as model fusion with deep network
ensemble is prevalent in audio scene classification, we further study whether,
and if so, when model fusion is necessary for this task. To achieve these
goals, we employ two single-network systems relying on a convolutional neural
network and a recurrent neural network for classification as well as early
fusion and late fusion of these networks. Experimental results on the
LITIS-Rouen dataset show that some scenes can be reliably recognized with a few
seconds while other scenes require significantly longer durations. In addition,
model fusion is shown to be the most beneficial when the signal length is
short.
10/23/2019--
05/06/2019
MixMatch: A Holistic Approach to Semi-Supervised Learning
Semi-supervised learning has proven to be a powerful paradigm for leveraging
unlabeled data to mitigate the reliance on large labeled datasets. In this
work, we unify the current dominant approaches for semi-supervised learning to
produce a new algorithm, MixMatch, that works by guessing low-entropy labels
for data-augmented unlabeled examples and mixing labeled and unlabeled data
using MixUp. We show that MixMatch obtains state-of-the-art results by a large
margin across many datasets and labeled data amounts. For example, on CIFAR-10
with 250 labels, we reduce error rate by a factor of 4 (from 38% to 11%) and by
a factor of 2 on STL-10. We also demonstrate how MixMatch can help achieve a
dramatically better accuracy-privacy trade-off for differential privacy.
Finally, we perform an ablation study to tease apart which components of
MixMatch are most important for its success.
12/18/2019--
12/18/2019
Incremental ELMVIS for unsupervised learning
An incremental version of the ELMVIS+ method is proposed in this paper. It
iteratively selects a few best fitting data samples from a large pool, and adds
them to the model. The method keeps high speed of ELMVIS+ while allowing for
much larger possible sample pools due to lower memory requirements. The
extension is useful for reaching a better local optimum with greedy
optimization of ELMVIS, and the data structure can be specified in
semi-supervised optimization. The major new application of incremental ELMVIS
is not to visualization, but to a general dataset processing. The method is
capable of learning dependencies from non-organized unsupervised data -- either
reconstructing a shuffled dataset, or learning dependencies in complex
high-dimensional space. The results are interesting and promising, although
there is space for improvements.
09/15/2019--
08/23/2019
Design choices for productive, secure, data-intensive research at scale in the cloud
We present a policy and process framework for secure environments for
productive data science research projects at scale, by combining prevailing
data security threat and risk profiles into five sensitivity tiers, and, at
each tier, specifying recommended policies for data classification, data
ingress, software ingress, data egress, user access, user device control, and
analysis environments. By presenting design patterns for security choices for
each tier, and using software defined infrastructure so that a different,
independent, secure research environment can be instantiated for each project
appropriate to its classification, we hope to maximise researcher productivity
and minimise risk, allowing research organisations to operate with confidence.
Diego Arenas
Jon Atkins
Claire Austin
David Beavan
Alvaro Cabrejas Egea
Steven Carlysle-Davies
Ian Carter
Rob Clarke
James Cunningham
Tom Doel
Oliver Forrest
Evelina Gabasova
James Geddes
James Hetherington
Radka Jersakova
Franz Kiraly
Catherine Lawrence
Jules Manser
Martin T. O'Reilly
James Robinson
Helen Sherwood-Taylor
Serena Tierney
Catalina A. Vallejos
Sebastian Vollmer
Kirstie Whitaker
02/06/2021--
10/18/2020
Self-Attention Generative Adversarial Network for Speech Enhancement
Existing generative adversarial networks (GANs) for speech enhancement solely
rely on the convolution operation, which may obscure temporal dependencies
across the sequence input. To remedy this issue, we propose a self-attention
layer adapted from non-local attention, coupled with the convolutional and
deconvolutional layers of a speech enhancement GAN (SEGAN) using raw signal
input. Further, we empirically study the effect of placing the self-attention
layer at the (de)convolutional layers with varying layer indices as well as at
all of them when memory allows. Our experiments show that introducing
self-attention to SEGAN leads to consistent improvement across the objective
evaluation metrics of enhancement performance. Furthermore, applying at
different (de)convolutional layers does not significantly alter performance,
suggesting that it can be conveniently applied at the highest-level
(de)convolutional layer with the smallest memory overhead.
06/10/2024--
10/16/2023
Diamond-lattice photonic crystals assembled from DNA origami
Colloidal self-assembly allows rational design of structures on the
micrometer and submicrometer scale. One architecture that can generate complete
3D photonic band gaps is the diamond cubic lattice, which has remained
difficult to realize at length scales comparable to the wavelength of visible
or ultraviolet light. Here, we demonstrate three-dimensional photonic crystals
self-assembled from DNA origami that act as precisely programmable patchy
colloids. Our DNA-based nanoscale tetrapods crystallize into a rod-connected
diamond cubic lattice with a periodicity of 170 nm. This structure serves as a
scaffold for atomic layer deposition of high refractive index materials such as
TiO$_2$, yielding a tunable photonic band gap in the near-ultraviolet.
08/21/2000--
08/21/2000
An HST Lensing Survey of X-ray Luminous Galaxy Clusters: I. A383
We analyse the mass distribution in the core of A383 (z=0.188), one of 12
X-ray luminous clusters at z~0.2 selected for a comprehensive and unbiased
study of the mass distribution in massive clusters. Deep HST imaging reveals a
wide variety of gravitationally lensed features in A383, including a giant arc
formed from the strongly-lensed images of 2 background galaxies, 2 radial arcs,
several multiply-imaged arcs and numerous arclets. Based upon the constraints
from the various lensed features, we construct a detailed mass model for the
central regions of the cluster, taking into account both the cluster-scale
potential and perturbations from individual cluster galaxies. Keck spectroscopy
of one component of the giant arc identifies it as a star-forming galaxy at
z=1.01 and provides an accurate measurement of the cluster mass within the
radius of the giant arc (65kpc) of (3.5+/-0.1)*10^13 Mo. Using the weak shear
measured from our HST observations we extend our mass model to determine a mass
of (1.8+/-0.2)*10^14 Mo within a radius of 250kpc. On smaller scales we employ
the radial arcs as probes of the shape of the mass distribution in the cluster
core (r<20kpc), and find that the mass profile is more peaked than a single NFW
profile. The optical and X-ray properties of A383 indicate the presence of a
central cooling flow, for which we derive a mass deposition rate of >200 Mo/yr.
We also use the X-ray emission from A383 to obtain independent estimates of the
total mass within projected radii of 65 and 250kpc: (4.0+/-1.4)*10^13 Mo and
(1.2+/-0.5)*10^14 Mo, which are consistent with the lensing measurements.
[Abridged]
|
|