Alumni Project

Particle Physics Data Grid (PPDG): An Interdisciplinary Collaboration Contributing to and Benefiting from Shared Grid Technology

PIs: Richard Mount, SLAC, Miron Livny, Wisconsin, Harvey Newman, Caltech
Steering Committee: John Huth, Harvard (ATLAS), Tim Adye, RAL (BaBar), Lothar Bauerdick, FNAL (CMS),
Lee Lueking, FNAL (D0), Chip Watson, TJNAF, Jerome Lauret, BNL (STAR),
Miron Livny, Wisconsin (Condor), Jennifer Schopf, ANL (Globus), Ian Foster, ANL (Globus),
Reagan Moore, SDSC (SRB), Arie Shoshani, LBNL (SRM)
Coordinators: Ruth Pordes, FNAL, Doug Olson, LBNL
Liaisons: Paul Avery (iVDGL), Larry Price (HICB), Mike Wilde(GriPhyN), Torre Wenaus, Ian Bird (LCG)
(www.ppdg.net)

Summary

The close collaboration of computer science teams with the physicists and software engineers in the PPDG collaboration is contributing extensions to, and deployment of, Grid software technologies of benefit to the larger scientific community. The improved technology is then shared with other SciDAC projects and collaborators. Some samples of these benefits are given below.

The PPDG collaboration, consisting of four leading grid middleware groups and six high-energy and nuclear physics experiments, has proven to be very effective at identifying, developing, and adopting improvements to grid middleware beneficial not only across different physics experiments but for the broader science community as well. The interaction with other SciDAC projects provides an important mechanism for mutual sharing of technology improvements across the science community.

PPDG has contributed many improvements to the Globus Toolkit1 in the form of requirements specifications, testing and hardening of many components including Grid Security Infrastructure, GridFTP, Replica Location Service, Monitoring and Discovery Service, GRAM job submission, and the Community Authorization Service.

PPDG had a similar impact on technologies and middleware developed by the Condor project2 – most noticeably the grid enabled job manager (Condor-G) and workflow management. Development of new features in Globus and Condor are supported by the SciDAC projects "A High-Performance Data Grid Toolkit" and "Security and Policy for Group Collaboration." Improvements to the Globus Toolkit and Condor are shared across the whole science community as they have been adopted by most grid applications as key infrastructure.

Collaboration with PPDG has led to the addition of key features to the Storage Resource Manager3 (SRM), including space reservation and management needs of disk and tape storage resources. Based on the extensive PPDG experience, the SRM technology has been used by other SciDAC projects, the Earth Science Grid (ESG), and Lattice QCD (LQCD). ESG benefits from an interface with High-Performance Storage System (HPSS) as well as a version that works with a large shared disk. In LQCD, SRM-compliant storage interfaces are being deployed at JLab and the international LQCD grid. SRM was also easily adapted to the National Center for Atmospheric Research (NCAR) mass storage system. Development of SRM is supported by the SciDAC middleware "Storage Resource Management for Data Grid Applications" project and includes technology and optimizations from the SciDAC-supported "Scientific Data Management Integrated Software Infrastructure Center" (SDM ISIC).

Improvements to the Storage Resource Broker4 (SRB) from work in PPDG includes extensions to the metadata catalog based on requirements of PPDG experiments as well as abstraction of the data transport interfaces to support the diverse data access technologies used across different applica-tion communities. Ongoing work includes developing the capability to federate SRB metadata catalogs useful to the globally distributed nature of large physics experiment data handling needs.

PPDG collaborated closely with the DOE Science Grid to establish a Registration and Certificate Authority5 (CA) for the science research community, called the DOEGrids CA. PPDG and its colleagues have been early users of the facility, and helped extend adoption of these certificates by the wider particle physics community in the US, and developed mutual trust with the European Data Grid6 project in Europe. The DOEGrids CA now supports numerous communities including ESG and the FusionGrid.

A specific example of benefits being shared with other science domains is the work on job scheduling for the D0 experiment in PPDG has led to extensions to Condor-G that facilitate late binding between a job and a grid resource. This new feature was used to make the task of analyzing very large collections of proteins by the Basic Local Alignment Search Tool (BLAST) tool more effective.

In addition to the shared technology benefits accomplished to date in PPDG, a number of additional opportunities have been identified. Development of a web services interface to Condor-G from the SciDAC-supported "Pervasive Collaborative Computing Environment" will likely be of benefit to PPDG applications in their future transition to Open Grid Services Architecture compliant grids. The high-dimensional indexing techniques from the SDM ISIC are of potential benefit to PPDG experiments as a flexible, more efficient way of getting the desired subsets of events from the Grid in the data analysis phase. Discussions with the SciDAC project for "Distributed Security Architectures" are helpful as the resource authorization needs of the physics experiments’ grids become better defined. Collaboration with the "Bandwidth Estimation, Measurement Methodologies, and Applications" project contributes to the PPDG monitoring group and development of future plans for performance monitoring and troubleshooting of end-to-end applications.

1 http://www.globus.org/toolkit
2 http://www.cs.wisc.edu/condor
3 http://sdm.lbl.gov/srm-wg
4 http://www.npaci.edu/DICE/SRB
5 http://www.doegrids.org/
6 http://www.eu-datagrid.org/

back to project page

 


Home  |  ASCR  |  Contact Us  |  DOE disclaimer