Forschungszentrum Karlsruhe in der Helmholtz-Gemeinschaft
Tier 1/2 Meeting and 47. Session GridKa TAB, 2.-3. March 2006
Forschungszentrum Karlsruhe GmbH, Institut für Wissenschaftliches Rechnen, IWR
Hermann-von-Helmholtz-Platz 1, D-76344 Eggenstein-Leopoldshafen
H. Marten
http://www.gridka.de
GridKa plans for SC4 and beyond
Tier 1/2 Meeting and 47th Session of the GridKa TAB, 2.-3.3.2006
LCG Service Deadlines
[Figure: timeline 2006 to 2008 with the markers cosmics, first physics and full physics run]
Service Challenge 4
Pilot Services: stable service from 1 June 06
LHC Service in operation: 1 Oct 06; over the following six months ramp up to full operational capacity & performance
LHC service commissioned: 1 Apr 07
WLCG MB defines a set of High Level Milestones
https://uimon.cern.ch/twiki/pub/LCG/Planning/WLCG_High_Level_Phase2_Plan-20060112.xls
2006
SC4-1 28.02.06 All required software for baseline services deployed and operational at all Tier-1s and at least 20 Tier-2 sites
OPN-2 31.03.06 Tier-0/1 high-performance network operational at CERN and 6 Tier-1s, at least 3 via GEANT.
SC4-2 28.02.06 Use cases and service level support defined for SC4
CAS-1 15.03.06 Castor2 Readiness Review
SC3-4 31.03.06 All services on all Tier-1 sites monitored
SC3-5 31.03.06 Proposal on availability levels specified in Annex 3 of the WLCG MoU
SC4-3 30.04.06 Service Challenge 4 Set-up: Set-up complete and basic service demonstrated, capable of running experiment-supplied packaged test jobs, data distribution tested.
DRC-3 30.04.06 1.0 GB/s data recording demonstration at CERN
SC4-4 31.05.06 Service Challenge 4: Start of stable service phase
SC4-5 30.09.06 Service Challenge 4: Successful completion of service phase
To be shifted by 1 month (see later)
WLCG MB asks Tier-1s to provide site milestone plans
https://uimon.cern.ch/twiki/bin/view/LCG/SitesPlans
https://uimon.cern.ch/twiki/pub/LCG/SitesPlans/FZK_Plan-20060121.xls
Specifies hardware installation and configuration plans (see also
TAB#46). In detail:
03/2006: tape access and I/O optimization tests
03/2006: installation of 2nd 10 Gbps OPN from GridKa to CERN
03/2006: delivery and installation of CPUs
04/2006: dCache server & write pool upgrade (throughput)
04/2006: 2nd tape I/O upgrade
05/2006: disk delivery, installation, configuration, tests
06/2006: start of stable SC4 services
GridKa resources after all upgrades
Experiment  kSI2000  Disk / TB  Tape / TB
Alice           363         59        106
Atlas           250         56         55
CMS             220        120        180
LHCb            194         49         52
BaBar           430        104         50
CDF             120         81        100
Dzero           430        135        300
Compass          80         33         95
Σ              2087        636        938
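The Σ row can be cross-checked with a few lines of Python. This is only a sketch: the numbers are copied from the table above, while the names `resources` and `column_totals` are illustrative and not part of any GridKa tooling. Summing the rounded per-experiment disk values gives 637 TB, one more than the 636 TB on the slide, which presumably reflects rounding of the individual entries.

```python
# Per-experiment resources as listed on the slide:
# (CPU in kSI2000, disk in TB, tape in TB). Names are illustrative.
resources = {
    "Alice":   (363,  59, 106),
    "Atlas":   (250,  56,  55),
    "CMS":     (220, 120, 180),
    "LHCb":    (194,  49,  52),
    "BaBar":   (430, 104,  50),
    "CDF":     (120,  81, 100),
    "Dzero":   (430, 135, 300),
    "Compass": ( 80,  33,  95),
}


def column_totals(table):
    """Sum each column (CPU, disk, tape) over all experiments."""
    return tuple(sum(column) for column in zip(*table.values()))


cpu, disk, tape = column_totals(resources)
print(f"kSI2000: {cpu}, disk: {disk} TB, tape: {tape} TB")
```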
PBSPro fair share after delivery of CPUs in production
1-Apr-2006
Experiment  kSI2000   Share  Percentage
Alice           363  36 300        17.4
Atlas           250  25 000        12.0
CMS             220  22 000        10.5
LHCb            194  19 400         9.3
BaBar           430  43 000        20.6
CDF             120  12 000         5.8
Dzero           430  43 300        20.6
Compass          80   8 000         3.8
The default (test) queue is not handled by the fair share.
These 20-30 CPUs are kept free for test jobs.
49 % LHC, 51 % non-LHC
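The LHC / non-LHC split follows from the kSI2000 column. The sketch below recomputes it under the assumption that the split is proportional to the installed kSI2000 per experiment; the slide does not say how the 49 % and 51 % were derived, and the names `KSI2000`, `LHC_EXPERIMENTS` and `lhc_split` are illustrative.

```python
# kSI2000 per experiment, copied from the fair-share table.
KSI2000 = {
    "Alice": 363, "Atlas": 250, "CMS": 220, "LHCb": 194,
    "BaBar": 430, "CDF": 120, "Dzero": 430, "Compass": 80,
}

# The four LHC experiments; all others count as non-LHC.
LHC_EXPERIMENTS = {"Alice", "Atlas", "CMS", "LHCb"}


def lhc_split(capacity, lhc_names):
    """Return (LHC %, non-LHC %) of the total capacity."""
    total = sum(capacity.values())
    lhc = sum(v for name, v in capacity.items() if name in lhc_names)
    lhc_percent = 100.0 * lhc / total
    return lhc_percent, 100.0 - lhc_percent


lhc_percent, non_lhc_percent = lhc_split(KSI2000, LHC_EXPERIMENTS)
print(f"LHC: {lhc_percent:.1f} %, non-LHC: {non_lhc_percent:.1f} %")
```

With these inputs the LHC experiments hold 1027 of 2087 kSI2000, which rounds to the 49 % and 51 % quoted on the slide.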
WLCG MB asks the TCG to provide a middleware deployment plan
https://uimon.cern.ch/twiki/pub/LCG/Planning/SC4ServicesPlanning_06-01-30.xls
TCG = Technical Coordination Group
Combines user requirements and middleware development plans
(great work done by Flavia Donno).
Resulted in the “SC4 Middleware (deployment) Plan”.
Specifies which middleware component and version will be
deployed for SC4 and when.
Important message:
LCG 2_7_0 released for production end of January 2006
gLite 3.0 released for pre-production end of February 2006
gLite 3.0 released for production end of April 2006
Middleware release schedule 2006 (acc. to Maite Barroso Lopez, EGEE ROC managers meeting, 21-Feb-2006)
[Figure: timeline from February to October 2006. The releases gLite-3.0 and gLite-3.2 each pass through the phases certification, p-ps, deploy and production, followed by a later gLite-3.x release; SC4 and the LHC pilot service are marked on the same time axis.]
Likely not the final schedule!
Middleware deployment plans per site
https://uimon.cern.ch/twiki/pub/LCG/Planning/SC4SiteServicesPlan.xls
• have been prepared by all Tier-1s
• were discussed in Mumbai (SC4 workshop) together with
  • mw development plans
  • mw requirements by LHC VOs (applications)
  • time scales for activities of LHC VOs
• A summary of the Mumbai workshop was
  • prepared by J. Shiers, I. Bird, T. Cass, L. Robertson
  • submitted to LCG MB for comments
  • submitted to LCG GDB for comments and approval
Middleware deployment (gLite 3.0) at GridKa
Service / package | Needed by | Pre-production: gLite service deployed | Pre-production: installation gLite 3.0 | Pre-production: tests by all VOs | Production: LCG service deployed | Production: installation gLite 3.0 | Production: SC4
VOMS server | COMPASS & others | -- | -- | -- | (VO server) | May | Jun-Sep
VOMS clients | All VOs | | 1.3.-20.3. | 21.3.-30.4. | not tested | May | Jun-Sep
Myproxy | All VOs | | -- | -- | X | May | Jun-Sep
Site BDII | All VOs | | 1.3.-20.3. | 21.3.-30.4. | X | May | Jun-Sep
Top level BDII | All VOs | | -- | -- | X | May | Jun-Sep
FTS | Alice, Atlas, LHCb | | 1.3.-20.3. | 21.3.-30.4. | X | May | Jun-Sep
LFC | Alice, Atlas, CMS | | 1.3.-20.3. | 21.3.-30.4. | X | May | Jun-Sep
RB | All VOs | | 1.3.-20.3. | 21.3.-30.4. | X | May | Jun-Sep
CE | All VOs | X | 1.3.-20.3. | 21.3.-30.4. | X | May | Jun-Sep
SE | All VOs | | 1.3.-20.3. | 21.3.-30.4. | X | May | Jun-Sep
SRM / dCache | All VOs | | ? | 21.3.-30.4. | X | May | Jun-Sep
VOBox | Alice, Atlas | -- | -- | -- | X | May | Jun-Sep
UI | All VOs | | 1.3.-20.3. | 21.3.-30.4. | X | May | Jun-Sep
WN packages | All VOs | X | 1.3.-20.3. | 21.3.-30.4. | X | May | Jun-Sep
Lcg-utils | All VOs | X | 1.3.-20.3. | 21.3.-30.4. | X | May | Jun-Sep
GFAL | All VOs | X | 1.3.-20.3. | 21.3.-30.4. | X | May | Jun-Sep
R-GMA | All VOs | X | 1.3.-20.3. | 21.3.-30.4. | X | May | Jun-Sep
Apel | All VOs | X | 1.3.-20.3. | 21.3.-30.4. | X | May | Jun-Sep
3D DB services | All VOs (incl. SQUID) | | -- | -- | not yet | 27.2.-? / tests | Jun-Sep
LHC activities at GridKa March - June
Month | Who | Pre-production env. | Production environment
March | GridKa | Deployment gLite 3.0 |
March | ALICE | Testing gLite 3.0 | Bulk production at T1/T2; data back to T0
March | ATLAS | Testing gLite 3.0 | 3-4 weeks Mar / Apr T0 tests (not at GridKa)
March | CMS | Testing gLite 3.0 | PhEDEx integration with FTS (development; not at GridKa)
March | LHCb | Testing gLite 3.0 | Start generation of 100M B-physics + 100M bias events
April | GridKa | Support gLite 3.0 | SC4 throughput tests
April | ALICE | Testing gLite 3.0 | First push out of sim. data; reconstruction at T1s
April | ATLAS | Testing gLite 3.0 | See above (not at GridKa)
April | CMS | Testing gLite 3.0 | 10 TB to tape at T1s at 150 MB/s
April | LHCb | Testing gLite 3.0 | Generation of B-physics and bias events continues
May | GridKa | | Deployment of gLite 3.0; main hardware setup
May | ALICE | | --
May | ATLAS | Test distributed operations (cont.) | --
May | CMS | | --
May | LHCb | | --
June | GridKa | Deployment gLite 3.2 | Support for SC4
June | ALICE | Testing gLite 3.2 |
June | ATLAS | Testing gLite 3.2 | Tier-0 test (phase 1) with data distribution to Tier-1s (last 3 weeks)
June | CMS | Testing gLite 3.2 | 2-week re-run of SC3 goals (beginning of month)
June | LHCb | Testing gLite 3.2 | Reconstruction/stripping: 2 TB/day out of CERN; 125 TB on MSS @ Tier-1s
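Two of the figures in this plan lend themselves to a quick sanity check: how long the CMS test of 10 TB to tape takes at 150 MB/s, and what sustained rate the LHCb figure of 2 TB/day corresponds to. The sketch below assumes decimal units (1 TB = 10^12 bytes, 1 MB = 10^6 bytes), which the slides do not state; the function names are illustrative.

```python
SECONDS_PER_HOUR = 3600
SECONDS_PER_DAY = 24 * SECONDS_PER_HOUR
BYTES_PER_TB = 10**12  # assumption: decimal terabytes
BYTES_PER_MB = 10**6   # assumption: decimal megabytes


def transfer_hours(volume_tb, rate_mb_per_s):
    """Hours needed to move volume_tb at a constant rate_mb_per_s."""
    seconds = volume_tb * BYTES_PER_TB / (rate_mb_per_s * BYTES_PER_MB)
    return seconds / SECONDS_PER_HOUR


def average_rate_mb_per_s(volume_tb_per_day):
    """Sustained rate in MB/s that corresponds to a daily volume in TB."""
    return volume_tb_per_day * BYTES_PER_TB / BYTES_PER_MB / SECONDS_PER_DAY


print(f"CMS, 10 TB at 150 MB/s: {transfer_hours(10, 150):.1f} h")
print(f"LHCb, 2 TB/day: {average_rate_mb_per_s(2):.1f} MB/s")
```

Under these assumptions the CMS test amounts to about 18.5 hours of continuous writing, and 2 TB/day corresponds to a sustained rate of roughly 23 MB/s.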
LHC activities at GridKa July - October
Month | Who | Pre-production env. | Production environment
July | GridKa | Support gLite 3.2 | T0-T1 at full nominal rates (tape); via dTeam
July | ALICE | Testing gLite 3.2 | Reconstruction at CERN and remote centres
July | ATLAS | Testing gLite 3.2 | Distributed processing tests (part I; 3 weeks)
July | CMS | Testing gLite 3.2 | Bulk simulation (2 months)
July | LHCb | Testing gLite 3.2 | Reconstruction/stripping: 2 TB/day out of CERN; 125 TB on MSS @ Tier-1s
August | GridKa | | Deployment of gLite 3.2 ??
August | ALICE | |
August | ATLAS | | Distributed analysis tests part I (2 weeks in July - August)
August | CMS | | Bulk simulation continues
August | LHCb | | Analysis on data from June / July … until spring 07 or so…
Sept. | GridKa | |
Sept. | ALICE | | Scheduled + unscheduled (T2s?) analysis challenges
Sept. | ATLAS | | Tier-0 test phase 2 with data to Tier-2s (3-4 weeks in September - October)
Sept. | CMS | | Preparations for Computing Software Analysis Challenge 2006 (CSA06)
Sept. | LHCb | | Analysis on data continues
October | GridKa | | Prepare for re-installation with Scientific Linux 4.x ??
October | ALICE | |
October | ATLAS | | Distributed processing tests part 2 (3 weeks)
October | CMS | | Execute CSA06
October | LHCb | | Analysis on data continues
WLCG medium term evolution
[Figure: starting from SC4. 3D distributed database services: development, test. SRM 2: test and deployment; plan being elaborated; October? Additional planned functionality: to be agreed & completed in the next few months, then tested and deployed, subject to progress & experience. New functionality: evaluation & development cycles. Possible components for later years: ??]
Summary (taken from Jamie Shiers / SC4 Mumbai)
• Two grid infrastructures are now in operation, on which we are able to build computing services for LHC.
• Reliability and performance have improved significantly over the past year.
• The focus of Service Challenge 4 is to demonstrate a basic but reliable service that can be scaled up, by April 2007, to the capacity and performance needed for the first beams.
• Development of new functionality and services must continue, but we must be careful that this does not interfere with the main priority for this year: reliable operation of the baseline services.