Important Dates

Applications submitted prior to June 10 have priority for scholarships.

Notification of acceptance will be provided by June 14. Applications submitted after June 10 will be reviewed and accepted based on availability.

 
Contact Info:

For questions please contact:

sdscsi@sdsc.edu

Links:

UC San Diego
SDSC

Sponsors:

National Science Foundation

SDSC

Center for Large Scale Distributed Systems
Brocade

SDSC 2013 Summer Institute:

Discover Big Data

Program/Schedule

The SDSC Summer Institute: Big Data Supercomputing will be held Monday – Friday, August 5 – 9, 2013, at the San Diego Supercomputer Center (SDSC) on the University of California, San Diego (UCSD) campus. Light refreshments and lunch will be provided throughout.

Required Materials
Participants are expected to bring a laptop computer so they can follow along with the demos and hands-on instruction throughout the Institute.

August 5 – 9, 2013
SDSC Auditorium at UC San Diego
Agenda

Day 1: Monday, August 5, 2013

8:30-12:00 AM: INTRODUCTION, Chaitan Baru
8:30-10:00am

Introduction to the SDSC Summer Institute (Natasha Balac and Chaitan Baru)

(Natasha's and Chaitan's Slides in PDF)

Introduction to XSEDE (Bob Sinkovits)

(Bob's Slides in PDF)

High Performance Computing and Data Resources at SDSC (Mahidhar Tatineni and Christine Bagwell)

(Mahidhar's Slides in PDF)

(Christine's Slides in PDF)

10-10:30am Break
10:30-11:15am Attendee Introductions
11:15am-12pm

OpenTopography (Vishu Nandigam, Chaitan Baru)

(Chaitan's and Vishu's Slides in PDF)

12-1:30PM LUNCH
1:30-5PM PM: INTRODUCTION (cont'd), Mahidhar Tatineni
1:30-2:15pm

Technology presentation on Globus Online (GO), with data transfer demo.

(Slides TBD)

2:15-3:00pm

Enabling Phylogenetic Research via the CIPRES Science Gateway. (Wayne Pfeiffer)

(Wayne's Slides in PDF)

3:00-3:30pm Break
3:30-4:15pm

Usecase: The Compact Muon Solenoid (CMS) high-energy physics project. (Frank Wuerthwein)

(Rick's and Frank's Slides in PDF)

4:15-5pm

Hands-on session: Running Jobs on SDSC systems and an Overview of Available Data Analytics Software. (Mahidhar Tatineni)

(Mahidhar's Slides in PDF)

6pm RECEPTION

Day 2: Tuesday, August 6, 2013

8:30-12:00 AM: DATA MANAGEMENT, Bob Sinkovits
8:30-10:00am

Basics of storage technologies and filesystems, including parallel filesystems, distributed filesystems, and cloud storage, e.g. Lustre, HDFS, OpenStack Swift ObjectStore. Pros and cons. (Rick Wagner)

(Rick's Slides in PDF)

10:00-10:30am Break
10:30-11:15am

Demo: myHadoop on Gordon and Hadoop. (Mahidhar Tatineni)

(Mahidhar's Slides in PDF)

11:15am-12pm

Usecase: IntegromeDB search engine for biomedical, biochemical, drug, disease and health related data. (Julia Ponomarenko)

(Julia's Slides in PDF)

12-1:30pm LUNCH
1:30-5pm PM: DATA GENRES, Chaitan Baru
1:30-2:15pm

Basic Data Management (Chaitan Baru)

(Chaitan's Slides in PDF)

2:15-3pm

New Technologies for Data Management (structured, semistructured, unstructured data) for Big Data (Chaitan Baru)

(Chaitan's Slides in PDF)

3-3:30pm Break
3:30-4:15pm

Usecase: Neurosciences Information Framework. (Amarnath Gupta)

(Amarnath's Slides in PDF)

4:15-5pm Demo/Hands-on: MongoDB, MongoDB vs. Postgres; CYCORE project usecase (Kai Lin)

Day 3: Wednesday, August 7, 2013

8:30-12:00 AM: DATA GENRES (cont'd), Amarnath Gupta
8:30-10:00am

Management of data streams, graph data. (Amarnath Gupta)

(Amarnath's Slides in PDF)

10:00-10:30am Break
10:30-11:15am

Usecase Presentations: Identification of structurally cohesive subgroups using R and iGraph.

(Doug White, Bob Sinkovits)

(Part 1 Slides in PDF)

(Part 2 Slides in PDF)

11:15am-12pm

Demos/Hands-on: Exercises with graph data, e.g. DB2 RDF, GraphLab (Chris Condit, Amarnath Gupta)

(Chris' Slides Online)

12-1:30PM LUNCH
1:30-5PM PM: DATA ANALYTICS, Natasha Balac
1:30-2:15pm

Intro to Data mining and Predictive Analytics. (Natasha Balac)

 

2:15-3:00pm

Hands-on: Intro to R. (Nicole Wolter)

 

3:00-3:30pm Break
3:30-4:15pm

Decision Trees Presentation; Hands-on Decision Trees in R. (Natasha Balac, Nicole Wolter)

 

4:15-5pm

Demo: Clustering overview and Example. (Paul Rodriguez)

(Paul's Slides in PDF)

Day 4: Thursday, August 8, 2013

8:30-12:00 AM: DATA ANALYTICS (cont'd)/HPC, Paul Rodriguez
8:30-10:00am

Algorithms overview; Data Mining Guidelines and Foundations; R Implementations (Paul Rodriguez)

(Paul's Slides in PDF)

10:00-10:30am Break
10:30am-11:15am

R Parallel options; Hands-on. (Glenn Lockwood)

(Glenn's Page of Resources)

(Glenn's Slides in PDF)

11:15am-12pm

Random Forest; Use case; Example and Implementations; Variable selection; Demo in R. (Paul Rodriguez)

(Paul's Use Case Slides in PDF)

12-1:30PM LUNCH
1:30-5PM PM: HPC/VISUALIZATION, Amit Chourasia
1:30-2:15pm Information Visualization Techniques and Use Cases (Lecture) (Amit Chourasia)
2:15-3:00pm Scientific Visualization Techniques and Use Cases (Lecture). (Amit Chourasia)
3:00-3:30pm Break
3:30-5pm:

Introduction to VisIt Software (Hands-on). (Amit Chourasia)

(Amit's Slides in PDF)

Additional Links of Interest:

Video of Introduction to Visualization Workshop

Video of VisIt Visualization Workshop

Day 5: Friday, August 9, 2013

8:30-12:00 AM: HPC PROGRAMMING (cont'd), Ilkay Altintas
8:30-10:00am

Introduction to MATLAB, parallel MATLAB (Ilkay Altintas)

(Ilkay's and Jerry's Slides in PDF)

10:00-10:30am Break
10:30-11:15am

Usecase Presentation/Demo: MATLAB usecase with workflow (Ilkay Altintas, Jerry Greenberg)

(Ilkay's and Jerry's Slides in PDF)

11:15am-12pm

Closing: Lightning Talks by Attendees

(Download the Lightning Talks Template)