Data Day 2016
August 22 - 23, 2016
Wang Hall (Building 59) and Building 50 Auditorium
Lawrence Berkeley National Laboratory
This summer, NERSC will host a brand new, data-centric event, Data Day. The main event will take place on August 22, followed by a half-day hackathon on August 23. The goal: Bring together researchers who use, or are interested in using, NERSC systems for data-intensive work.
On August 22, a full day of talks and tutorials will get you up and running with the latest data-focused tools for scientific computing. Training sessions on machine learning, Python, Spark, the Cori Burst Buffer, and science gateways will be complemented by presentations from scientists already taking advantage of these tools in their own research. A poster session will highlight work and work-in-progress that uses NERSC resources for experimental and observational science.
Stick around for the hack day, and you’ll get a chance to put your new knowledge to work creating a custom workflow. We’ll start off the half-day hackathon with an introduction to building workflows with Python and Spark, then provide local experts to help you get coding. The entire event will be held on site at the lab, in the Building 50 Auditorium and in Wang Hall, NERSC’s new landmark building at Berkeley Lab. Come and learn how to power up your data.
Agenda Overview
Monday, August 22: Data Day
Remote connection available via Zoom.
Some notebooks that will be shown are available at NERSC's data-day examples git repo.
Time | Topic | Presenter(s) | Room |
---|---|---|---|
Data talks | | | Bldg. 50 Aud |
8:30 am | Welcome (video, slides) | Katie Antypas, Head, NERSC Data Dept | Bldg. 50 Aud |
8:45 | Intro to Machine Learning (video, slides) | Prabhat, NERSC Data and Analytics Services Group Lead | Bldg. 50 Aud |
9:15 | Machine Learning tutorial (video, slides) | Evan Racah, NERSC | |
9:30 | Science with Machine Learning (video, slides) | Marcus Stoiber, LBNL | Bldg. 50 Aud |
9:45 | Break | | |
10:10 | Python Tutorial (video, slides) | Rollin Thomas, NERSC | Bldg. 50 Aud |
10:40 | Science with Python (video, slides) | Ben Bowen, LBNL | Bldg. 50 Aud |
11:10 | Spark tutorial (video, slides) | Lisa Gerhardt, LBNL | Bldg. 50 Aud |
11:40 | Science with Spark (video, slides) | Zhong Wang, DOE Joint Genome Institute | Bldg. 50 Aud |
12:10 - 1:30 | Lunch and poster preview | Lunch will be provided for registered attendees | 59-3101/3104 |
1:45 | Visualization tutorial and discussion (video, slides) | Annette Greiner, NERSC | Bldg. 50 Aud |
2:30 | Burst Buffer Tutorial (video, slides) | Debbie Bard, NERSC | Bldg. 50 Aud |
3:00 | Science with the Burst Buffer (video, slides) | Andrey Ovsyannikov | Bldg. 50 Aud |
3:30 | Break | | |
3:50 | Data Management Tutorial (video, slides) | Quincey Koziol, NERSC | Bldg. 50 Aud |
4:20 | Scientific Data Management (video, slides) | Patrick Huck, LBNL, The Materials Project | Bldg. 50 Aud |
4:50 | Closing Comments | | Bldg. 50 Aud |
5:00 - 6:30 | Poster session | | 59-3101/3104 |
6:30+ | Dinner at Triple Rock Brewery in downtown Berkeley (no host) | | |
Tuesday, August 23: Hackathon!
- Talks will be held in room 3101. For those who would rather hack than attend a talk, room 3104 will also be available for all-day hacking.
- A NERSC account will be needed to get the most out of the hacking.
- NERSC staff will be on hand to help out with your questions on the day.
Time | Topic | Presenter(s) | Room |
---|---|---|---|
Hands-on talks | | | 59-3101 |
8:30 am | Workflows with Spark | Evan Racah, LBNL | 59-3101 |
8:30 - 11:00 | Hack session 1 | All | 59-3101/3104 |
11:00 | Workflows with the Cori Burst Buffer (slides) | Glen Overby, Cray Inc. | 59-3101 |
11:00 - 1:30 | Hack session 2/working lunch (provided) | All | 59-3101/3104 |
1:30 | Close | | |
Venue
Lawrence Berkeley National Laboratory
Wang Hall and Building 50
1 Cyclotron Road, Bldg. 59
Berkeley, CA 94720
Tel: 510.486.7612
http://www.lbl.gov/
Lawrence Berkeley National Laboratory is a multidisciplinary national laboratory located in Berkeley, California on a hillside directly above the campus of the University of California at Berkeley. The site consists of 76 buildings located on 183 acres, which overlook both the campus and the San Francisco Bay. »More Visitor Information.
Posters
All attendees are invited to bring a poster for display at Data Day. Please indicate on the registration form whether you will do so, so that we know how many poster spaces to reserve. We encourage attendees to hang their posters when they arrive in the morning so that others can view them throughout the day. We also ask poster presenters to stand by their posters for the first half hour of the evening poster session. Please size your poster to no larger than 36 in. x 48 in., and leave it unmounted. We will provide the means to hang it, including hardware.
Site Access
All non-LBNL participants must register. Those attending in person will need to obtain a visitor bus pass or parking pass in advance. Indicating on the registration form that you will attend in person will initiate the request. We recommend carpooling, the shuttle, or a taxi. On-site attendees must check in with Security and show a picture ID.
LBNL Security requires the names, citizenship and country of birth of all visitors, including those attending remotely. If you have questions, please contact Zaida McCunney.
The Blackberry Canyon entrance is closest to Wang Hall and Building 50. The shuttle buses stop at building 65, which is the main shuttle bus stop (across the street from the driveway to Wang Hall and the stairs to Building 50).
Parking
Parking at LBNL is limited due to a number of construction projects. We recommend carpooling, the LBNL shuttle, or a taxi. For visitors who plan to drive to LBNL, we will reserve general parking near Wang Hall.
Shuttle Information
There is an LBNL shuttle bus from various locations around the City of Berkeley, including near the Downtown Berkeley BART station. If you plan to take the LBNL shuttle, you are required to have an LBNL Visitor Bus Pass (requires advance notice) and picture ID to show the bus driver.
Bay Area Rapid Transit Train (BART)
The Downtown Berkeley BART station is two short blocks from the LBNL shuttle line on the corner of Addison and Shattuck. The LBNL Blue Uphill Shuttle Bus will take you to the Blackberry Canyon entrance and the CRT Building. The CRT Building is across the street from LBNL's main bus stop. »BART Web site
Lodging
We recommend booking lodging early.
Downtown Berkeley Inn
2001 Bancroft Way
Berkeley, CA 94720
510.843.4043
http://www.downtownberkeleyinn.com/
Hotel Durant
2600 Durant St., Berkeley, CA 94705
Tel: 510.845.8981
http://hoteldurantberkeley.com/
Hotel Shattuck
2086 Allston Way, Berkeley, CA 94704
Tel: 510.845.7300
http://www.hotelshattuckplaza.com/
(One short block from the Downtown Berkeley BART station and LBNL shuttle stop)
LBNL Guest House
1 Cyclotron Road
Berkeley, CA 94720
510.495.8000
Rate: $124
http://www.berkeleylabguesthouse.berkeley.edu/
Courtyard Oakland Airport
350 Hegenberger Road, Oakland, CA 94710
Tel: 510.568.7600
http://www.marriott.com/hotels/travel/oakap-courtyard-oakland-airport/
Remote Participants
Remote participation will be available via Zoom:
Join from PC, Mac, Linux, iOS or Android: https://zoom.us/j/834308495
Or iPhone one-tap (US Toll): +16465588656,834308495# or +14086380968,834308495#
Or Telephone:
Dial: +1 646 558 8656 (US Toll) or +1 408 638 0968 (US Toll)
Meeting ID: 834 308 495
International numbers available: https://zoom.us/zoomconference?m=YBPtgMnBZV7j5gpUX5SDjYfpsWPxjYeL
Or a H.323/SIP room system:
H.323: 162.255.37.11 (US West) or 162.255.36.11 (US East)
Meeting ID: 834 308 495
SIP: 834308495@zoomcrc.com
Downloads
- DataDay-Welcome.mp4 | Welcome to Data Day, with Katie Antypas
- DataDay-Intro-To-ML.mp4 | Introduction to Machine Learning, with Prabhat
- DataDay-ML-Tutorial.mp4 | Machine Learning tutorial, with Evan Racah
- DataDay-Science-ML.mp4 | Science with Machine Learning, with Marcus Stoiber
- DataDay-Python-Tutorial.mp4 | Python tutorial, with Rollin Thomas
- DataDay-Science-Python.mp4 | Science with Python, with Ben Bowen
- DataDay-Spark-Tutorial.mp4 | Spark tutorial, with Lisa Gerhardt
- DataDay-Science-Spark.mp4 | Science with Spark, with Zhong Wang
- DataDay-Viz-Tutorial.mp4 | Visualization tutorial, with Annette Greiner
- DataDay-BurstBuffer-Tutorial.mp4 | Burst Buffer Tutorial, with Debbie Bard
- DataDay-Science-BurstBuffer.mp4 | Science with the Burst Buffer, with Andrey Ovsyannikov
- DataDay-DataManagement-Tutorial.mp4 | Data Management tutorial, with Quincey Koziol
- DataDay-Science-DataManagement.mp4 | Science and Data Management, with Patrick Huck