Linux Cluster Institute Workshop, Urbana, IL — Aug. 4-7

The Linux Cluster Institute (LCI) Workshop is scheduled for August 4-7 2014 at the National Center for Supercomputing Applications (NCSA), Urbana IL.

Aimed at HPC users and those responsible for maintaining HPC resources, the workshop will address:

  • How to be an HPC cluster system administrator
  • How to be an effective HPC cluster user
  • The key issues of HPC
  • Current and emerging HPC hardware and software technologies

Workshop: High-Performance Networking for International Climate Science — July 14-16, Boulder, CO

ESnet and Internet2 in collaboration with key partners Indiana University (IU), National Center for Atmospheric Research (NCAR) and NOAA have announced a multi-day workshop that will bring together network and data management experts with the international climate community to discuss their pressing data challenges and emerging network requirements. The workshop will include a slate of invited speakers and panelists in a format designed to encourage lively, interactive discussions with the goal of developing a set of tangible next steps for supporting this data-intensive science community. Breakout sessions and many opportunities for professional networking are also planned.

For more information, visit

EarthCube All-Hands Meeting — June 24-26, Washington, D.C.

The EarthCube All-Hands Meeting will bring together project institutions, partners, collaborators, and scientists from across the globe to share their progress and experience with EarthCube thus far, and discuss and plan activities for the upcoming year. This meeting will be held from June 24-26, 2014 at the Renaissance Dupont Circle hotel in Washington, D.C.  

This event is designed in the collaborative spirit of EarthCube and will provide multiple opportunities for networking and meaningful work, as well as the chance to share your efforts and to learn from others.

Register Now!
To learn more about the meeting and register, please visit the EarthCube All-Hands Meeting website (a registration fee of $180 will cover a light breakfast, lunch, and coffee breaks for all 3 days, as well as a reception on the evening of June 24th).

Call for Proposals
We are pleased to invite proposals for sessions for the All-Hands Meeting. Proposed sessions should relate to EarthCube, the state of cyberinfrastructure in the geosciences, and innovative geoscience contributions to data management. Sessions in multiple formats will be considered, including:

•    Workshops or Hack-a-thons
•    Technology Presentations
•    Working Groups & Business (Project) Meetings
•    Panel discussions (Plenary or Breakout)
•    Presentations

NCSA Blue Waters hosting “Symposium for Petascale Science and Beyond” — May 12-15, Champaign, IL

Hosted by the National Center for Supercomputing Applications, the “Symposium for Petascale Science and Beyond” will bring together leaders in petascale computational science and engineering and will be a tremendous opportunity for sharing successes, methods, and future challenges in petascale+ computing and analysis.

The symposium is scheduled for May 12-15 in Champaign, IL.

Along with presentations from the leaders of Blue Waters science teams, the symposium will provide forums for high-bandwidth information exchanges between teams and investigators. Charles Seife, author of “Decoding the Universe: How the New Science of Information is Explaining Everything in the Cosmos, From Our Brains to Black Holes,” will deliver the keynote address.

There is no fee to attend the symposium, but there is limited space and capacity so registration is required. If necessary, priority will be given to Blue Waters science team members. Online registration, along with a draft agenda and logistics details, can be found on the Blue Waters portal:

TACC Summer Supercomputing Institute — May 2 application deadline

The 2014 Texas Advanced Computing Center (TACC) Summer Supercomputing Institute will take place Monday, June 16 – Friday, June 20, 2014.

Apply Now


This week-long workshop is appropriate for all levels of researchers, faculty, staff, and graduate students, from new users of advanced computing technologies, to those who have research projects requiring powerful
computing, visualization, storage, or software. We encourage participation from Minority Serving Institutions,
Hispanic Serving Institutions, and Historically Black Colleges and Universities.

  • Researchers across disciplines: Mathematics, Engineering, Physics, Astronomy, Astrophysics, Cosmology, Geology & Geophysics, Computer Sciences, Biosciences, Nanosciences, and Data Analytics
  • Graduate and undergraduate students
  • Current TACC & XSEDE users
  • Industrial affiliates


The Institute will provide researchers with an intensive introduction to using TACC’s computing resources.
Senior TACC staff will deliver presentations and lead interactive lab sessions focused on using TACC’s advanced computing resources and technology.

  • Stampede: Dell PowerEdge C8220 Cluster with Intel Xeon Phi coprocessors
  • Lonestar: Dell Linux Cluster
  • Maverick: HP/NVIDIA Interactive Visualization and Data Analytics System
  • Ranch: Petascale archival facility

On January 7, 2013 TACC deployed a new compute cluster, Stampede. Funded by the National Science Foundation, this new cluster provides the community with access to 2 PFlops of Intel based microprocessor power and 8 PFlops of Intel MIC (Many Integrated Core) architecture technology. During the Institute students will receive a description of the system and TACC staff will present a session on how to use the new MIC architecture.

Lectures and Labs: Senior TACC staff will deliver presentations and lead interactive laboratory sessions:

  • Obtaining access to TACC resources and services
  • Reviewing the hardware and software available on TACC resources
  • Developing parallel programs with OpenMP and MPI
  • Using visualization and data analysis software and systems
  • GPGPU programming
  • Using the Intel Xeon Phi coprocessor (MIC)

Applications Seminars: Leading computational researchers will discuss their work, including examples of how they are utilizing TACC’s resources.

Consulting: During the Institute, TACC staff will be available to assist participants in applying the techniques and technologies covered in the Institute to their own applications.

Applying to Attend the Institute

Applications to attend the Institute must be submitted by Friday, May 2, 2014. Applicants will receive notification of the status of their application by Friday, May 9, 2014.

Apply Now

SC14 conference fellowships available for PhD students — May 1 deadline

The ACM/IEEE-CS George Michael Memorial HPC Fellowship is now open for submissions from exceptional PhD students whose research focus is on high-performance computing applications, networking, storage, or large-scale data analysis using the most powerful computers that are currently available.

Recipients receive a $5000 honorarium, travel and registration for SC14, and recognition at the SC14 Awards Ceremony.

For more information, visit the SC14 site at:

Submit applications via:

Applications due: May 1, 2014

Vijay Pande of Stanford University to speak on “Surprises in the Biophysics of Protein Dynamics” — April 30

As part of the Michigan Institute for Computational Discovery and Engineering Seminar Series, Vijay Pande of Stanford University will speak on “Some Surprises in the Biophysics of Protein Dynamics: simulating conformation change of kinases and GPCRs with Folding@home.”

Date: Wednesday, April 30

Time: 4 p.m.

Location: 1311 EECS Building (1301 Beal Ave.)

Abstract: A major challenge in molecular simulation is reaching experimentally relevant timescales.  We have developed a new approach for simulating long timescale dynamics using the Folding@home distributed computing project coupled to Markov State Models (MSMs) methods which can overcome these key challenges.  I will demonstrate this method with applications to all-atom molecular simulations on the millisecond timescale and beyond, with applications to protein conformational change in disease-relevant drug targets of GPCRs and kinases.  In particular, these simulations reveal novel druggable targets with the potential for more selective kinase drugs as well as give insight into the fundamental mechanisms of how these key proteins operate.

Bio: Vijay Pande is currently the Director of the Program in Biophysics and a Professor of Chemistry and (by courtesy) of Structural Biology and of Computer Science at Stanford University.  Prof. Pande received a BA in Physics from Princeton University in 1992 and PhD in physics from MIT in 1995.  Prof. Pande’s current research centers on the development and application of novel grid computing simulation techniques to address problems in chemical biology.  In particular, he has pioneered novel distributed computing methodology to break fundamental barriers in the simulation of kinetics and thermodynamics of proteins and nucleic acids.

Data Visualization and Exploration Tools: April 29 – May 1 conference, Boston

The BioIT World Conference and Expo in Boston, scheduled for April 29 – May 1, will feature a Data and Visualization Tools Track. The track will showcase how to design, implement and evaluate visualization techniques and tools that offer real value to the user both in support of genomics and sequencing research, as well as in drug discovery and development. The conference will present case studies that showcase approaches to data visualization and analysis that address important challenges in genomics, pathway analysis, oncology and drug discovery.

The program includes:

Variant View: Visualizing Sequence Variants in their Gene Context
Tamara Munzner, Ph.D., Professor, Computer Science, University of British Columbia
The Variant View visualization tool supports variant impact assessment with an information-dense visual encoding that provides maximal information at the overview level, in contrast to the extensive navigation required by currently-prevalent genome browsers… Read More

A Compendium of Next-Generation Clustered Heat Maps for Interactive Exploration of TCGA Data
John N. Weinstein, M.D., Ph.D., Professor & Chair, Bioinformatics & Computational Biology, Division of Quantitative Sciences, The University of Texas MD Anderson Cancer Center
The Cancer Genome Atlas (TCGA) program is generating comprehensive molecular profiles of more than 25 clinical tumor types, the first 12 of which have been incorporated into a Pan-Cancer project. One challenge is statistical analysis of the resulting profiles; a second is the visual detective… Read More

Making the UCSC Genome Browser Work for You
Robert Kuhn, Ph.D., Associate Director, UCSC Genome Browser, Center for Biomolecular Science and Engineering University of California, Santa Cruz
The UCSC Genome Browser provides visualization tools for a large genomic database spanning more than 100 animals. New features include a tool to analyze sequence variant data and hosting organisms not part of the UCSC infrastructure. Browser views of user data may be saved and shared… Read More

Visualizing the Broad Institute’s Connectivity Map
Bang Wong, Creative Director, Broad Institute of MIT & Harvard; Adjunct Assistant Professor, Art as Applied to Medicine, Johns Hopkins University School of Medicine
The CMap is a catalog of a gene-expression data generated by exposing cells to chemical and genetic modifiers. Depicting findings from this 26 trillion point dataset requires thoughtful decisions about data presentation. I will describe how we apply design principles to develop… Read More

Integrated Analysis and Visualization of Large-Scale Biological Data with the Refinery Platform
Nils Gehlenborg, Ph.D., Research Associate, Center for Biomedical Informatics, Harvard Medical School
Data sets with dozens or hundreds of samples are now common in molecular biology and the development of visualization tools for such large and complex data sets requires extensive software infrastructure. To address these challenges, we have developed the Refinery Platform… Read More

Web-Based Visualization and Visual Analysis for High-Throughput Genomics with Galaxy
Jeremy Goecks, Ph.D., Computational Biology Institute, George Washington University
Learn about how to use the popular, web-based Galaxy platform to analyze and visualize your high-throughput genomics data. Galaxy visualizations require only a web browser to use and no software or data downloads. Galaxy visualizations include a genome browser, Circos plot… Read More

NetGestalt: Integrating Multidimensional Omics Data over Biological Networks
Bing Zhang, Ph.D., Associate Professor, Biomedical Informatics, Vanderbilt University School of Medicine
Node-link diagram-based network visualization becomes inadequate as network size and data complexity increase. NetGestalt exploits the inherent hierarchical modular architecture of biological networks to achieve high scalability. It allows simultaneous presentation… Read More

Caleydo Entourage: Visualizing Relationships between Biological Pathways
Alexander Lex, Ph.D., Researcher, Harvard School of Engineering & Applied Sciences
This talk will introduce Entourage, a visualization technique for analyzing interrelationships between multiple related biological pathways. We use a novel technique – contextual subsets – to determine and present parts of other pathways that are relevant in the context of a focus pathway… Read More

Visit for a complete agenda. For registration, visit

Workshop series on data issues in research and business, June 23-27, Chapel Hill, NC — Early registration through April 28

Business managers, data analytics specialists, academic researchers, data center administrators and anyone else who grapples with big data are the target audience for a weeklong workshop series on data issues sponsored by the National Consortium for Data Science (NCDS), the Odum Institute for Social Science Research at UNC Chapel Hill, and the Renaissance Computing Institute (RENCI).

Data Matters, a summer workshop series on all things data, will be held June 23 – 27 at the Friday Center for Continuing Education, 100 Friday Center Drive, Chapel Hill. The workshop series will feature two-day courses on Monday and Tuesday and Thursday and Friday, and one-day courses on Wednesday.

Topics to be covered include strategies for managing big data, data management and analysis tools, using large-scale data networks, data mining and machine learning, data visualization, and predictive analysis. Instructors will include experts from SAS, Cisco, Duke University, UNC Chapel Hill, RENCI, Saffron Technologies, University of Massachuesetts at Amherst, and Pennslyvania State University.

For course descriptions, fees and registration information, click here.  Early bird registration runs through April 28 and saves you $50 per day. In addition to the courses, registration includes: an evening kickoff reception at Top of the Hill in Chapel Hill on Monday, June 23; lunch each day plus a lunchtime speaker on Wednesday, June 25; and transportation between the Friday Center and the UNC campus for lab work for some courses.


The National Consortium for Data Science formed in 2013 as a non-profit public-private partnership to advance the field of data science and address the data challenges of the 21st century. For more information, visit

The Odum Institute for Research in Social Science provides a range of consulting services on quantitative and qualitative methods, GIS and spatial analysis, survey research, and data management. For more information, visit

RENCI develops and deploys advanced computing, networking, and data technologies to enable research discoveries and business innovations. The institute is a collaborative effort involving UNC, Duke University and NC State University. For more information, visit

XSEDE Webinar: Introduction to Maverick — April 25

In this one-day class, users will receive instructions on the use of remote visualization software to visualize data sets generated on the new system Maverick. A review of the scientific visualization process will precede an overview of the visualization software available to XSEDE users, including the parallel visualization software VisIt and Paraview. In addition users will be introduced to Python and R for data analysis. Labs will provide students with the opportunity to prepare data sets to be visualized using these applications.

08:30 – 09:30 Intro to Scientific Visualization
09:30 – 10:00 Intro to Maverick
10:00 – 10:15 Break
10:15 – 11:00 Introduction to Python
11:00 – 12:00 Introduction to R
12:00 – 13:00 Lunch
13:00 – 14:00 Lab – ParaView
14:00 – 15:00 Lab – VisIt
15:00 – 16:00 Parallel Vis
16:00 – 17:00 Lab – Remote & Collaborative Visualization