NOAO >   Meetings and Colloquia  >   Software Talks

Small NOAO Logo

National Optical Astronomy Observatory

Software Talks -- Sep 01, 2008 to Aug 31, 2009


NOAO Software Talks
12:00pm, NOAO-Tucson Main Conference Room (unless otherwise noted)


Requests for inclusion in the mailing list (or any other inquiries) should be forwarded to Rob Seaman.

May 26  
Todd Boroson NOAO Finding the Goodies in the Sloan Survey Archive of QSO Spectra

Apr 28  
Irene Barg NOAO SDM Lessons Learned from PostgreSQL Administration Course

I attended a 5 day PostgreSQL Administration Course April 6-10, 2009 at the Open Technology Group http://www.otg-nc.com in Morrisville NC. Our instructor Chander Ganesan took us through wide range of PostgreSQL administration tasks, from manual installation and configuration to performance tuning, connection pooling, full text searching, replication and more. This talk will be about the changes I made to our PostgreSQL installations because of the many techniques I learned from this course.

 

Apr 21  
Chuck Gessner NOAO Risk Management for Programmers, Ergonomics for Everyone

Apr 7  
Dick Shaw NOAO Doing Science with Virtual Observatory Tools, or What I did at the NVO Summer School

The much trumpeted utility of the Virtual Observatory was put to a severe test this past fall when a well-intentioned, diverse group of scientists, programmers, ameture astronomers, librarians, and Public Outreach professionals of all ages, shapes, and technical prowess embarked on "hands-on" science projects at the NVO Summer School in Santa Fe, NM. Our particular project sought to assimilate the results of multiple public and private surveys to search for infra-red counterparts to variable objects (including planetary nebulae) in the Large Magellanic Cloud. Although we had some false-starts along the way, in the end we learned that there are generally multiple ways to approach a problem with VO tools, and that our success would have been significantly more difficult to achieve without them. I will conclude with a summary our new scientific results.

 

Mar 31  
None (preempted)

Mar 24  
Open (TBA)

Mar 17  
None (Conflicts with WildStars II and SARSEF)

Mar 10  
Brian Harker-Lundberg NSO GPU-Accelerated Stokes Inversion for Solar Vector Magnetography using NVIDIA's CUDA Platform

Modern GPUs are typically an order of magnitude more powerful than their modern CPU counterparts. Combine this with the fact that modern GPUs are able to execute many instructions independently and in parallel (since they were designed for high-speed, low-latency graphics rendering), and what results is hardware for scientific computation that has an inherently fine-grained parallelism. This type of platform can offer tremendous speedup in an application, provided the application can be efficiently parallelized to run over multiple threads. This short talk outlines the hardware and CUDA software platform we will be using to implement a high-speed, thread-based method of inferring vector magnetic field information from solar Stokes polarization profiles.

 

Mar 3  
None (preempted)

Feb 24  
Mike Fitzpatrick NOAO Science Data Management NVO Tools for Data Discovery and Access

The NVO Data Discovery Portal is a suite of integrated web applications meant to allow astronomers to find the data, services and tools available within the VO that enhance their science. This talk will introduce the newly released Portal elements and provide real-science examples of its use.

 

Feb 3  
Andrey Yeatts WIYN The ODI Instrument Pipeline

The ODI pipeline is modeled as an event-dispatch framework. We look at its design and discuss the remote pipeline service and how it integrates with the StarGrasp controller service.

 

Jan 27  
Bradford Castalia HiRISE Operations Center Maestro: Managing Conductor Networks of Automated Processing Pipelines

This presentation will provide an overview of the Maestro software for managing networks of Conductor pipelines with an emphasis on the software technology issues involved along with examples as used at the HiRISE Operations Center.

The Mars Reconnaissance Orbiter (MRO; http://mars.jpl.nasa.gov/mro/) High Resolution Imaging Science Experiment (HiRISE; http://hirise.lpl.arizona.edu/) generated a large number of big observation data products during the Primary Science Phase of the mission - currently over 30 terabytes in over 867,000 product files - and continues to generate products at a high rate. This has been accomplished by using a network of automated Conductor (http://pirlwww.lpl.arizona.edu/software/Conductor.shtml) pipelines distributed over a cluster of 27 multi-processor compute nodes at the HiRISE Operations Center (HiROC). When a watchdog process, that is always running on one of these nodes, detects that new HiRISE observation data is available from NASA's Jet Propulsion Lab (JPL; http://www.jpl.nasa.gov/), which receives it from the spacecraft via the Deep Space Network, the watchdog makes an entry in the sources database table of the first Conductor pipeline segment. This initiates the data download to HiROC that begins the sequence of linked Conductor pipelines. Each pipeline segment defines a particular set of data processing operations to be applied as data flows through the network of Conductor pipelines which ultimately produce the data products that are delivered to the science community and the public by the Planetary Data System (PDS; http://pds.jpl.nasa.gov/) and the HiRISE web site.

The Conductor pipelines, though they operate autonomously, do require management. For example, to keep up with the flow of new incoming data multiple Conductors are allocated on multiple compute nodes to handle long running procedures, such as geometric processing, thus avoiding processing bottlenecks by processing multiple data sources in parallel. Bad data (Deep Space Network transmission gaps in critical sections) or problems in the underlying systems can cause processing failures that will, if the configured failure limit is reached, cause Conductors to stop processing the affected pipelines and notify operators to investigate the problem before restarting the Conductors. Changes to processing parameters can require reprocessing of some or all data products which calls for a different network of Conductor pipelines than is used for routine processing. Thus the data processing operators use various Conductor networks depending on the needs of the situation. The data processing procedures themselves are undergoing constant tuning and enhancements that require suspending some or all of the Conductor pipelines while new processing software and/or configuration files are installed. These conditions call for a tool that can manage both individual Conductors and the pipeline networks as a whole.

The Maestro package is a new addition to the Conductor software package (http://pirl.lpl.arizona.edu/software/Conductor.shtml). It provides remote monitoring and management of Conductor networks. The Maestro software is based on an asynchronous, event-driven Messenger service in which each Conductor reports its processing activities, as they occur, to a Stage_Manager for its Theater location. Each computer system may host multiple Theaters as needed. A Kapellmeister client can connect to the Stage_Manager of any Theater location and request to receive a list identifying all Conductors operating at the Theater location and notification of changes to the list. The Kapellmeister establishes Messenger connections, through the Stage_Manager, to the individual Conductors to receive the notifications of all the Conductor processing activities in real-time. The Kapellmeister provides Conductor network managers with a graphical user interface that controls all Theater connections, lists all the Conductors on all the Theaters with their processing state, enables managers to send the Conductors messages to change their processing state, and shows a matrix of all Conductor pipelines by their Theater location with a display that summarizes all the Conductor states. Operators can start new Conductors on any Theater as needed as well as cause existing Conductors to safely stop processing and, if desired, quit. The Kapellmeister can write a Profile file that defines the current Conductor network, and can read a Profile file to establish the defined Conductor network. Operators also have available a new Conductor Manager interface that provides detailed monitoring and management of all Conductor operations. The Manager may be used remotely via a Kapellmeister or locally when running an individual Conductor.

The Maestro package - part of the PIRL Java Packages (http://pirlwww.lpl.arizona.edu/software/PIRL_Java_Packages.shtml) - provides a high level of view of a Conductor pipeline network combined with detailed monitoring and control capabilities that offers significant management effectiveness and efficiency for these networks.

 

Jan 20  
Ken Mighell NOAO CRBLASTER: A Fast Parallel-Processing Program for Cosmic Ray Rejection in Space-Based Observations

Jan 13  
Alvaro Egaņa Science Data Management FRESSIA - A Framework for Testing Rich Applications

Dec 16  
Rob Seaman Science Data Management The Art of Noise: Optimal DN encoding for CCD and CMOS detectors

CCDs are linear devices. What does this mean? It does not mean that pixel values (DNs) must be represented on a linear scale. Since CCD and CMOS detectors are photon counting devices, they obey a Poisson (shot) noise model. This means that such data are "heteroscedastic", with a variance proportional to DN. Thus, a linear scaling of the data will result in a vast oversampling of the noise for brighter pixels.

Variance stabilization techniques will be discussed that can dramatically compress 16-bit or 32-bit integer data into a few hundred data numbers. In combination with tiled FITS Rice compression, this produces near-optimal encoding of astronomical data to improve both data storage and data throughput metrics. This dual technique has been used to compress datastreams from spacecraft, but a more formal analysis supports the argument that a variance stabilized "Poisson encoding" is the natural representation for CCD/CMOS data of all types. In general, the FITS Tile Compression standard provides significant advantages over general purpose compression formats such as the familiar gzip.

 

Dec 9  
Katy Garmany and Ken Mighell NOAO Using Virtual Astronomical Observatory Tools for Astronomy 101

Nov 18  
German Shumacher CTIO LSST Telescope and Site Control Software

Nov 11  
Phil Daly NOAO NEWFIRM Tools and Utilities

Oct 21  
Igor Suarez-Sola NSO Virtual Solar Observatory: the SDO data provider

NSF logo

NOAO is operated by the Association of Universities for Research in Astronomy (AURA), Inc. under cooperative agreement with the National Science Foundation

AURA, Inc. logo

NOAO  >   Meetings and Colloquia  >   Software Talks

webmaster@noao.edu

Small NOAO Logo