Objectives: We present a new model of data processing using commercially available cyberinfrastructure (e.g. cloud computing) to carry out high performance computing workflows against curated data sets within NDAR.
Methods: During the session, we will demonstrate the use of the autism informatics grid, now established, to access and process large volumes of data using generally available processing pipelines.
Results: Discussion of the steps to set up and automate a pipeline, the benefits of this approach over more traditional computational techniques, its cost, and any barriers encountered will be provided.
Conclusions: Cloud computing infrastructure and very large datasets, both readily available to the autism research community, provide unprecedented opportunities for discovery. By demonstrating these capabilities in real time, we will outline the framework needed for the research community to adopt similar methods in helping to accelerate scientific discovery.