Prof. Douglas Thain at Notre Dame
Selected Research Talks
- Building Data Intensive Function Oriented Workflows with TaskVine,
Throughput Computing Conference
, July 2024.
- TaskVine: Workflow for Data Intensive and Serverless Applications,
Throughput Computing
(HTC)
, July 2023.
- Data Intensive Computing with TaskVine,
Greater Chicago Area Systems Research Workshop
(GCASR)
, April 2023.
- Virtual Clusters for Community Computation,
DOE Collaborative Projects PI Meeting
, September 2019.
- Scalable Application Design: Pitfalls and Possibilities,
CVMFS Workshop at CERN
, June 2019.
- Reduction of Workflow Resource Consumption Using a Density-based Clustering Model,
Workshop on Workflows at Supercomputing
(WORKS)
, November 2018.
- VC3: A Virtual Cluster Service for Community Computation,
Practice and Experience in Advanced Research Computing, 2018.
(PEARC)
, June 2018.
- Challenges in Delivering and Deploying Software on Large Clusters,
Workshop on Runtime and Operating Systems for Supercomputers
(ROSS)
, June 2018.
- A Strategic Overview of Docker in Fifteen Minutes,
MAGIC Interagency Working Group
, February 2018.
- Provisioning Complex Software Environments for Scientific Applications,
CERM-VM Workshop
, January 2018.
- VC3: Virtual Clusters for Community Computation,
DOE Next Generation Network Systems PI Meeting, October 2017.
(NGNS)
, October 2017.
- Seamless Scientific Computing from Laptops to Clouds,
ScienceCloud Workshop at HPDC
, June 2017.
- Reconsidering the Filesystem for DAG Structured Workflows,
Huawei Labs
, March 2017.
- A First Look at Reproducibility and Non-Determinism in CMS Software and ROOT Data,
Computing in High Energy Physics
(CHEP)
, October 2016.
- Combining Containers and Workflow Systems for Reproducible Execution,
DASPOS Workshop on Container Technology
, May 2016.
- Preservation and Portability in Distributed Scientific Computing,
Grid 5000 Winter School, Grenoble, France
, February 2016.
- Analyzing LHC Data on 10K Cores with Lobster,
Workshop on Data Intensive Computing in the Clouds at ACM/IEEE Supercomputing
, November 2015.
- Techniques for Preserving Scientific Software Executions: Preserve the Mess or Encourage Cleanliness,
International Conference on Digital Preservation
(iPres)
, November 2015.
- Reproducibility and Preservation of Scientific Applications,
Trends in HPDC Workshop, IBM Almaden
, March 2015.
- Lobster: Personalized Opportunistic Computing for CMS at Scale,
CVMFS Workshop, CERN
, March 2015.
- Toward a Common Model of Highly Concurrent Applications,
MTAGS Workshop at Supercomputing
, November 2015.
- Scaling Up Without Blowing Up,
Greater Chicago Area Systems Research Workshop
, May 2013.
- Portable Resource Management for Data Intensive Workflows,
HTCondor Workshop
, May 2013.
- Computational Abstractions: Strategies for Scaling Up Applications,
Initiative for Computational Economics, University of Chicago
, July 2012.
- Real-World Barriers to Scaling Up Scientific Applications,
Vrije University, The Netherlands
, March 2012.
- Unsolved Computer Science Problems in Distributed Computing,
Workshop on Grid Computing: The Next Decade, Zakopane, Poland
, January 2012.
- High Throughput Scientific Computing with Condor: Computer Science Challenges in Large Scale Parallelism,
University of Alabama at Birmingham
, October 2011.
- Scaling up Data Intensive Science with Application Frameworks,
Michigan State University
, September 2011.
- Models and Frameworks for Data Intensive Cloud Computing,
IDGA Cloud Computing Summit, Washington DC
, February 2011.
- Experience with Cloud Adoption at Notre Dame,
IEEE Cloud Computing
, November 2010.
- Scaling Up Data Intensive Science to Campus Grids,
Clemson University
, September 2009.
- Science in the Clouds: History, Challenges, and Opportunities,
Cloud Computing and the Geosciences Workshop, IUPUI, Indianapolis
, September 2009.
- Getting Beyond the Filesystem,
High End Computing File Systems and I/O Workshop, Washington DC
, August 2009.
- Scaling Up Data Intensive Scientific Applications to Campus Grids,
Large Scale Application Performance (LSAP) Workshop at HPDC
, June 2009.
- Using Abstractions to Scale Up Applications to Campus Grids,
Purdue University
, April 2009.
- Using Small Abstractions to Program Large Distributed Systems (and multicore computers),
Fermi National Accelerator Laboratory
, February 2009.
- Using Small Abstractions to Program Large Distributed Systems (and multicore computers),
Distributed Programming Abstractions Workshop
, December 2008.
- Cooperative Computing for Data Intensive Science,
NSF Bridges to Engineering 2020 Conference
, March 2008.
- Efficient Access to Many Small Files in a Filesystem for Grid Computing,
IEEE Grid Computing
, September 2007.
- Data Intensive Abstractions for High End Biometric Applications,
High End Computing FSIO Working Group
, August 2007.
- Operating System Support for Space Allocation in Grid Storage Systems,
IEEE Grid Computing, Barcelona, Spain
, September 2006.
- Positioning Dynamic Storage Caches for Transient Data,
Workshop on High Performance I/O Systems at IEEE Cluster Computing, Barcelona, Spain
, September 2006.
- Tactical Storage: Simple, Secure, and Semantic Access to Remote Data,
European Condor Week
, July 2006.
- Debugging Distributed Systems via Data Mining,
IEEE High Performance Distributed Computing
, June 2006.
- Enabling Data Intensive Science with Tactical Storage Systems,
Fermi National Accelerator Laborator, Batavia, Illinois
, February 2006.
- Transparently Adapting Scientific Applications for the Grid,
DutchGrid Computing Colloquium, Netherlands Institude for High Energy Physics (NIKHEF), Amsterdam, Netherlands
, January 2006.
- Enabling Data Intensive Science with Tactical Storage Systems,
EGEE Computing Seminar, European Laboratory for High Energy Physics (CERN), Geneva, Switzerland
, January 2006.
- Separating Abstractions from Resources in a Tactical Storage System,
ACM/IEEE Supercomputing
, November 2005.
- Identity Boxing: A New Technique for Consistent Global Identity,
ACM/IEEE Supercomputing
, November 2005.
- Tactical Storage Systems,
Purdue RCAC/CRI
, October 2005.
- Enabling Data Intensive Science with Tactical Storage Systems,
INFN, Bologna, Italy
, May 2005.