Abstract

Real-world scientific applications often encompass end-to-end data processing pipelines composed of a large number of interconnected computational tasks of various granularity. We introduce HyperLoom, an open source platform for defining and executing such pipelines in distributed environments and providing a Python interface for defining tasks. HyperLoom is a self-contained system that does not use an external scheduler for the actual execution of the task. We have successfully employed HyperLoom for executing chemogenomics pipelines used in pharmaceutic industry for novel drug discovery. 6 1


Original document

The different versions of the original document can be found in:

http://dx.doi.org/10.1145/3183767.3183768 under the license http://www.acm.org/publications/policies/copyright_policy#Background
https://dl.acm.org/citation.cfm?doid=3183767.3183768,
https://dspace.vsb.cz/handle/10084/133294,
https://academic.microsoft.com/#/detail/2790801434
Back to Top

Document information

Published on 01/01/2018

Volume 2018, 2018
DOI: 10.1145/3183767.3183768
Licence: Other

Document Score

0

Views 0
Recommendations 0

Share this document

claim authorship

Are you one of the authors of this document?