fbpx
Wikipedia

Anduril (workflow engine)

Anduril is an open source component-based workflow framework for scientific data analysis[2] developed at the Systems Biology Laboratory, University of Helsinki.

Anduril
Anduril workflow in Eclipse
Developer(s)Systems Biology Laboratory University of Helsinki
Initial release1 July 2010; 12 years ago (2010-07-01)
Stable release
2.0.0 (2016-07-01) / July 1, 2016; 6 years ago (2016-07-01)[1]
Repository
  • bitbucket.org/anduril-dev/
Written inJava
Operating systemLinux, Microsoft Windows, Mac OS X
TypeWorkflow engine
LicenseGPL (v.1.x), BSD (v.2.x)
Websitewww.anduril.org

Anduril is designed to enable systematic, flexible and efficient data analysis, particularly in the field of high-throughput experiments in biomedical research. The workflow system currently provides components for several types of analysis such as sequencing, gene expression, SNP, ChIP-on-chip, comparative genomic hybridization and exon microarray analysis as well as cytometry and cell imaging analysis.

Architecture and features

A workflow is a series of processing steps connected together so that the output of one step is used as the input of another. Processing steps implement data analysis tasks such as data importing, statistical tests and report generation. In Anduril, processing steps are implemented using components, which are reusable executable code that can be written in any programming language. Components are wired together into a workflow, or a component network, that is executed by the Anduril workflow engine. Workflow configuration is done using a simple yet powerful scripting language, AndurilScript. Workflow configuration and execution can be done from Eclipse, a popular multipurpose GUI, or from the command line.

The core Anduril engine is written in Java and components are written in a variety of programming languages, including Java, R, MATLAB, Lua, Perl and Python. Components may also have dependencies on third-party libraries, such as Bioconductor. Components for cell imaging and microarray analysis are provided but additional components can be implemented by users. The Anduril core has been tested on Linux and Windows.

Anduril 1.0: AndurilScript language

Hello world in AndurilScript is simply

 std.echo("Hello world!") 

Commenting follows the syntax of Java:

 // A simple comment /* Another simple comment */ /** A description that will be included in component description */ 

Components are called by assigning their calls to named component instances. Names cannot be re-used within a single workflow. There are special components for input files that include external files to the script. Supported atomic types are integer, float, boolean and string, and typing is done implicitly.

 in1 = INPUT(path="myFile.csv") constant1 = 1 componentInstance1 = MyComponent(inputPort1 = in1, inputParam1 = constant1) 

Workflows are constructed by assigning outputs of component instances to inputs of following components.

 componentInstance2 = AnotherComponent(inputPort1 = componentInstance1.outputPort1) 

Component instances can also be wrapped as functions.

 function MyFunction(InType1 in1, ..., optional InTypeM inM, ParType1 param1, ..., ParTypeP paramP=defaultP) -> (OutType1 out1, ..., OutTypeN outN) { ... statements ... return record(out1=x1, ..., outN=xN) } 

In addition to standard if-else and switch-case statements, AndurilScript also includes for-loops.

 // Iterates over 1, 2, ..., 10 array = record() for i: std.range(1, 10) { array[i] = SomeComponent(k=i) } 

Extensibility

Anduril can be extended on multiple levels. Users can add new components to existing component bundles. However, if the new component or components carry out tasks that are not related to existing bundles, users can also create new bundles.

Moksiskaan

 
The upset face of the Moksiskaan logo

Moksiskaan is a data integration framework for the cancer research and molecular biology.[3] The framework provides a relational database that represents a graph of biological entities such as genes, protein, drugs, pathways, diseases, biological processes, cellular components, and molecular functions. In addition, there is a wide set of analysis and accession tools built on top of this data. The great majority of these tools are implemented as Anduril components and functions.

Moksiskaan is used mainly to interpret lists of candidate genes obtained from the genomic studies. Its tools can be used to generate graphs of biological entities related to the input genes. The exact for of these graphs may vary from the drug target predictions to the time series of signalling cascades. Some of the goals of these tools are closely related to IPA.

See also

References

  1. ^ "anduril-dev / anduril / doc / ChangeLog.txt — Bitbucket". bitbucket.org. Retrieved 2021-03-25.
  2. ^ Ovaska, K.; Laakso, M.; Haapa-Paananen, S.; Louhimo, R.; Chen, P.; Aittomäki, V.; Valo, E.; Núñez-Fontarnau, J.; Rantanen, V.; Karinen, S.; Nousiainen, K.; Lahesmaa-Korpinen, A. M.; Miettinen, M.; Saarinen, L.; Kohonen, P.; Wu, J.; Westermarck, J.; Hautaniemi, S. (2010). "Large-scale data integration framework provides a comprehensive view on glioblastoma multiforme". Genome Medicine. 2 (9): 65. doi:10.1186/gm186. PMC 3092116. PMID 20822536.
  3. ^ Laakso, M.; Hautaniemi, S. (2010). "Integrative platform to translate gene sets to networks". Bioinformatics. 26 (14): 1802–1803. doi:10.1093/bioinformatics/btq277. PMID 20507894.

Further reading

  • Scientists develop new database that provides comprehensive view of Glioblastoma Multiforme genome in the Cancer Genome Atlas Research Briefs, March 2011, by Catherine Evans.
  • Almeida, J. S. (2010). "Computational ecosystems for data-driven medical genomics". Genome Medicine. 2 (9): 67. doi:10.1186/gm188. PMC 3092118. PMID 20854645.
  • Sahu, B.; Laakso, M.; Ovaska, K.; Mirtti, T.; Lundin, J.; Rannikko, A.; Sankila, A.; Turunen, J. P.; Lundin, M.; Konsti, J.; Vesterinen, T.; Nordling, S.; Kallioniemi, O.; Hautaniemi, S.; Jänne, O. A. (2011). "Dual role of FoxA1 in androgen receptor binding to chromatin, androgen signalling and prostate cancer". The EMBO Journal. 30 (19): 3962–3976. doi:10.1038/emboj.2011.328. PMC 3209787. PMID 21915096.
  • Pihlajamaa, P.; Zhang, F. -P.; Saarinen, L.; Mikkonen, L.; Hautaniemi, S.; Janne, O. A. (2011). "The Phytoestrogen Genistein is a Tissue-Specific Androgen Receptor Modulator". Endocrinology. 152 (11): 4395–4405. doi:10.1210/en.2011-0221. PMID 21878517.
  • Blom, H.; Rönnlund, D.; Scott, L.; Spicarova, Z.; Rantanen, V.; Widengren, J.; Aperia, A.; Brismar, H. (2011). "Nearest neighbor analysis of dopamine D1 receptors and Na+-K+-ATPases in dendritic spines dissected by STED microscopy". Microscopy Research and Technique. 75 (2): 220–228. doi:10.1002/jemt.21046. PMID 21809413. S2CID 206067902.
  • Ehlers, P. I.; Kivimäki, A. S.; Turpeinen, A. M.; Korpela, R.; Vapaatalo, H. (2011). "High blood pressure-lowering and vasoprotective effects of milk products in experimental hypertension". British Journal of Nutrition. 106 (9): 1353–1363. doi:10.1017/S0007114511001723. PMID 21736845.
  • Maliniemi, P.; Carlsson, E.; Kaukola, A.; Ovaska, K.; Niiranen, K.; Saksela, O.; Jeskanen, L.; Hautaniemi, S.; Ranki, A. (2011). "NAV3 copy number changes and target genes in basal and squamous cell cancers". Experimental Dermatology. 20 (11): 926–931. doi:10.1111/j.1600-0625.2011.01358.x. PMID 21995814. S2CID 26219786.
  • Chen, P.; Lepikhova, T.; Hu, Y.; Monni, O.; Hautaniemi, S. (2011). "Comprehensive exon array data processing method for quantitative analysis of alternative spliced variants". Nucleic Acids Research. 39 (18): e123. doi:10.1093/nar/gkr513. PMC 3185423. PMID 21745820.
  • Karinen S., Heikkinen T.; et al. (2011). "Data Integration Workflow for Search of Disease Driving Genes and Genetic Variants". PLOS ONE. 6 (4): e18636. doi:10.1371/journal.pone.0018636. PMC 3075259. PMID 21533266.  
  • Heinonen M., Hemmes A.; et al. (2011). "Role of RNA binding protein HuR in ductal carcinoma in situ of the breast". The Journal of Pathology. 224 (4): 529–539. doi:10.1002/path.2889. PMC 3504799. PMID 21480233.
  • Louhimo R., Hautaniemi S. (2011). "CNAmet: an R package for integrating copy number, methylation and expression data". Bioinformatics. 27 (6): 887–888. doi:10.1093/bioinformatics/btr019. PMID 21228048.

External links

  • Official Anduril website
  • Anduril Code repository
  • Official Moksiskaan website
  • AndurilScript implementation of Donald Knuth's 99 Bottles of Beer

anduril, workflow, engine, anduril, open, source, component, based, workflow, framework, scientific, data, analysis, developed, systems, biology, laboratory, university, helsinki, andurilanduril, workflow, eclipsedeveloper, systems, biology, laboratory, univer. Anduril is an open source component based workflow framework for scientific data analysis 2 developed at the Systems Biology Laboratory University of Helsinki AndurilAnduril workflow in EclipseDeveloper s Systems Biology Laboratory University of HelsinkiInitial release1 July 2010 12 years ago 2010 07 01 Stable release2 0 0 2016 07 01 July 1 2016 6 years ago 2016 07 01 1 Repositorybitbucket wbr org wbr anduril dev wbr Written inJavaOperating systemLinux Microsoft Windows Mac OS XTypeWorkflow engineLicenseGPL v 1 x BSD v 2 x Websitewww wbr anduril wbr orgAnduril is designed to enable systematic flexible and efficient data analysis particularly in the field of high throughput experiments in biomedical research The workflow system currently provides components for several types of analysis such as sequencing gene expression SNP ChIP on chip comparative genomic hybridization and exon microarray analysis as well as cytometry and cell imaging analysis Contents 1 Architecture and features 2 Anduril 1 0 AndurilScript language 3 Extensibility 3 1 Moksiskaan 4 See also 5 References 6 Further reading 7 External linksArchitecture and features EditA workflow is a series of processing steps connected together so that the output of one step is used as the input of another Processing steps implement data analysis tasks such as data importing statistical tests and report generation In Anduril processing steps are implemented using components which are reusable executable code that can be written in any programming language Components are wired together into a workflow or a component network that is executed by the Anduril workflow engine Workflow configuration is done using a simple yet powerful scripting language AndurilScript Workflow configuration and execution can be done from Eclipse a popular multipurpose GUI or from the command line The core Anduril engine is written in Java and components are written in a variety of programming languages including Java R MATLAB Lua Perl and Python Components may also have dependencies on third party libraries such as Bioconductor Components for cell imaging and microarray analysis are provided but additional components can be implemented by users The Anduril core has been tested on Linux and Windows Anduril 1 0 AndurilScript language EditHello world in AndurilScript is simply std echo Hello world Commenting follows the syntax of Java A simple comment Another simple comment A description that will be included in component description Components are called by assigning their calls to named component instances Names cannot be re used within a single workflow There are special components for input files that include external files to the script Supported atomic types are integer float boolean and string and typing is done implicitly in1 INPUT path myFile csv constant1 1 componentInstance1 MyComponent inputPort1 in1 inputParam1 constant1 Workflows are constructed by assigning outputs of component instances to inputs of following components componentInstance2 AnotherComponent inputPort1 componentInstance1 outputPort1 Component instances can also be wrapped as functions function MyFunction InType1 in1 optional InTypeM inM ParType1 param1 ParTypeP paramP defaultP gt OutType1 out1 OutTypeN outN statements return record out1 x1 outN xN In addition to standard if else and switch case statements AndurilScript also includes for loops Iterates over 1 2 10 array record for i std range 1 10 array i SomeComponent k i Extensibility EditAnduril can be extended on multiple levels Users can add new components to existing component bundles However if the new component or components carry out tasks that are not related to existing bundles users can also create new bundles Moksiskaan Edit The upset face of the Moksiskaan logo Moksiskaan is a data integration framework for the cancer research and molecular biology 3 The framework provides a relational database that represents a graph of biological entities such as genes protein drugs pathways diseases biological processes cellular components and molecular functions In addition there is a wide set of analysis and accession tools built on top of this data The great majority of these tools are implemented as Anduril components and functions Moksiskaan is used mainly to interpret lists of candidate genes obtained from the genomic studies Its tools can be used to generate graphs of biological entities related to the input genes The exact for of these graphs may vary from the drug target predictions to the time series of signalling cascades Some of the goals of these tools are closely related to IPA See also EditBioinformatics workflow management systems GenePattern Kepler Apache Taverna Workflow management systemReferences Edit anduril dev anduril doc ChangeLog txt Bitbucket bitbucket org Retrieved 2021 03 25 Ovaska K Laakso M Haapa Paananen S Louhimo R Chen P Aittomaki V Valo E Nunez Fontarnau J Rantanen V Karinen S Nousiainen K Lahesmaa Korpinen A M Miettinen M Saarinen L Kohonen P Wu J Westermarck J Hautaniemi S 2010 Large scale data integration framework provides a comprehensive view on glioblastoma multiforme Genome Medicine 2 9 65 doi 10 1186 gm186 PMC 3092116 PMID 20822536 Laakso M Hautaniemi S 2010 Integrative platform to translate gene sets to networks Bioinformatics 26 14 1802 1803 doi 10 1093 bioinformatics btq277 PMID 20507894 Further reading EditScientists develop new database that provides comprehensive view of Glioblastoma Multiforme genome in the Cancer Genome Atlas Research Briefs March 2011 by Catherine Evans Almeida J S 2010 Computational ecosystems for data driven medical genomics Genome Medicine 2 9 67 doi 10 1186 gm188 PMC 3092118 PMID 20854645 Sahu B Laakso M Ovaska K Mirtti T Lundin J Rannikko A Sankila A Turunen J P Lundin M Konsti J Vesterinen T Nordling S Kallioniemi O Hautaniemi S Janne O A 2011 Dual role of FoxA1 in androgen receptor binding to chromatin androgen signalling and prostate cancer The EMBO Journal 30 19 3962 3976 doi 10 1038 emboj 2011 328 PMC 3209787 PMID 21915096 Pihlajamaa P Zhang F P Saarinen L Mikkonen L Hautaniemi S Janne O A 2011 The Phytoestrogen Genistein is a Tissue Specific Androgen Receptor Modulator Endocrinology 152 11 4395 4405 doi 10 1210 en 2011 0221 PMID 21878517 Blom H Ronnlund D Scott L Spicarova Z Rantanen V Widengren J Aperia A Brismar H 2011 Nearest neighbor analysis of dopamine D1 receptors and Na K ATPases in dendritic spines dissected by STED microscopy Microscopy Research and Technique 75 2 220 228 doi 10 1002 jemt 21046 PMID 21809413 S2CID 206067902 Ehlers P I Kivimaki A S Turpeinen A M Korpela R Vapaatalo H 2011 High blood pressure lowering and vasoprotective effects of milk products in experimental hypertension British Journal of Nutrition 106 9 1353 1363 doi 10 1017 S0007114511001723 PMID 21736845 Maliniemi P Carlsson E Kaukola A Ovaska K Niiranen K Saksela O Jeskanen L Hautaniemi S Ranki A 2011 NAV3 copy number changes and target genes in basal and squamous cell cancers Experimental Dermatology 20 11 926 931 doi 10 1111 j 1600 0625 2011 01358 x PMID 21995814 S2CID 26219786 Chen P Lepikhova T Hu Y Monni O Hautaniemi S 2011 Comprehensive exon array data processing method for quantitative analysis of alternative spliced variants Nucleic Acids Research 39 18 e123 doi 10 1093 nar gkr513 PMC 3185423 PMID 21745820 Karinen S Heikkinen T et al 2011 Data Integration Workflow for Search of Disease Driving Genes and Genetic Variants PLOS ONE 6 4 e18636 doi 10 1371 journal pone 0018636 PMC 3075259 PMID 21533266 Heinonen M Hemmes A et al 2011 Role of RNA binding protein HuR in ductal carcinoma in situ of the breast The Journal of Pathology 224 4 529 539 doi 10 1002 path 2889 PMC 3504799 PMID 21480233 Louhimo R Hautaniemi S 2011 CNAmet an R package for integrating copy number methylation and expression data Bioinformatics 27 6 887 888 doi 10 1093 bioinformatics btr019 PMID 21228048 External links EditOfficial Anduril website Anduril Code repository Official Moksiskaan website AndurilScript implementation of Donald Knuth s 99 Bottles of Beer Retrieved from https en wikipedia org w index php title Anduril workflow engine amp oldid 1039560239, wikipedia, wiki, book, books, library,

article

, read, download, free, free download, mp3, video, mp4, 3gp, jpg, jpeg, gif, png, picture, music, song, movie, book, game, games.