To install click the Add extension button. That's it.

The source code for the WIKI 2 extension is being checked by specialists of the Mozilla Foundation, Google, and Apple. You could also do it yourself at any point in time.

4,5
Kelly Slayton
Congratulations on this excellent venture… what a great idea!
Alexander Grigorievskiy
I use WIKI 2 every day and almost forgot how the original Wikipedia looks like.
Live Statistics
English Articles
Improved in 24 Hours
Added in 24 Hours
Languages
Recent
Show all languages
What we do. Every page goes through several hundred of perfecting techniques; in live mode. Quite the same Wikipedia. Just better.
.
Leo
Newton
Brights
Milds

Galaxy (computational biology)

From Wikipedia, the free encyclopedia

Galaxy[2] is a scientific workflow, data integration,[3][4] and data and analysis persistence and publishing platform that aims to make computational biology accessible to research scientists that do not have computer programming or systems administration experience. Although it was initially developed for genomics research, it is largely domain agnostic and is now used as a general bioinformatics workflow management system.[5]

YouTube Encyclopedic

  • 1/5
    Views:
    1 483
    29 395
    7 239
    9 057
    600
  • Computational Metagenomics on the Species Level
  • 7. ChIP-seq Analysis; DNA-protein Interactions
  • 9. Modeling and Discovery of Sequence Motifs
  • Dr. David Deamer - "Systems biology, synthetic biology and the origin of life"
  • Introduction to Regulatory Genomics and Epigenomics II: Motif Finding

Transcription

Contents

Functionality

Galaxy is a scientific workflow system. These systems provide a means to build multi-step computational analyses akin to a recipe. They typically provide a graphical user interface[6] for specifying what data to operate on, what steps to take, and what order to do them in.

Galaxy is also a data integration platform for biological data. It supports data uploads from the user's computer, by URL, and directly from many online resources (such as the UCSC Genome Browser, BioMart and InterMine). Galaxy supports a range of widely used biological data formats, and translation between those formats. Galaxy provides a web interface to many text manipulation utilities, enabling researchers to do their own custom reformatting and manipulation without having to do any programming. Galaxy includes interval manipulation utilities for doing set theoretic operations (e.g. intersection, union, ...) on intervals. Many biological file formats include genomic interval data (a frame of reference, e.g., chromosome or contig name, and start and stop positions), allowing these data to be integrated.

Galaxy was originally written for biological data analysis, particularly genomics. The set of available tools has been greatly expanded over the years and Galaxy is now also used for gene expression, genome assembly, proteomics, epigenomics, transcriptomics and host of other disciplines in the life sciences. The platform itself is actually domain agnostic and can be applied, in theory, to any scientific domain. For example, Galaxy servers exist for image analysis,[7] computational chemistry[8] and drug design,[9] cosmology, climate modeling, social science,[10] and linguistics.

Finally, Galaxy also supports data and analysis persistence and publishing. See Reproducibility and Transparency below.

Project Goals

Galaxy is "an open, web-based platform for performing accessible, reproducible, and transparent genomic science."[11]

Accessibility

Computational biology is a specialized domain that often requires knowledge of computer programming. Galaxy aims to give biomedical researchers access to computational biology without also requiring them to understand computer programming.[12][13] Galaxy does this by stressing a simple user interface[14] over the ability to build complex workflows. This design choice makes it relatively easy to build typical analyses, but more difficult to build complex workflows that include, for example, looping constructs. (See Apache Taverna for an example of a data-driven workflow system that supports looping.[15])

Reproducibility

Reproducibility is a key goal of science: When scientific results are published the publications should include enough information that others can repeat the experiment and get the same results. There have been many recent efforts to extend this goal from the bench (the "wet lab") to computational experiments (the "dry lab") as well. This has proved to be a more difficult task than initially expected.[16]

Galaxy supports reproducibility by capturing sufficient information about every step in a computational analysis, so that the analysis can be repeated, exactly, at any point in the future. This includes keeping track of all input, intermediate, and final datasets, as well as the parameters provided to, and the order of each step of the analysis.

Transparency

Galaxy supports transparency in scientific research by enabling researchers to share any of their Galaxy Objects either publicly, or with specific individuals. Shared items can be examined in detail, rerun at will and copied and modified to test hypotheses.

Galaxy Objects: Histories, Workflows, Datasets and Pages

Galaxy objects are anything that can be saved, persisted, and shared in Galaxy:

Histories
Histories are computational analyses (recipes) run with specified input datasets, computational steps and parameters. Histories include all intermediate and output datasets as well.
Workflows
Workflows are computational analyses that specify all the steps (and parameters) in the analysis, but none of the data. Workflows are used to run the same analysis against multiple sets of input data.
Datasets
Datasets includes any input, intermediate, or output dataset, used or produced in an analysis.
Pages
Histories, workflows and datasets can include user-provided annotation. Galaxy Pages enables the creation of a virtual paper that describes the how and why of the overall experiment. Tight integration of Pages with Histories, Workflows, and Datasets supports this goal.

Availability

Galaxy is available:

  1. As a free public web server,[17] supported by the Galaxy Project.[18] This server includes many bioinformatics tools that are widely useful in many areas of genomics research. Users can create logins, and save histories, workflows, and datasets on the server. These saved items can also be shared with others.
  2. As open-source software that can be downloaded, installed and customized to address specific needs.[19] Galaxy can be installed locally or using a computing cloud.[20]
  3. Public web servers hosted by other organizations.[21] Several organizations with their own Galaxy installation have also opted to make those servers available to others.
  4. As part of the GenomeSpace initiative.

Implementation

Galaxy is open-source software implemented using the Python programming language. It is developed by the Galaxy team[22] at Penn State, Johns Hopkins University, Oregon Health & Science University, and the Galaxy Community.[23]

Galaxy is extensible, as new command line tools can be integrated and shared within the Galaxy ToolShed.[24]

An example of extending Galaxy is Galaxy-P from the University of Minnesota Supercomputing Institute, which is customized as a data analysis platform for mass spectrometry-based proteomics.[25]

Community

Galaxy is an open source project and the community includes users, organizations that install their own instance, Galaxy developers, and bioinformatics tool developers. The Galaxy project has mailing lists,[26] a community hub,[27] and annual meetings.[28]

See also

External links

References

  1. ^ https://galaxyproject.org/admin/license/
  2. ^ Afgan, E.; Baker, D.; van den Beek, M.; Blankenberg, D.; Bouvier, D.; Čech, M.; Chilton, J.; Clements, D.; Coraor, N.; Eberhard, C.; Grüning, B.; Guerler, A.; Hillman-Jackson, J.; Von Kuster, G.; Rasche, E.; Soranzo, N.; Turaga, N.; Taylor, J.; Nekrutenko, A.; Goecks, J. (8 July 2016). "The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2016 update". Nucleic Acids Research. 44 (W1): W3–W10. PMC 4987906Freely accessible. PMID 27137889. doi:10.1093/nar/gkw343. 
  3. ^ Blankenberg, D.; Coraor, N.; Von Kuster, G.; Taylor, J.; Nekrutenko, A.; Galaxy, T. (2011). "Integrating diverse databases into an unified analysis framework: A Galaxy approach". Database. 2011: bar011. PMC 3092608Freely accessible. PMID 21531983. doi:10.1093/database/bar011. 
  4. ^ Blankenberg, D.; Gordon, A.; Von Kuster, G.; Coraor, N.; Taylor, J.; Nekrutenko, A.; Galaxy, T. (2010). "Manipulation of FASTQ data with Galaxy". Bioinformatics. 26 (14): 1783–1785. PMC 2894519Freely accessible. PMID 20562416. doi:10.1093/bioinformatics/btq281. 
  5. ^ https://galaxyproject.org/public-galaxy-servers
  6. ^ Schatz, M. C. (2010). "The missing graphical user interface for genomics". Genome Biology. 11 (8): 128–201. PMC 2945776Freely accessible. PMID 20804568. doi:10.1186/gb-2010-11-8-128. 
  7. ^ http://cloudimaging.net.au/
  8. ^ Hildebrandt, A. K.; Stöckel, D; Fischer, N. M.; de la Garza, L; Krüger, J; Nickels, S; Röttig, M; Schärfe, C; Schumann, M; Thiel, P; Lenhof, H. P.; Kohlbacher, O; Hildebrandt, A (2014). "Ballaxy: Web services for structural bioinformatics". Bioinformatics. 31 (1): 121–2. PMID 25183489. doi:10.1093/bioinformatics/btu574. 
  9. ^ http://osddlinux.osdd.net:8001/
  10. ^ http://socscicompute.ss.uci.edu/
  11. ^ Goecks, J.; Nekrutenko, A.; Taylor, J.; Galaxy Team, T. (2010). "Galaxy: A comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences". Genome Biology. 11 (8): R86. PMC 2945788Freely accessible. PMID 20738864. doi:10.1186/gb-2010-11-8-r86. 
  12. ^ Blankenberg, D.; Taylor, J.; Nekrutenko, A.; The Galaxy, T. (2011). "Making whole genome multiple alignments usable for biologists". Bioinformatics. 27 (17): 2426–8. PMC 3157923Freely accessible. PMID 21775304. doi:10.1093/bioinformatics/btr398. 
  13. ^ Blankenberg, D.; Taylor, J.; Schenck, I.; He, J.; Zhang, Y.; Ghent, M.; Veeraraghavan, N.; Albert, I.; Miller, W.; Makova, K. D.; Hardison, R. C.; Nekrutenko, A. (2007). "A framework for collaborative analysis of ENCODE data: Making large-scale analyses biologist-friendly". Genome Research. 17 (6): 960–964. PMC 1891355Freely accessible. PMID 17568012. doi:10.1101/gr.5578007. 
  14. ^ Schatz, M. C. (2010). "The missing graphical user interface for genomics". Genome Biology. 11 (8): 128–201. PMC 2945776Freely accessible. PMID 20804568. doi:10.1186/gb-2010-11-8-128. 
  15. ^ Soiland-Reyes, S (2010-12-13). "Looping". The Taverna Knowledge Blog. knowledgeblog.org. Retrieved 28 January 2015. 
  16. ^ Ioannidis, J. P. A.; Allison, D. B.; Ball, C. A.; Coulibaly, I.; Cui, X.; Culhane, A. N. C.; Falchi, M.; Furlanello, C.; Game, L.; Jurman, G.; Mangion, J.; Mehta, T.; Nitzberg, M.; Page, G. P.; Petretto, E.; Van Noort, V. (2008). "Repeatability of published microarray gene expression analyses". Nature Genetics. 41 (2): 149–155. PMID 19174838. doi:10.1038/ng.295. 
  17. ^ https://usegalaxy.org/
  18. ^ http://galaxyproject.org/
  19. ^ http://getgalaxy.org/
  20. ^ Afgan, E.; Baker, D.; Coraor, N.; Chapman, B.; Nekrutenko, A.; Taylor, J. (2010). "Galaxy CloudMan: Delivering cloud compute clusters". BMC Bioinformatics. 11: S4. PMC 3040530Freely accessible. PMID 21210983. doi:10.1186/1471-2105-11-S12-S4. 
  21. ^ https://galaxyproject.org/public-galaxy-servers
  22. ^ https://galaxyproject.org/galaxy-team
  23. ^ Lazarus, R.; Taylor, J.; Qiu, W.; Nekrutenko, A. (2008). "Toward the commoditization of translational genomic research: Design and implementation features of the Galaxy genomic workbench". Summit on translational bioinformatics. 2008: 56–60. PMC 3041519Freely accessible. PMID 21347127. 
  24. ^ Blankenberg, Daniel; Von Kuster, Gregory; Bouvier, Emil; Baker, Dannon; Afgan, Enis; Stoler, Nicholas; Taylor, James; Nekrutenko, Anton (2014). "Dissemination of scientific software with Galaxy ToolShed". Genome Biology. 15 (2): 403. PMC 4038738Freely accessible. PMID 25001293. doi:10.1186/gb4161. 
  25. ^ Sheynkman, GM; Johnson, JE; Jagtap, PD; Shortreed, MR; Onsongo, G; Frey, BL; Griffin, TJ; Smith, LM (22 August 2014). "Using Galaxy-P to leverage RNA-Seq for the discovery of novel protein variations". BMC Genomics. 15 (703): 703. PMC 4158061Freely accessible. PMID 25149441. doi:10.1186/1471-2164-15-703. 
  26. ^ https://galaxyproject.org/mailing-lists
  27. ^ https://galaxyproject.org/
  28. ^ https://galaxyproject.org/events
This page was last edited on 13 October 2017, at 20:49.
Basis of this page is in Wikipedia. Text is available under the CC BY-SA 3.0 Unported License. Non-text media are available under their specified licenses. Wikipedia® is a registered trademark of the Wikimedia Foundation, Inc. WIKI 2 is an independent company and has no affiliation with Wikimedia Foundation.