Rxivist logo

ClusterEnG: An interactive educational web resource for clustering big data

By Mohith Manjunath, Yi Zhang, Steve H. Yeo, Omar Sobh, Nathan Russell, Christian Followell, Colleen Bushell, Umberto Ravaioli, Jun S. Song

Posted 27 Mar 2017
bioRxiv DOI: 10.1101/120915 (published DOI: 10.7717/peerj-cs.155)

Clustering is one of the most common techniques used in data analysis to discover hidden structures by grouping together data points that are similar in some measure into clusters. Although there are many programs available for performing clustering, a single web resource that provides both state-of-the-art clustering methods and interactive visualizations is lacking. ClusterEnG (acronym for Clustering Engine for Genomics) provides an interface for clustering big data and interactive visualizations including 3D views, cluster selection and zoom features. ClusterEnG also aims at educating the user about the similarities and differences between various clustering algorithms and provides clustering tutorials that demonstrate potential pitfalls of each algorithm. The web resource will be particularly useful to scientists who are not conversant with computing but want to understand the structure of their data in an intuitive manner. ClusterEnG is part of a bigger project called KnowEnG (Knowledge Engine for Genomics) and is available at http://education.knoweng.org/clustereng.

Download data

  • Downloaded 552 times
  • Download rankings, all-time:
    • Site-wide: 38,727 out of 118,553
    • In bioinformatics: 4,310 out of 9,592
  • Year to date:
    • Site-wide: 99,496 out of 118,553
  • Since beginning of last month:
    • Site-wide: 74,477 out of 118,553

Altmetric data

Downloads over time

Distribution of downloads per paper, site-wide


Sign up for the Rxivist weekly newsletter! (Click here for more details.)