Collective Knowledge (software)

The Collective Knowledge (CK) project is an open-source framework and repository to enable collaborative, reproducible and sustainable research and development of complex computational systems.[2] CK is a small, portable, customizable and decentralized infrastructure helping researchers and practitioners:

  • share their code, data and models as reusable Python components and automation actions[3] with unified JSON API, JSON meta information, and a UID based on FAIR principles[2]
  • assemble portable workflows from shared components (such as multi-objective autotuning and Design space exploration[4])
  • automate, crowdsource and reproduce benchmarking of complex computational systems[5]
  • unify predictive analytics (scikit-learn, R, DNN)
  • enable reproducible and interactive papers[6]
Collective Knowledge (CK)
Developer(s)Grigori Fursin and the cTuning foundation
Initial release2015 (2015)
Stable release
2.6.3 (discontinued for the new Collective Mind framework[1]) / November 30, 2022 (2022-11-30)
Written inPython
Operating systemLinux, Mac OS X, Microsoft Windows, Android
TypeKnowledge management, FAIR data, MLOps, Data management, Artifact Evaluation, Package management system, Scientific workflow system, DevOps, Continuous integration, Reproducibility
LicenseApache License for version 2.0 and BSD License 3-clause for version 1.0
Websitegithub.com/ctuning/ck, cknow.io

Notable usages

  • ARM uses CK to accelerate computer engineering[7]
  • Several ACM-sponsored conferences use CK to automate the Artifact Evaluation process[8][9]
  • Imperial College (London) uses CK to automate and crowdsource compiler bug detection[10]
  • Researchers from the University of Cambridge used CK to help the community reproduce results of their publication in the International Symposium on Code Generation and Optimization (CGO'17) during Artifact Evaluation[11]
  • General Motors (USA) uses CK to crowd-benchmark convolutional neural network optimizations [12][13]
  • The Raspberry Pi Foundation and the cTuning foundation released a CK workflow with a reproducible "live" paper to enable collaborative research into multi-objective autotuning and machine learning techniques[4]
  • IBM uses CK to reproduce quantum results from nature[14]
  • CK is used to automate MLPerf benchmark[15][16]

Portable package manager for portable workflows

CK has an integrated cross-platform package manager with Python scripts, JSON API and JSON meta-description to automatically rebuild software environment on a user machine required to run a given research workflow.[17]

Reproducibility of experiments

CK enables reproducibility of experimental results via community involvement similar to Wikipedia and physics. Whenever a new workflow with all components is shared via GitHub, anyone can try it on a different machine, with different environment and using slightly different choices (compilers, libraries, data sets). Whenever an unexpected or wrong behavior is encountered, the community explains it, fixes components and shares them back as described in.[4]

References

  • Development site:
  • Documentation:
  • Public repository with crowdsourced experiments:
  • International Workshop on Adaptive Self-tuning Computing System (ADAPT) uses CK to enable public reviewing of publications and artifacts via Reddit:
This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.