R is a free software environment for statistical computing and graphics

R is an integrated suite of software facilities for data manipulation, calculation and graphical display. It includes

  • an effective data handling and storage facility,
  • a suite of operators for calculations on arrays, in particular matrices,
  • a large, coherent, integrated collection of intermediate tools for data analysis,
  • graphical facilities for data analysis and display either on-screen or on hardcopy, and
  • a well-developed, simple and effective programming language which includes conditionals, loops, user-defined recursive functions and input and output facilities.
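As a rough illustration of the facilities listed above, the following minimal sketch uses only base R. The names `m`, `prod_m`, `fact` and `col_sums` are made up for this example and do not come from the R documentation.

```r
# A 2x2 matrix, filled column by column: columns are (1, 2) and (3, 4)
m <- matrix(c(1, 2, 3, 4), nrow = 2)

# Matrix operators: %*% is the matrix product, t() is the transpose
prod_m <- m %*% t(m)

# A user-defined recursive function using a conditional
fact <- function(n) {
  if (n <= 1) 1 else n * fact(n - 1)
}

# A loop that accumulates the sum of each column of m
col_sums <- numeric(ncol(m))
for (j in seq_len(ncol(m))) {
  col_sums[j] <- sum(m[, j])
}

# Output facilities: print the results to the console
print(prod_m)
print(fact(5))
print(col_sums)
```

In everyday use the loop would normally be replaced by the built-in `colSums(m)`; it is written out here only to show the loop construct.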

Many users think of R as a statistics system. We prefer to think of it as an environment within which statistical techniques are implemented. R can be extended (easily) via packages. There are about eight packages supplied with the R distribution and many more are available through the CRAN family of Internet sites covering a very wide range of modern statistics.

The term “environment” is intended to characterize it as a fully planned and coherent system, rather than an incremental accretion of very specific and inflexible tools, as is frequently the case with other data analysis software.

R, like S, is designed around a true computer language, and it allows users to add functionality by defining new functions. Much of the system is itself written in the R dialect of S, which makes it easy for users to follow the algorithmic choices made. For computationally intensive tasks, C, C++ and Fortran code can be linked and called at run time. Advanced users can write C code to manipulate R objects directly.
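A minimal sketch of what adding functionality by defining a new function can look like. The helper `coef_var` is hypothetical and is not part of R; `sd` and `mean` are base R functions.

```r
# Hypothetical helper (not part of R): coefficient of variation,
# i.e. the sample standard deviation divided by the mean
coef_var <- function(x, na.rm = FALSE) {
  sd(x, na.rm = na.rm) / mean(x, na.rm = na.rm)
}

x <- c(2, 4, 4, 4, 5, 5, 7, 9)
cv <- coef_var(x)
print(cv)

# Functions that are themselves written in R can be inspected:
# printing a function shows its source code
print(sd)
```

Once defined, `coef_var` is called exactly like a built-in function, and printing `sd` shows how its R source can be read directly.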

R has its own LaTeX-like documentation format, which is used to supply comprehensive documentation, both on-line in a number of formats and in hardcopy.


  • r-project
  • c
  • c++
  • javascript
  • fortran



R project for statistical computing 2018 Projects

  • Haoming Jiang
    A Major Update for HUGE and SAM
    With the recent progress in the theoretical field of sparse learning problems, current R packages are lagging behind the cutting edge research. We...
  • Gregory Brownson
    A Shiny User Interface to RobStatTM
    Project Summary The goal of this project is to develop a point-and-click graphical user interface (GUI) for the RobStatTM package. Both the UI and...
  • vivekktiwari
    Animint2 Designer Manual
    Animint2 is a rewrite of Animint, an R package for making interactive animated data visualizations on the web using ggplot syntax and two new...
  • Changcheng Li
    Automatic Differentiation in R through Julia
    Automatic differentiation (AD) is a set of techniques to calculate derivatives automatically. It generally outperforms non-AD methods like symbolic...
  • Yanbo Xu
    Bayesian analysis of individualized treatment response curves on EHR time series
    With the fast adoption of Electronic Health Records (EHR) in modern healthcare systems, various machine learning methods are developed to conduct...
  • Thiloshon Nagarajah
    bdclean: User friendly biodiversity data cleaning pipeline
    Until recently, biodiversity data was scattered in different formats in natural history collections, survey reports, and in literature. In the last...
  • Ashwin Agrawal
    Biodiversity Data Utilities
    The aim of the project is to improve the current functionality of existing data management and cleaning packages for Biodiversity in R and integrate...
  • Andrew Connell
    There are many R packages available for offline changepoint detection but, to the knowledge of myself and the mentors, only one for online change...
  • Povilas Gibas
    Darwinazing biodiversity data in R
    Darwin Core (DwC) is a standard maintained by the Darwin Core maintenance group. It includes a glossary of terms (in other contexts these might be...
  • Wenjing Wang
    Diagnostic statistics and visualization for quantile regression
    This project aims to extend diagnostic statistics in the R package quokar. Currently in this package we have several methods such as absolute...
  • Apostolos Chalkis
    Efficient R tools for geometrical statistics
    Volume computation of convex polytopes and sampling algorithms are very useful in many scientific fields and applications. VolEsti is a C++...
  • Ivan Pavlov
    Extending 'rvw' and reintegrating vowpal wabbit
    Vowpal Wabbit is an online machine learning system that is known for its speed and scalability and is widely used in research and industry. The goal...
  • Johan Larsson
    Fast Sparse Linear Models for Big Data with SAGA
    There are many alternatives for L1-regularized generalized linear models in R, but none that utilizes the efficient SAGA algorithm despite its...
  • Xin Chen
    Fast Symbolic Computation in R with SymEngine
    A computer algebra system (CAS) is a useful tool for researchers and scientists. Some existing tools for algebraic computation are either few-featured or...
  • Jiasheng Zhu
    Firedata - Enabling easy cloud stats through Firestore
    This project will integrate with Google Cloud Firestore and also update the current Firedata package with the newest Firebase APIs and features.
  • Paul
    Firedata - Implementing Web Functionalities for Shiny and OpenCPU
    This project aims to integrate Firedata into a wide array of statistical web applications. We thereby not only add additional authorization functions...
  • Luis Damiano
    Full Bayesian Inference for Hidden Markov Models
    We create an R Package to run full Bayesian inference on Hidden Markov Models (HMM) using the probabilistic programming language Stan. By providing...
  • Yuze Zhou
    GEE and QIF for clustered data regression
    This project aims at developing a new R package for clustered data regression. Details include: Rewrite basic GEE method based on Rcpp and...
  • Parismita Das
    Max Margin Interval Trees
    There are few R packages available for interval regression, a machine learning problem which is important in genomics and medicine. Like usual...
  • Anthony_AC
    Performance Analytics Standard Errors
    The current finance industry practice in reporting risk and performance measure estimates of assets and portfolios does not typically include...
  • Tim Yu
    R Interface to Ideogram.js Library
    This project aims to provide an R interface to Ideogram.js, integrate it with bioconductor infrastructures and possibly provide an interactive...
  • Ignasi Montero Serra
    rOceans: an R Package for integrating spatial trends in biodiversity, human stressors, and conservation efforts
    rOceans will be an R package that serves as a platform for integrating multiple spatial datasets on marine biodiversity, human-driven stressors, and...
  • Trawl processes are continuous-time processes exhibiting autocorrelation. They are determined by a Lévy seed and trawl function, which can be viewed...
  • Marlon E. Cobos
    Species range maps in R
    The species range maps project is motivated by the importance of information about species distribution for processes of conservation planning and...