R project for statistical computing

R is a free software environment for statistical computing and graphics

Technologies
c, javascript, c++, r-project, fortran
Topics
visualization, machine learning, data science, graphics, statistics
R is a free software environment for statistical computing and graphics

R is an integrated suite of software facilities for data manipulation, calculation and graphical display. It includes

  • an effective data handling and storage facility,
  • a suite of operators for calculations on arrays, in particular matrices,
  • a large, coherent, integrated collection of intermediate tools for data analysis,
  • graphical facilities for data analysis and display either on-screen or on hardcopy, and
  • a well-developed, simple and effective programming language which includes conditionals, loops, user-defined recursive functions and input and output facilities.

Many users think of R as a statistics system. We prefer to think of it of an environment within which statistical techniques are implemented. R can be extended (easily) via packages. There are about eight packages supplied with the R distribution and many more are available through the CRAN family of Internet sites covering a very wide range of modern statistics.

The term “environment” is intended to characterize it as a fully planned and coherent system, rather than an incremental accretion of very specific and inflexible tools, as is frequently the case with other data analysis software.

R, like S, is designed around a true computer language, and it allows users to add additional functionality by defining new functions. Much of the system is itself written in the R dialect of S, which makes it easy for users to follow the algorithmic choices made. For computationally-intensive tasks, C, C++ and Fortran code can be linked and called at run time. Advanced users can write C code to manipulate R objects directly.

R has its own LaTeX-like documentation format, which is used to supply comprehensive documentation, both on-line in a number of formats and in hardcopy.

2018 Program

Successful Projects

Contributor
Dries Cornilly
Mentor
Brian Peterson, kboudt, Peter Carl
Organization
R project for statistical computing
rTrawl
Trawl processes are continuous-time processes exhibiting autocorrelation. They are determined by a LĂ©vy seed and trawl function, which can be viewed...
Contributor
Luis Damiano
Mentor
Brian Peterson, Michael Weylandt
Organization
R project for statistical computing
Full Bayesian Inference for Hidden Markov Models
We create an R Package to run full Bayesian inference on Hidden Markov Models (HMM) using the probabilistic programming language Stan. By providing...
Contributor
Paul
Mentor
Robin Kohze, Dr. Samuel Schmidt, Bert Jehoul
Organization
R project for statistical computing
Firedata - Implementing Web Functionalities for Shiny and OpenCPU
This project aims to integrate Firedata into a wide array of statistical web applications. We thereby not only add additional authorization functions...
Contributor
Thiloshon Nagarajah
Mentor
Yohay Carmel, Vijay Barve, Tomer Gueta
Organization
R project for statistical computing
bdclean: User friendly biodiversity data cleaning pipeline
Until recently, biodiversity data was scattered in different formats in natural history collections, survey reports, and in literature. In the last...
Contributor
Apostolos Chalkis
Mentor
Zaf, Vissarion Fisikopoulos
Organization
R project for statistical computing
Efficient R tools for geometrical statistics
Volume computation of convex polytopes and sampling algorithms are very useful in many scientific fields and applications. The VolEsti is a C++...
Contributor
vivekktiwari
Mentor
Toby Hocking, Faizan
Organization
R project for statistical computing
Animint2 Designer Manual
Animint2 is a re-write of Animint which is an R package for making interactive animated data visualization on the web using ggplot syntax and two new...
Contributor
Parismita Das
Mentor
Torsten Hothorn, Alexandre Drouin
Organization
R project for statistical computing
Max Margin Interval Trees
There are few R packages available for interval regression, a machine learning problem which is important in genomics and medicine. Like usual...
Contributor
Tim Yu
Mentor
Jialin Ma, Eric Weitz, Freeman Wang
Organization
R project for statistical computing
R Interface to Ideogram.js Library
This project aims to provide an R interface to Ideogram.js, integrate it with bioconductor infrastructures and possibly provide an interactive...
Contributor
Ivan Pavlov
Mentor
James Balamuta, Dirk Eddelbuettel
Organization
R project for statistical computing
Extending 'rvw' and reintegrating vowpal wabbit
Vowpal Wabbit is an online machine learning system that is known for its speed and scalability and is widely used in research and industry. The goal...
Contributor
Yanbo Xu
Mentor
Xingguo, Yanxun Xu
Organization
R project for statistical computing
Bayesian analysis of individualized treatment response curves on EHR time series
With the fast adoption of Electronic Health Records (EHR) in modern healthcare systems, various machine learning methods are developed to conduct...
Contributor
Xin Chen
Mentor
Jialin Ma, isuruf
Organization
R project for statistical computing
Fast Symbolic Computation in R with SymEngine
A computer algebra system(CAS) is a useful tool for researchers and scientists. Some tools exists for algebra compution are either few-featured or...
Contributor
Gregory Brownson
Mentor
Matias Salibian Barrera, Doug Martin
Organization
R project for statistical computing
A Shiny User Interface to RobStatTM
Project Summary The goal of this project is to develop a point-and-click graphical user interface (GUI) for the RobStatTM package. Both the UI and...
Contributor
Changcheng Li
Mentor
nashjc, Hans W. Borchers
Organization
R project for statistical computing
Automatic Differentiation in R through Julia
Automatic differentiation (AD) is a set of techniques to calculate derivatives automatically. It generally outperforms non-AD methods like symbolic...
Contributor
Povilas Gibas
Mentor
Yohay Carmel, Vijay Barve, Tomer Gueta
Organization
R project for statistical computing
Darwinazing biodiversity data in R
Darwin Core (DwC) is a standard maintained by the Darwin Core maintenance group. It includes a glossary of terms (in other contexts these might be...
Contributor
Jiasheng Zhu
Mentor
Robin Kohze, Dr. Samuel Schmidt
Organization
R project for statistical computing
Firedata - Enabling easy cloud stats through Firestore
This project will integrate with Google Cloud Firestore and also update the current Firedata package with the newest Firebase APIs and features.
Contributor
Andrew Connell
Mentor
Rebecca Killick, David Matteson
Organization
R project for statistical computing
changepoint.online
There are many R packages available for offline changepoint detection but, to the knowledge of myself and the mentors, only one for online change...
Contributor
Ashwin Agrawal
Mentor
Yohay Carmel, Vijay Barve, Tomer Gueta
Organization
R project for statistical computing
Biodiversity Data Utilities
The aim of the project is to improve the current functionality of existing data management and cleaning packages for Biodiversity in R and integrate...
Contributor
Wenjing Wang
Mentor
kboudt, Di Cook
Organization
R project for statistical computing
Diagnostic statistics and visualization for quantile regression
This project aims to extend diagnostic statistics in the R package quokar. Currently in this package we have several methods such as absolute...
Contributor
Johan Larsson
Mentor
Michael Weylandt, Toby Hocking
Organization
R project for statistical computing
Fast Sparse Linear Models for Big Data with SAGA
There are many alternatives for L1-regularized generalized linear models in R, but none that utilizes the efficient SAGA algorithm despite its...
Contributor
Yuze Zhou
Mentor
Jun Yan, Yixuan Qiu
Organization
R project for statistical computing
GEE and QIF for clustered data regression
This project aims at developing a new R package for clustered data regression. Details include: Rewrite basic GEE method based on Rcpp and...
Contributor
Ignasi Montero Serra
Mentor
Katie Kaplan, Eneko Aspillaga, Narayani Barve, Vijay Barve
Organization
R project for statistical computing
rOceans: an R Package for integrating spatial trends in biodiversity, human stressors, and conservation efforts
rOceans will be an R package that serves as a platform for integrating multiple spatial datasets on marine biodiversity, human-driven stressors, and...
Contributor
Haoming Jiang
Mentor
Xingguo, Tuo Zhao, Jason Ge
Organization
R project for statistical computing
A Major Update for HUGE and SAM
With the recent progress in the theoretical field of sparse learning problems, current R packages are lagging behind the cutting edge research. We...
Contributor
Marlon E. Cobos
Mentor
Narayani Barve, Vijay Barve, Alberto Jiménez Valverde
Organization
R project for statistical computing
Species range maps in R
The species range maps project is motivated by the importance of information about species distribution for processes of conservation planning and...
Contributor
Anthony_AC
Mentor
Brian Peterson, xchen, Ruben Zamar, Peter Carl, Doug Martin
Organization
R project for statistical computing
Performance Analytics Standard Errors
The current finance industry practice in reporting risk and performance measure estimates of assets and portfolios does not typically include...