R Project for Statistical Computing

R is a free software environment for statistical computing and graphics

Technologies
c, javascript, c++, r-project, fortran
Topics
machine learning, data visualization, data science, graphics, statistics
R is a free software environment for statistical computing and graphics

R is an integrated suite of software facilities for data manipulation, calculation and graphical display. It includes

  • an effective data handling and storage facility,
  • a suite of operators for calculations on arrays, in particular matrices,
  • a large, coherent, integrated collection of intermediate tools for data analysis,
  • graphical facilities for data analysis and display either on-screen or on hardcopy, and
  • a well-developed, simple and effective programming language which includes conditionals, loops, user-defined recursive functions and input and output facilities.

The term “environment” is intended to characterize it as a fully planned and coherent system, rather than an incremental accretion of very specific and inflexible tools, as is frequently the case with other data analysis software.

R, like S, is designed around a true computer language, and it allows users to add additional functionality by defining new functions. Much of the system is itself written in the R dialect of S, which makes it easy for users to follow the algorithmic choices made. For computationally-intensive tasks, C, C++ and Fortran code can be linked and called at run time. Advanced users can write C code to manipulate R objects directly.

Many users think of R as a statistics system. We prefer to think of it of an environment within which statistical techniques are implemented. R can be extended (easily) via packages. There are about eight packages supplied with the R distribution and many more are available through the CRAN family of Internet sites covering a very wide range of modern statistics.

R has its own LaTeX-like documentation format, which is used to supply comprehensive documentation, both on-line in a number of formats and in hardcopy.

2020 Program

Successful Projects

Contributor
Martynas Jočys
Mentor
Thiloshon Nagarajah, Povilas Gibas, Tomer Gueta, Vijay Barve
Organization
R Project for Statistical Computing
Enhancing bdchecks: a biodiversity data quality checks system in R
bdchecks has the potential to centralize the effort to develop a sustainable infrastructure for biodiversity data quality checks in R. This will...
Contributor
Emil Sjørup
Mentor
Kris Boudt, Nabil Bouamara, Onno Kleen
Organization
R Project for Statistical Computing
Expanding the highfrequency package
The highfrequency package is the go-to package for the analysis of intraday price data. The package was created as a merger of the packages RTAQ and...
Contributor
Rahul Saxena
Mentor
jcrodriguez, Mauricio "Pachá" Vargas
Organization
R Project for Statistical Computing
rco: The R Code Optimizer R Project for Statistical Computing-GSoC 2020
This project aims to further bolster the rco (The R Code Optimizer) package and make it a one stop solution for rendering efficient R code. By...
Contributor
Sanchit Saini
Mentor
Michael Lawrence, Toby Hocking
Organization
R Project for Statistical Computing
rtracklayer improvements
rtracklayer is an Extensible framework for interacting with multiple genome browsers. The project adds the following new features for rtracklayer. ...
Contributor
Akarsh Goyal
Mentor
Jonas Wallin, Johan Larsson
Organization
R Project for Statistical Computing
Better solvers for SLOPE package
SLOPE package offers implementations that solve the Sorted L-One Penalized Estimation (SLOPE) model for various objective functions. However when...
Contributor
Shehab Hosny
Mentor
Gergely Daroczi, Neal Fultz
Organization
R Project for Statistical Computing
gRPC
Implement Authentication, Security and Streaming in gRPC for R Package
Contributor
Ayush
Mentor
Dootika Vats, Adam Maidman
Organization
R Project for Statistical Computing
QBLD - Quantile Regression for Binary Longitudinal Data
This project follows Rahman and Vossmeyer (2019) as its motivating literature, and contributes to the three literatures by extending the various...
Contributor
Divyansh Chawla
Mentor
Marlon E. Cobos, Thiloshon Nagarajah, Narayani Barve, Vijay Barve
Organization
R Project for Statistical Computing
rsqliteadmin : sqlite admin tool in R shiny
R currently supports working on SQLite databases through the RSQLite package. SQLite is a RDBMS which is light and highly efficient for small scale...
Contributor
Erick Oduniyi
Mentor
Claudia Beleites, Roman Kiselev, Bryan Hanson
Organization
R Project for Statistical Computing
Fortification of the hyperSpec R Package
The hyperSpec (http://hyperspec.r-forge.r-project.org/) package allows R users a suite of utilities for manipulating spectroscopic data. These...
Contributor
Chen Liang
Mentor
Ethan Fang, Tuo Zhao
Organization
R Project for Statistical Computing
mAED: Multi-Stage Adaptive Enrichment Design in R
The average cost of trials in the United States is up to $19.6 million for a Phase 2 trial and $52.9 million for a Phase 3 trial. Optimizing the...
Contributor
Akshit Achara
Mentor
Lars Kotthoff, Hans W. Borchers
Organization
R Project for Statistical Computing
MiniZinc Interface for R
MiniZinc is a free and open-source constraint modeling language. Constraint satisfaction and discrete optimization problems can be formulated in a...
Contributor
Sunny Dhoke
Mentor
Thiloshon Nagarajah, Tomer Gueta, Vijay Barve
Organization
R Project for Statistical Computing
The bdverse's development and QA frameworks
The project idea is to develop a robust QA framework for bdverse. This shall consists of developing git strategies, developing bdtests...
Contributor
Claudia Nuñez Penichet
Mentor
Narayani Barve, Tomer Gueta, Vijay Barve
Organization
R Project for Statistical Computing
Biological Survey Planning Considering Hutchinson’s Duality
One of the challenges in biodiversity conservation is to complete an inventory of existing species in the world. Although various developed countries...
Contributor
Julian Stanley
Mentor
Toby Hocking, Guillem Rigaill
Organization
R Project for Statistical Computing
Constrained changepoint GUI
A Shiny web application for the gfpop R package. [...] Detecting sudden changes in data is important to a variety of fields. For example,...
Contributor
lazycipher
Mentor
Faizan F. Khan, Toby Hocking
Organization
R Project for Statistical Computing
Animated interactive ggplots
The goal of this GSOC project is to implement new features for animint2 in order to make it possible to do more kinds of interactive data...
Contributor
Anirban Chetia
Mentor
Toby Hocking, Neeraj Dhanraj Bokde
Organization
R Project for Statistical Computing
Asymptotic complexity testing framework/package
R package developers currently use ad-hoc tests of asymptotic computational complexity via empirical timings of functions and visual diagnostic...
Contributor
Echo Liu
Mentor
alexiosg, Daniel Hanson, R. Douglas Martin
Organization
R Project for Statistical Computing
A package for robust-GARCH model
In the R-language, many packages exist for the estimation and forecasting of GARCH processes, including fGarch and rugarch. However, none, to our...
Contributor
Vito Lestingi
Mentor
Justin M. Shea, Erol Biceroglu, Jasen Mackie, Brian G. Peterson, Peter Carl
Organization
R Project for Statistical Computing
Modeling Expected Returns with R
The main goal of the project is to reproduce selected key findings from the empirical asset pricing literature and related investment practices....
Contributor
Rahul Chauhan
Mentor
Thiloshon Nagarajah, Tomer Gueta, Vijay Barve
Organization
R Project for Statistical Computing
bddashboard: Interactive Biodiversity Data Dashboard
The bdverse is a family of R packages that allow users to conveniently employ R, for biodiversity data exploration, quality assessment (QA), data...
Contributor
Salsabila Mahdi
Mentor
Christophe Dutang, Patrice KIENER, John Nash
Organization
R Project for Statistical Computing
Validation of Neural Network Packages
The purpose of this GSoC project is to validate neural network packages that perform regression. It is a follow-up of a GSoC 2019 project in which we...