Package: bigReg 0.1.2

bigReg: Generalized Linear Models (GLM) for Large Data Sets

Fits generalized linear models (GLMs) to very large data sets. The package provides data_frame and data_matrix objects that store large data on disk; data can be created with the data_frame() function and appended to the object with object$append(data). The data is stored as doubles in binary format. Character columns are converted to factors and stored as numeric (binary) data, with a look-up table kept in a separate .meta_data file in the same folder. Because the data is stored in blocks, the GLM fitting algorithm is modified to work in a MapReduce-like fashion. The functions bglm(), summary() and bglm_predict() are available for fitting and post-processing models. The library requires Armadillo to be installed on your system. It probably won't work on Windows, since multi-core processing uses mclapply(), which forks R on Unix/Linux-type operating systems.
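
To make the MapReduce-like idea concrete, here is a minimal sketch in base R of block-wise iteratively reweighted least squares for a Poisson model. It does not use bigReg's API and is not the package's implementation: the names map_block and reduce_blocks, the simulated data, and the in-memory row blocks standing in for on-disk blocks are all assumptions made for illustration.

```r
# Sketch only: block-wise IRLS for a Poisson GLM in base R.
# map_block() and reduce_blocks() are hypothetical names, not bigReg functions.
set.seed(1)
n <- 1000
x <- rnorm(n)
y <- rpois(n, lambda = exp(0.5 + 0.3 * x))
X <- cbind(1, x)

# Split row indices into 4 blocks, standing in for blocks stored on disk.
blocks <- split(seq_len(n), rep(1:4, length.out = n))

# "Map" step: one block's contribution to X'WX and X'Wz at the current beta.
map_block <- function(idx, beta) {
  Xb  <- X[idx, , drop = FALSE]
  eta <- drop(Xb %*% beta)
  mu  <- exp(eta)                    # inverse of the log link
  w   <- mu                          # IRLS weights for Poisson with log link
  z   <- eta + (y[idx] - mu) / mu    # working response
  list(XWX = crossprod(Xb, w * Xb), XWz = crossprod(Xb, w * z))
}

# "Reduce" step: sum the per-block pieces into the full-data quantities.
reduce_blocks <- function(parts) {
  list(XWX = Reduce("+", lapply(parts, function(p) p$XWX)),
       XWz = Reduce("+", lapply(parts, function(p) p$XWz)))
}

# Iterate: map over blocks, reduce, solve the weighted normal equations.
beta <- c(log(mean(y)), 0)
for (iter in 1:25) {
  total     <- reduce_blocks(lapply(blocks, map_block, beta = beta))
  beta_new  <- drop(solve(total$XWX, total$XWz))
  converged <- max(abs(beta_new - beta)) < 1e-10
  beta      <- beta_new
  if (converged) break
}

# Compare with the in-memory fit from stats::glm().
fit <- glm(y ~ x, family = poisson())
print(cbind(blockwise = beta, glm = coef(fit)))
```

Only the small matrices X'WX and X'Wz leave each block, so no step needs the full data in memory. On a Unix-like system the lapply() over blocks could be replaced with parallel::mclapply(), which matches the forking approach the description mentions.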

Authors: Chibisi Chima-Okereke

bigReg_0.1.2.tar.gz

bigReg_0.1.2.tgz (r-4.4-x86_64), bigReg_0.1.2.tgz (r-4.4-arm64), bigReg_0.1.2.tgz (r-4.3-x86_64), bigReg_0.1.2.tgz (r-4.3-arm64)
bigReg_0.1.2.tar.gz (r-4.5-noble), bigReg_0.1.2.tar.gz (r-4.4-noble)
bigReg_0.1.2.tgz (r-4.4-emscripten), bigReg_0.1.2.tgz (r-4.3-emscripten)
bigReg.pdf | bigReg.html
bigReg/json (API)

# Install 'bigReg' in R:
install.packages('bigReg', repos = c('https://activeanalytics.r-universe.dev', 'https://cloud.r-project.org'))

Bug tracker: https://github.com/activeanalytics/bigreg/issues

Uses libs:
  • openblas – Optimized BLAS
  • c++ – GNU Standard C++ Library v3
Datasets:
  • plasma - Plasma data from the HSAUR package

big-data, data-frame, generalized-linear-models

2.00 score, 1 star, 3 scripts, 115 downloads, 41 exports, 4 dependencies

Last updated 8 years ago from: 73d725d2cd. Checks: OK: 3, WARNING: 3. Indexed: yes.

Target | Result | Date
Doc / Vignettes | OK | Nov 05 2024
R-4.5-linux-x86_64 | WARNING | Nov 05 2024
R-4.4-mac-x86_64 | WARNING | Nov 05 2024
R-4.4-mac-aarch64 | WARNING | Nov 05 2024
R-4.3-mac-x86_64 | OK | Nov 05 2024
R-4.3-mac-aarch64 | OK | Nov 05 2024

Exports: asInteger, bglm, bglm_predict, binomial_, blm, CreateFactor, data_frame, data_matrix, family_, Gamma_, gaussian_, inverse.gaussian_, load_data_frame, load_data_matrix, myIn, mySeq, poisson_, print.bglm, print.blm, print.data_frame, print.data_matrix, print.summary.bglm, print.summary.blm, process_bglm_block, quasi_, quasibinomial_, quasipoisson_, r_bind, read_df_block, read_df_blocks, read_matrix_block, read_matrix_blocks, readNumericVector, sum_bglm_block, summary.bglm, summary.blm, SVD, write_numeric_vector, writeNumericVector, XWXMatrix, XWXMatrixW

Dependencies: MASS, Rcpp, RcppArmadillo, uuid

Readme and manuals

Help Manual

Help page | Topics
converts numeric vector to integer | asInteger
Function to carry out generalized linear regression on a data_frame data object | bglm
predict function for bglm object | bglm_predict
binomial family function | binomial_
Function to carry out linear regression on a data_frame data object | blm
creates factor from numeric vector and character vector as levels | CreateFactor
function to create a data_frame object | data_frame
function to create a data_frame object | data_matrix
family function | family_
Gamma family function | Gamma_
gaussian family function | gaussian_
inverse.gaussian family function | inverse.gaussian_
function to load data_frame object | load_data_frame
function to load data_frame object | load_data_matrix
finds whether x is in y | myIn
mySeq function to sequence integers | mySeq
plasma data from the HSAUR package | plasma
poisson family function | poisson_
print function for the bglm object | print.bglm
print function for the blm object | print.blm
print function for a data_frame | print.data_frame
print function for a data_matrix | print.data_matrix
Function to print the summary object from the bglm object | print.summary.bglm
Function to print the summary object from the blm object | print.summary.blm
Function to print the summary object from the blm object | process_bglm_block
quasi family function | quasi_
quasibinomial family function | quasibinomial_
quasipoisson family function | quasipoisson_
row binding for benchmarking ... | r_bind
read data frame block from file | read_df_block
read multiple blocks of data frames from file | read_df_blocks
read matrix block from file | read_matrix_block
read matrix blocks from file | read_matrix_blocks
reads numeric vector to file | readNumericVector
The reduction function for the algorithm | sum_bglm_block
summary function for the bglm object | summary.bglm
summary function for the blm object | summary.blm
Singular value decomposition of the aggregated list from XWXMatrix(W) functions | SVD
writes numeric vector to file | write_numeric_vector
writes numeric vector to file | writeNumericVector
Calculation of iterative regression components | XWXMatrix
Calculation of iterative regression components | XWXMatrixW