RProtoBuf: Efficient Cross-Language Data Serialization in R

View Researcher's Other Codes

Disclaimer: The provided code links for this paper are external links. Science Nest has no responsibility for the accuracy, legality or content of these links. Also, by downloading this code(s), you agree to comply with the terms of use as set out by the author(s) of the code(s).

Authors Dirk Eddelbuettel, Murray Stokely, Jeroen Ooms
Journal/Conference Name ArXiv
Paper Category
Paper Abstract Modern data collection and analysis pipelines often involve a sophisticated mix of applications written in general purpose and specialized programming languages. Many formats commonly used to import and export data between dierent programs or systems, such as CSV or JSON, are verbose, inecient, not type-safe, or tied to a specic programming language. Protocol Buers are a popular method of serializing structured data between applications|while remaining independent of programming languages or operating systems. They oer a unique combination of features, performance, and maturity that seems particulary well suited for data-driven applications and numerical computing. The RProtoBuf package provides a complete interface to Protocol Buers from the R environment for statistical computing. This paper outlines the general class of data serialization requirements for statistical computing, describes the implementation of the RProtoBuf package, and illustrates its use with example applications in large-scale data collection pipelines and web services.
Date of publication 2014
Code Programming Language R
Comment

Copyright Researcher 2021