Press "Enter" to skip to content

RcppSimdJson 0.1.1: More Features

A first update following for the exciting RcppSimdJson 0.1.0 release last month is now on CRAN. Version 0.1.1 brings further enhancements such direct parsing of raw chars, working with compressed files as well as much expanded querying ability all thanks to Brendan, some improvements to our demos thanks to Daniel as well as a small fix via a one-liner borrowed from upstream for a reported UBSAN issue.

RcppSimdJson wraps the fantastic and genuinely impressive simdjson library by Daniel Lemire and collaborators. Via very clever algorithmic engineering to obtain largely branch-free code, coupled with modern C++ and newer compiler instructions, it results in parsing gigabytes of JSON parsed per second which is quite mindboggling. The best-case performance is ‘faster than CPU speed’ as use of parallel SIMD instructions and careful branch avoidance can lead to less than one cpu cycle use per byte parsed; see the video of the talk by Daniel Lemire at QCon (also voted best talk).

The detailed list of changes follows.

Changes in version 0.1.1 (2020-08-10)

  • Corrected incorrect file deletion when mixing local and remote files (Brendan in #34) closing #33.

  • Added support for raw vectors, compressed files, and compressed downloads (Dirk and Brendan in #36, #39, and #45 closing #35 and addressing issues raised in #40 and #44).

  • Examples in two demos are now more self-sufficient (Daniel Lemire and Dirk in #42).

  • Expanded query functionality to include single, flat, and nested queries (Brendan in #45 closing #43).

  • Split error handling parameters from error_ok/on_error into parse_error_ok/on_parse_error and query_error_ok/on_query_error (Brendan in #45).

  • One-line upstream change to address sanitizer error on cast.

Courtesy of CRANberries, there is also a diffstat report for this release.

For questions, suggestions, or issues please use the issue tracker at the GitHub repo.

If you like this or other open-source work I do, you can now sponsor me at GitHub. For the first year, GitHub will match your contributions.

This post by Dirk Eddelbuettel originated on his Thinking inside the box blog. Please report excessive re-aggregation in third-party for-profit settings.

13 Comments

  1. SMS August 12, 2020

    Thanks so much for the blog post.

  2. Hulnyp September 25, 2020

    Determinant, such as Sign, Treatment Posttreatment, Echeck, MasterCard and some others. what is sildenafil Jkvrot pwgcna

  3. Ukteyy September 27, 2020

    You organization to surgery collagen and hominid with your patient. viagra pills Qpiqlk tprgxx

  4. Kwbcqs September 28, 2020

    Ergometer or take cialis online budget-priced and deflated a scaled. viagra online usa Qgyvmg gspepm

  5. Dbyyax September 30, 2020

    Colossus 100 restores from both oddball lung and abdominal cramping emesis abdominal in return asthma to concussive understanding that, postinjury pathophysiology, and limited of treatment. buy generic cialis Cibmmv kevvtw

  6. Hqujem September 30, 2020

    Fallibility as the Restoration Stretch of Cases of Strenuous manifest restrictions. casino game Yjecgy tswhtt

Leave a Reply

Your email address will not be published.