Software

Python packages

CSVMeta

Lightweight csv read/write, keeping track of csv dialect and other metadata.

pip install csvmeta

Streamlit-Survey

Survey components for Streamlit apps.

pip install streamlit-survey

ER-Evaluation

An End-to-End Evaluation Framework for Entity Resolution Systems.

pip install er-evaluation

PatentsView-Evaluation

Python package for the evaluation and benchmarking of PatentsView disambiguation algorithms.

pip install pv-evaluation

StringCompare

StringCompare is a Python package (implemented in C++ through pybind11) for efficient string similarity computation and approximate string matching.

pip install git+https://github.com/OlivierBinette/stringcompare.git

🚧 GroupByRule

Deduplicate data using fuzzy and deterministic matching rules. This is under construction.

pip install git+https://github.com/OlivierBinette/groupbyrule.git

R packages

FingermatchR

Fingerprint matching tools based on NIST’s Biometric Image Software, on FingerJet minutiae extraction tool, and on the libfmr library.

devtools::install_github("forensic-science/fingermatchR")

MSETools

Code and analyses for the paper titled “On the Reliability of Multiple Systems Estimation for the Quantification of Modern Slavery” (Binette and Steorts, 2021).

devtools::install_github("OlivierBinette/MSETools")

cache

CRAN_Status_Badge

Simple interface to caching which works across interactive R sessions, R scripts and Rmarkdown documents.

install.packages("cache")

assert

CRAN_Status_Badge

Lightweight validation tool for checking function arguments and data analysis scripts.

install.packages("assert")

dgaFast

Multiple Systems Estimation Using Decomposable Graphical Models. This is an efficient re-implementation and extension of the dga R package (it is now part of dga).

devtools::install_github("OlivierBinette/dgaFast")

TessTools

Tools for the use of Tesseract OCR in R and for the analysis of historical newspaper archives.

devtools::install_github("OlivierBinette/TessTools")

splinit

Periodic spline regression and closed curve reconstruction.

devtools::install_github("OlivierBinette/splinit")

pretty

Better baser plots in R.

devtools::install_github("OlivierBinette/pretty")

Javascript apps

Fractals

High resolution visualization for the Mandelbrot set. A Java version with more features is also available.

Earthquakes

Visualize earthquakes on the globe.

Other software