We sometimes need to compute the most significant digits of the product ...
In recent work, Lemire (2021) presented a fast algorithm to convert numb...
Intel includes on its recent processors a powerful set of instructions c...
Bloom and cuckoo filters provide fast approximate set membership while u...
We often represent text using Unicode formats (UTF-8 and UTF-16). The UT...
In software, text is often represented using Unicode formats (UTF-8 and ...
With disks and networks providing gigabytes per second, parsing decimal ...
The integer division of a numerator n by a divisor d gives a quotient q ...
The majority of text is stored in UTF-8, which must be validated on inge...
The Bloom filter provides fast approximate set membership while using li...
In several fields such as statistics, machine learning, and bioinformati...
Many common document formats on the Internet are text-only, such as email...
JavaScript Object Notation or JSON is a ubiquitous data exchange format ...
On common processors, integer multiplication is many times faster than i...
L'Ecuyer & Simard's BigCrush statistical test suite has revealed statis...
In simulations, probabilistic algorithms and statistical tests, we often...
Apache Calcite is a foundational software framework that provides query ...
Arrays of integers are often compressed in search engines. Though there ...
Compressed bitmap indexes are used in systems such as Git or Oracle to a...
Web developers use base64 formats to include images, fonts, sounds and o...
The importance of a research article is routinely measured by counting h...
Dynamic Time Warping (DTW) is a popular similarity measure between t...
Time series are difficult to monitor, summarize and predict. Segmentatio...