GNU Gawk 5.4 Released With New MinRX Regex Matcher, Faster Reading Of Files
• Developers behind the widely used GNU Awk text processing utility today released Gawk 5.4.
• Gawk 5.4 now uses the new MinRX regular expression matcher as its default regexp engine; the old regex and DFA engines remain available.
• The new matcher was written by Mike Haertel, the original developer of GNU grep.
• Unlike the existing GNU matchers, the MinRX matcher is fully POSIX compliant.
• Separately, Gawk 5.4 should be faster at reading regular disk input files.
Article Summaries:
- GNU Awk (Gawk) version 5.4 has been released, introducing the MinRX regular-expression matcher as the default engine. Developed by Mike Haertel, MinRX is fully POSIX-compliant; the older regex and DFA engines are no longer the default but remain available. The update also removes timeout checks on regular disk input, giving roughly a 9% speed boost for large files. Additional improvements include UTF-8 support for the MinGW Windows and Cygwin ports, enhanced multi-byte character handling, updates for POSIX 2024 compliance, new build options, Arabic translations, and expanded OpenVMS support. Documentation updates address mailing-list conduct and discourage discussion of proprietary software.
Sources: