Some Dubious and Pragmatic Benchmarks

This repo contains code for evaluating languages to use for string heavy programs (bioinformatics).

The benchmarks are not trying to find the fastest way to solve the problem of count the number of fields in the file, they are just meant to demonstrate the following:

Scriptability - How easy is it to read from stdin and split lines?
IO - Is there something inherently slow about how the language deals with IO?
String Ops - Are the builtins for doing string manipulations efficient, or do they just look pretty?
Lightweight Classes - can we cheaply create a class to hold data?
Array and String allocations - Does the compiler / interpreter optimize allocations?
(Not yet done) C binding - How easy is it to use a c library?
(Not yet done) C interop - Is there a large cost to interacting with C libraries?

Psuedocode

Input is a tab delimted file. The Python script may be easier to read :)

object Record:
    name: the first field in the line
    count: the number of fields that have 'bc' between chars 1:4

def create_record(list of fields):
    count = numer of fields that contain 'bc', case insensitive, betwen chars 1:4
    return Record object reflecting the passed in line

def main:
    read lines from stdin
        split line on '\t'
        send line to create_record function

    print the sum or the counts of each record

Results

Lang	Time
Python3	0m5.704s
Perl	0m5.602s
D - ldc	0m4.932s
Nim	0m3.776s
Rust	0m1.214s

See Implementation notes

Language version for results

Python 3.6 Perl 5.26 Nim 0.19 Dlang (dmd) v2.085.1 Dlang (ldc2) 1.15.0

Implementation Notes

I chose to use a class in Python because in a real life scenario, I would create a dataclass to take advantage of mypy / the type system. Classes are a little bit slower though. In Perl, I would likely not have used a class, since there is no real type benefit to doing so. I used Class::XSAccessor here just level the playing field. If dicts are uses in the Perl and Python versions, they both go to about 7.5s, with Perl having a slight edge.

Name		Name	Last commit message	Last commit date
Latest commit History 49 Commits
fast		fast
records		records
wordcount		wordcount
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Some Dubious and Pragmatic Benchmarks

Psuedocode

Results

Language version for results

Implementation Notes

About

Releases

Packages

Contributors 3

Languages

sstadick/bioinfo_benchmarks

Folders and files

Latest commit

History

Repository files navigation

Some Dubious and Pragmatic Benchmarks

Psuedocode

Results

Language version for results

Implementation Notes

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages