Processing websocket messages quickly

thintz.com/slides/websockets

Details

  • CHICKEN Scheme (a Lisp)
  • Wanted websockets; CHICKEN didn't have them
  • Written completely from scratch

websockets summary

Full-duplex communication over a single TCP connection

websocket.send('hi');

websocket.onmessage = function(evt) {
  console.log('server said: ' + evt.data);
};

> server said: hello!

The Test:

Processing a 16 MB text message

  • processing messages is common
  • a large size makes comparisons easier
  • only one of many possible tests

Goal

Be faster than the most popular implementations

  • ~380 ms

23 seconds

version 1

First thing

Instrument everything and profile

  • library itself
  • dependencies
  • runtime

1st conclusions

  • High memory allocation
    • Inherently slow
    • Very high GC pressure

The process image would grow to over 1 GB!

Action

  • Remove unnecessary copying of the message
  • Remove incidental garbage creation

(Both brought down GC pressure significantly)

2.5 seconds

Bottleneck in runtime

write-u8vector

  • internally copied some strings one character at a time!
  • improved by copying the entire string at once
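The difference can be sketched in C. This is an illustrative sketch, not the actual CHICKEN runtime code: the slow path copies one byte per loop iteration, while the fix hands the whole buffer to `memcpy`, which copies in bulk.

```c
#include <assert.h>   /* for the demo assertions */
#include <stdint.h>
#include <string.h>

/* Sketch of the slow path: copying a buffer one byte at a time,
 * paying loop overhead on every single byte. */
static void copy_bytewise(uint8_t *dst, const uint8_t *src, size_t len) {
    for (size_t i = 0; i < len; i++)
        dst[i] = src[i];
}

/* Sketch of the fix: copy the entire buffer in one call; memcpy
 * typically moves word-sized (or SIMD-sized) chunks internally. */
static void copy_bulk(uint8_t *dst, const uint8_t *src, size_t len) {
    memcpy(dst, src, len);
}
```

Both produce identical results; only the per-byte overhead differs.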

1 second

Bottleneck in unmasking

(a websockets pseudo-security mechanism)

  • Requires xor-ing the entire message

Action

  • Converted from pure Scheme code to C

    (CHICKEN allows embedding C directly)

  • Which allowed using some low-level trickery
  • And eliminating some more garbage generation
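A minimal C sketch of unmasking, per RFC 6455: each payload byte is xor-ed with one of the four mask-key bytes, cycling through the key. The word-at-a-time loop is one example of the kind of low-level trickery C permits; the actual CHICKEN code may differ.

```c
#include <assert.h>   /* for the demo assertions */
#include <stdint.h>
#include <string.h>

/* Unmask (or mask: xor is its own inverse) a WebSocket payload in
 * place, given the 4-byte mask key from the frame header. */
static void unmask(uint8_t *payload, size_t len, const uint8_t key[4]) {
    size_t i = 0;

    /* Low-level trick: build an 8-byte repetition of the key and
     * xor one 64-bit word per iteration instead of one byte.
     * (memcpy avoids unaligned-access issues.) */
    uint64_t wide;
    uint8_t rep[8] = { key[0], key[1], key[2], key[3],
                       key[0], key[1], key[2], key[3] };
    memcpy(&wide, rep, 8);
    for (; i + 8 <= len; i += 8) {
        uint64_t chunk;
        memcpy(&chunk, payload + i, 8);
        chunk ^= wide;
        memcpy(payload + i, &chunk, 8);
    }

    /* Tail: handle the remaining (< 8) bytes one at a time. */
    for (; i < len; i++)
        payload[i] ^= key[i % 4];
}
```

Working on the buffer in place also means no copies and no garbage.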

700 ms

UTF-8 Validation

  • Using a parser combinator algorithm
    • Relatively slow
  • Most websockets implementations use a fast, freely available C implementation

    But it was broken!

UTF-8 Validation Improvements

  • Improved parser combinator algorithm
  • Still slower than some other implementations
  • But remained "correct"

400 ms

Pretty much reached the goal, but can we do better?

Assumption optimization (cheating?)

  • Assumption: most messages are plain ASCII
  • Realization:

    • Plain ASCII can be checked in C effectively for free
    • > If a byte is greater than 127, bail out to the parser combinator algorithm; otherwise continue to the next byte

    • No memory allocation, no garbage, O(n) running time
    • Worst case is still good (~400 ms)
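The fast path above fits in a few lines of C. A sketch (assumed shape, not the actual library code): any byte > 127 means the message is not pure ASCII, and the caller falls back to the full parser combinator validator.

```c
#include <assert.h>   /* for the demo assertions */
#include <stddef.h>
#include <stdint.h>

/* Fast path for UTF-8 validation: a message whose bytes are all
 * <= 127 is plain ASCII, and plain ASCII is always valid UTF-8.
 * Returns 1 if the whole buffer is ASCII; returns 0 on the first
 * byte > 127, signalling the caller to run the slow validator.
 * No allocation, no garbage, single O(n) scan. */
static int all_ascii(const uint8_t *buf, size_t len) {
    for (size_t i = 0; i < len; i++)
        if (buf[i] > 127)
            return 0;   /* bail to the parser combinator algorithm */
    return 1;
}
```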

285 ms!

Summary:

  • It helps to instrument and profile everything including language runtime
  • Watch out for unnecessary memory allocations
  • Utilize low-level languages/libraries when possible (most fast implementations do)
  • Consider likely "real world" usage as well as academic/theoretical
  • Cheat when possible :)

More info: