I initially pushed against the idea to save some space (~50gb for 100 billion samples). But a lot of time spent time in analysing the false alerts could simply be explained if we know exactly when the sample is stored on the database vs when it is produced on client side.
At that, scale additional 50gb should be fine for the time it will save us.