Skip to content
Snippets Groups Projects
  1. Oct 12, 2020
    • Erik Johnston's avatar
      Make event persisters periodically announce position over replication. (#8499) · 8de3703d
      Erik Johnston authored
      Currently background proccesses stream the events stream use the "minimum persisted position" (i.e. `get_current_token()`) rather than the vector clock style tokens. This is broadly fine as it doesn't matter if the background processes lag a small amount. However, in extreme cases (i.e. SyTests) where we only write to one event persister the background processes will never make progress.
      
      This PR changes it so that the `MultiWriterIDGenerator` keeps the current position of a given instance as up to date as possible (i.e using the latest token it sees if its not in the process of persisting anything), and then periodically announces that over replication. This then allows the "minimum persisted position" to advance, albeit with a small lag.
      8de3703d
  2. Oct 07, 2020
    • Erik Johnston's avatar
      Reduce serialization errors in MultiWriterIdGen (#8456) · fa8934b1
      Erik Johnston authored
      We call `_update_stream_positions_table_txn` a lot, which is an UPSERT
      that can conflict in `REPEATABLE READ` isolation level. Instead of doing
      a transaction consisting of a single query we may as well run it outside
      of a transaction.
      fa8934b1
    • Erik Johnston's avatar
      Reduce serialization errors in MultiWriterIdGen (#8456) · ae5b2a72
      Erik Johnston authored
      We call `_update_stream_positions_table_txn` a lot, which is an UPSERT
      that can conflict in `REPEATABLE READ` isolation level. Instead of doing
      a transaction consisting of a single query we may as well run it outside
      of a transaction.
      ae5b2a72
  3. Oct 02, 2020
    • Erik Johnston's avatar
      Add logging on startup/shutdown (#8448) · e3debf96
      Erik Johnston authored
      This is so we can tell what is going on when things are taking a while to start up.
      
      The main change here is to ensure that transactions that are created during startup get correctly logged like normal transactions.
      e3debf96
  4. Sep 30, 2020
    • Erik Johnston's avatar
      Make token serializing/deserializing async (#8427) · 7941372e
      Erik Johnston authored
      The idea is that in future tokens will encode a mapping of instance to position. However, we don't want to include the full instance name in the string representation, so instead we'll have a mapping between instance name and an immutable integer ID in the DB that we can use instead. We'll then do the lookup when we serialize/deserialize the token (we could alternatively pass around an `Instance` type that includes both the name and ID, but that turns out to be a lot more invasive).
      7941372e
    • Richard van der Hoff's avatar
      Rewrite BucketCollector · 6d2d42f8
      Richard van der Hoff authored
      This was a bit unweildy for what I wanted: in particular, I wanted to assign
      each measurement straight into a bucket, rather than storing an intermediate
      Counter which didn't do any bucketing at all.
      
      I've replaced it with something that is hopefully a bit easier to use.
      
      (I'm not entirely sure what the difference between a HistogramMetricFamily and
      a GaugeHistogramMetricFamily is, but given our counters can go down as well as
      up the latter *sounds* more accurate?)
      6d2d42f8
  5. Sep 29, 2020
  6. Sep 28, 2020
  7. Sep 24, 2020
    • Erik Johnston's avatar
      Fix MultiWriteIdGenerator's handling of restarts. (#8374) · f112cfe5
      Erik Johnston authored
      On startup `MultiWriteIdGenerator` fetches the maximum stream ID for
      each instance from the table and uses that as its initial "current
      position" for each writer. This is problematic as a) it involves either
      a scan of events table or an index (neither of which is ideal), and b)
      if rows are being persisted out of order elsewhere while the process
      restarts then using the maximum stream ID is not correct. This could
      theoretically lead to race conditions where e.g. events that are
      persisted out of order are not sent down sync streams.
      
      We fix this by creating a new table that tracks the current positions of
      each writer to the stream, and update it each time we finish persisting
      a new entry. This is a relatively small overhead when persisting events.
      However for the cache invalidation stream this is a much bigger relative
      overhead, so instead we note that for invalidation we don't actually
      care about reliability over restarts (as there's no caches to
      invalidate) and simply don't bother reading and writing to the new table
      in that particular case.
      f112cfe5
  8. Sep 23, 2020
  9. Sep 18, 2020
  10. Sep 08, 2020
  11. Sep 04, 2020
  12. Sep 02, 2020
  13. Sep 01, 2020
  14. Aug 28, 2020
    • Richard van der Hoff's avatar
      Rename `get_e2e_device_keys` to better reflect its purpose (#8205) · 45e8f772
      Richard van der Hoff authored
      ... and to show that it does something slightly different to
      `_get_e2e_device_keys_txn`.
      
      `include_all_devices` and `include_deleted_devices` were never used (and
      `include_deleted_devices` was broken, since that would cause `None`s in the
      result which were not handled in the loop below.
      
      Add some typing too.
      45e8f772
  15. Aug 27, 2020
  16. Aug 26, 2020
  17. Aug 25, 2020
  18. Aug 24, 2020
  19. Aug 19, 2020
    • Patrick Cloke's avatar
    • Erik Johnston's avatar
      Separate `get_current_token` into two. (#8113) · 76d21d14
      Erik Johnston authored
      The function is used for two purposes: 1) for subscribers of streams to
      get a token they can use to get further updates with, and 2) for
      replication to track position of the writers of the stream.
      
      For streams with a single writer the two scenarios produce the same
      result, however the situation becomes complicated for streams with
      multiple writers. The current `MultiWriterIdGenerator` does not
      correctly handle the first case (which is not an issue as its only used
      for the `caches` stream which nothing subscribes to outside of
      replication).
      76d21d14
  20. Aug 18, 2020
  21. Aug 17, 2020
  22. Aug 14, 2020
  23. Aug 12, 2020
  24. Aug 11, 2020
  25. Aug 10, 2020
  26. Aug 07, 2020
Loading