hi!
Autoscaling would be something to be done at the k8s or docker level
as per TTL, if you are using multitenancy, there is a new feature where you can load and offload from memory on a per tenant basis:
For now, you can activate and deactivate. So if this applies for your usecase, let’s say a user logs out from your system, you can offload that tenant.
Future versions will allow a time to auto offload the tenant, considering it’s last activity.
So, whenever a new query comes in for a deactivated tenant, Weaviate will load it on demand.
Let me know if this helps.
Thanks!