Better approach for getting a streaming response
I have been following the approach from the official Convex tutorial for streaming LLM output, in which an action calls an internal mutation each time it receives new streaming results.
While this method is easy to implement and looks like streaming from the user's perspective, I believe it consumes a significant amount of database bandwidth and is somewhat slower than streaming the response directly to the client.
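To make the bandwidth concern concrete, here is a rough back-of-the-envelope sketch. It assumes that each mutation rewrites the full accumulated message text, which is how I understand the pattern but is not a measurement of how Convex itself accounts for bandwidth. The function names and the chunk sizes are hypothetical, and character count is used as a rough proxy for bytes.

```typescript
// Hypothetical model of the per-chunk pattern: every new chunk triggers one
// "mutation" that writes the whole accumulated message body again.
function charsWrittenPerChunk(chunks: string[]): number {
  let body = "";
  let total = 0;
  for (const chunk of chunks) {
    body += chunk;
    total += body.length; // assumption: the full body is rewritten each time
  }
  return total;
}

// Same stream, but only writing after every `batchSize` chunks,
// plus one final write so the last partial batch is not lost.
function charsWrittenBatched(chunks: string[], batchSize: number): number {
  let body = "";
  let total = 0;
  chunks.forEach((chunk, i) => {
    body += chunk;
    const isLast = i === chunks.length - 1;
    if ((i + 1) % batchSize === 0 || isLast) {
      total += body.length;
    }
  });
  return total;
}

// Made-up example stream: 100 chunks of 4 characters (400 characters in total).
const exampleChunks: string[] = Array.from({ length: 100 }, () => "abcd");

console.log(charsWrittenPerChunk(exampleChunks)); // 20200
console.log(charsWrittenBatched(exampleChunks, 10)); // 2200
```

Under these assumptions, a 400-character message costs about 20,200 characters of writes when updated on every chunk, and about 2,200 when updated every 10 chunks. Batching reduces the cost but does not remove it, which is why I am asking about a direct streaming response.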
Are there any plans or guides for returning streaming responses in Convex actions?
