I am developing a outlier detection model. The model will be accessed by users through a Streamlit app. At the moment I am assessing whether deploying the model through the Streamlit page in CDF would be a good option. My issue is that I am not sure what the most efficient way to access and import the dataset into the app is. Would it be a good approach to create a data model and import the data from there, or extract the data straight from raw? Another idea I had was setting up a scheduled Cognite Function to download the data as a csv and letting the app access that.
I need the data to be updated a few times a week and have the “import” time in the app to be fairly quick for the user experience in the app to be good.
My dataset is around 1 million rows with three numeric and two non-numeric columns. For now, I have the data as a raw table in CDF.
Thanks!