New

Improve 'list datasets' endpoint - Python SDK

Related products:API and SDKs

7 days ago
April 4, 2025
0 replies
11 views

Mariana Vertuli
Active

I would like to suggest a new feature on "list datasets" endpoint. It would be extremely helpful to have the count of files (of specific MIME types) included as a property in the response.

Currently, as an example, to determine the number of .pdf and .docx files in a specific dataset, we would make two separate requests to the files.aggregate() endpoint - each filtered by the datasetId and the mimetype (one request for each MIME type). This approach becomes inefficient when we need to evaluate this information across many datasets, as it requires too many requests.

If we need this information from many datasets, we should to list them, and then retrieve the count of files available through files.aggregate(). The proposed solution is to enhance data_sets.list() endpoint by adding a 'files' filter, allowing us to send an array of MIME types. The response would include the datasets that contains that specific file types, along with the files counts.

Thank you in advance!

Cookie Policy

We use cookies to enhance and personalize your experience. If you accept you agree to our full cookie policy. Learn more about our cookies.

Cookie settings

We use 3 different kinds of cookies. You can choose which cookies you want to accept. We need basic cookies to make this site work, therefore these are the minimum you can select. Learn more about our cookies.

Basic
Functional

Normal
Functional + analytics

Complete
Functional + analytics + social media + embedded videos

Related topics

Opzeggen Simonly?icon

Hoe overstappen van SimOnly naar Prepaid?icon

wat maakt de prijs voor mbs buiten de bundel bij jullie hoger dan bij andere providers?icon

Hoe moet ik mijn abonnement opzeggen?icon

prepaid esim omzetten naar simonly, hoe doe ik dat?icon

Sign up

Log in to the community

Scanning file for viruses.

This file cannot be downloaded

Cookie Policy

Cookie settings