In Etlworks Integrator, you can configure a CSV format to automatically split a CSV document into multiple chunks when creating a new file or sending a payload to an HTTP endpoint.
Splitting typically makes sense only for large documents (millions of rows), where it can greatly improve the performance of certain data load tasks, such as loading data into online data warehouses like Snowflake or Redshift. These systems can load multiple files in parallel, and transferring several smaller files over the network is often faster than transferring one large file.
To always split the CSV file when creating it, create a new CSV format and enter a positive value in the Maximum number of rows in file field. Then use that format in the TO part of the source-to-destination transformation.
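To illustrate what a row limit does to a document, here is a minimal Python sketch of row-based splitting. It is not Etlworks code: the function name `split_csv`, the `max_rows` parameter, and the choice to repeat the header row in every chunk are assumptions made for this example only.

```python
import csv
import io
from typing import List


def split_csv(text: str, max_rows: int) -> List[str]:
    """Split CSV text into chunks holding at most max_rows data rows each.

    Assumption for this sketch: the first row is a header, and it is
    repeated at the top of every chunk.
    """
    if max_rows <= 0:
        # In this sketch, a non-positive limit means "do not split".
        return [text]

    reader = csv.reader(io.StringIO(text))
    header = next(reader, None)
    if header is None:
        return []  # empty input produces no chunks

    chunks: List[str] = []
    buffer = None
    writer = None
    rows_in_chunk = 0

    for row in reader:
        # Start a new chunk on the first row or when the current one is full.
        if writer is None or rows_in_chunk == max_rows:
            if buffer is not None:
                chunks.append(buffer.getvalue())
            buffer = io.StringIO()
            writer = csv.writer(buffer, lineterminator="\n")
            writer.writerow(header)
            rows_in_chunk = 0
        writer.writerow(row)
        rows_in_chunk += 1

    if buffer is not None:
        chunks.append(buffer.getvalue())
    return chunks


if __name__ == "__main__":
    sample = "id,name\n1,a\n2,b\n3,c\n"
    for index, chunk in enumerate(split_csv(sample, max_rows=2), start=1):
        print(f"chunk {index}:")
        print(chunk)
```

With a limit of 2, the three-row sample above yields two chunks: one with rows 1 and 2, and one with row 3. Each chunk can then be transferred or loaded independently.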