Amazon S3 has expanded compaction support to include Apache Avro and ORC formats for Apache Iceberg tables, complementing existing Parquet format capabilities. This enhancement works across both S3 Tables and general purpose S3 buckets that use AWS Glue Data Catalog optimization.
While Parquet is the default format for Iceberg tables, you can also write data in Avro or ORC formats for specific workloads. For example, you can use Avro to improve write performance for data ingestion and streaming use cases like daily purchase transactions, streaming sensor data, or collecting ad impressions. S3 Tables automatically compact small files into larger ones to minimize scanned data, improve query performance, and reduce costs. By default, compaction converts Avro and ORC files to Parquet for optimal read performance, but you can specify your preferred target format in your table properties.
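As a rough illustration of setting a format through table properties, the sketch below builds a Spark SQL statement that sets the Apache Iceberg `write.format.default` property, which controls the default file format for newly written data files. The helper name `set_write_format_sql` and the table name are hypothetical, and whether compaction reads its target format from this same property is an assumption that should be checked against the S3 Tables maintenance documentation.

```python
# Formats named in the announcement; Iceberg accepts these values
# for the write.format.default table property.
SUPPORTED_FORMATS = {"parquet", "avro", "orc"}


def set_write_format_sql(table: str, file_format: str) -> str:
    """Return a Spark SQL statement that sets an Iceberg table's default write format.

    `table` is a hypothetical fully qualified table name. Assumption: the
    compaction target format follows the same property; verify this in the
    S3 Tables documentation before relying on it.
    """
    fmt = file_format.strip().lower()
    if fmt not in SUPPORTED_FORMATS:
        raise ValueError(
            f"unsupported format {file_format!r}; expected one of {sorted(SUPPORTED_FORMATS)}"
        )
    return f"ALTER TABLE {table} SET TBLPROPERTIES ('write.format.default' = '{fmt}')"


if __name__ == "__main__":
    # Hypothetical table used for streaming-style ingestion in Avro.
    print(set_write_format_sql("my_catalog.sales.daily_purchases", "avro"))
```

The generated statement could then be run from a Spark session configured with an Iceberg catalog, for example with `spark.sql(...)`.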
Compaction support for Apache Avro and ORC formats is now available in all AWS Regions where S3 Tables or optimization with the AWS Glue Data Catalog is available. To learn more about S3 Tables compaction, see the S3 Tables maintenance documentation. For general purpose bucket optimization, see the AWS Glue Data Catalog optimization documentation.
Source: Amazon Web Services