Data deletion

Delete data for a time range manually

Apache Druid stores data partitioned by time chunk and supports deleting data for time chunks by dropping segments. This is a fast, metadata-only operation.

Deletion by time range happens in two steps:

Segments to be deleted must first be marked as "unused". This can happen when a segment is dropped by a drop rule or when you manually mark a segment unused through the Coordinator API or web console. This is a soft delete: the data is not available for querying, but the segment files remains in deep storage, and the segment records remains in the metadata store.
Once a segment is marked "unused", you can use a kill task to permanently delete the segment file from deep storage and remove its record from the metadata store. This is a hard delete: the data is unrecoverable unless you have a backup.

For documentation on disabling segments using the Coordinator API, see the Legacy metadata API reference.

A data deletion tutorial is available at Tutorial: Deleting data.

Delete data automatically using drop rules

Druid supports load and drop rules, which are used to define intervals of time where data should be preserved, and intervals where data should be discarded. Data that falls under a drop rule is marked unused, in the same manner as if you manually mark that time range unused. This is a fast, metadata-only operation.

Data that is dropped in this way is marked unused, but remains in deep storage. To permanently delete it, use a kill task.

Delete specific records

Druid supports deleting specific records using reindexing with a filter. The filter specifies which data remains after reindexing, so it must be the inverse of the data you want to delete. Because segments must be rewritten to delete data in this way, it can be a time-consuming operation.

For example, to delete records where userName is 'bob' with native batch indexing, use a transformSpec with filter {"type": "not", "field": {"type": "selector", "dimension": "userName", "value": "bob"}}.

To delete the same records using SQL, use REPLACE with WHERE userName <> 'bob'.

To reindex using native batch, use the druid input source. If needed, transformSpec can be used to filter or modify data during the reindexing job. To reindex with SQL, use REPLACE <table> OVERWRITE with SELECT ... FROM <table>. (Druid does not have UPDATE or ALTER TABLE statements.) Any SQL SELECT query can be used to filter, modify, or enrich the data during the reindexing job.

Data that is deleted in this way is marked unused, but remains in deep storage. To permanently delete it, use a kill task.

Delete an entire table

Deleting an entire table works the same way as deleting part of a table by time range. First, mark all segments unused using the Coordinator API or web console. Then, optionally, delete it permanently using a kill task.

Delete data permanently using `kill` tasks

Data that has been overwritten or soft-deleted still remains as segments that have been marked unused. You can use a kill task to permanently delete this data.

The available grammar is:

{
    "type": "kill",
    "id": <task_id>,
    "dataSource": <task_datasource>,
    "interval" : <all_unused_segments_in_this_interval_will_die!>,
    "versions" : <optional_list_of_segment_versions_to_delete_in_this_interval>,
    "context": <task_context>,
    "batchSize": <optional_batch_size>,
    "limit": <optional_maximum_number_of_segments_to_delete>,
    "maxUsedStatusLastUpdatedTime": <optional_maximum_timestamp_when_segments_were_marked_as_unused>
}

Some of the parameters used in the task payload are further explained below:

Parameter	Default	Explanation
`versions`	null (all versions)	List of segment versions within the specified `interval` for the kill task to delete. The default behavior is to delete all unused segment versions in the specified `interval`.
`batchSize`	100	Maximum number of segments that are deleted in one kill batch. Some operations on the Overlord may get stuck while a `kill` task is in progress due to concurrency constraints (such as in `TaskLockbox`). Thus, a `kill` task splits the list of unused segments to be deleted into smaller batches to yield the Overlord resources intermittently to other task operations.
`limit`	null (no limit)	Maximum number of segments for the kill task to delete.
`maxUsedStatusLastUpdatedTime`	null (no cutoff)	Maximum timestamp used as a cutoff to include unused segments. The kill task only considers segments which lie in the specified `interval` and were marked as unused no later than this time. The default behavior is to kill all unused segments in the `interval` regardless of when they where marked as unused.

WARNING: The kill task permanently removes all information about the affected segments from the metadata store and deep storage. This operation cannot be undone.

Auto-kill data using Coordinator duties

Instead of submitting kill tasks manually to permanently delete data for a given interval, you can enable auto-kill of unused segments on the Coordinator. The Coordinator runs a duty periodically to identify intervals containing unused segments that are eligible for kill. It then launches a kill task for each of these intervals.

Refer to Data management on the Coordinator to configure auto-kill of unused segments on the Coordinator.

Auto-kill data on the Overlord (Experimental)

info

This is an experimental feature that:

Can be used only if segment metadata caching is enabled on the Overlord.
MUST NOT be used if auto-kill of unused segments is already enabled on the Coordinator.

This is an experimental feature to run kill tasks in an "embedded" mode on the Overlord itself.

These embedded tasks offer several advantages over auto-kill performed by the Coordinator as they:

avoid a lot of unnecessary REST API calls to the Overlord from tasks or the Coordinator.
kill unused segments as soon as they become eligible.
run on the Overlord and do not take up task slots.
finish faster as they save on the overhead of launching a task process.
kill a small number of segments per task, to ensure that locks on an interval are not held for too long.
skip locked intervals to avoid head-of-line blocking in kill tasks.
require little to no configuration.
can keep up with a large number of unused segments in the cluster.
take advantage of the segment metadata cache on the Overlord.

Refer to Auto-kill unused segments on the Overlord to configure auto-kill of unused segments on the Overlord. See Auto-kill metrics for the metrics emitted by embedded kill tasks.

Delete data for a time range manually​

Delete data automatically using drop rules​

Delete specific records​

Delete an entire table​

Delete data permanently using kill tasks​

Auto-kill data using Coordinator duties​

Auto-kill data on the Overlord (Experimental)​