The Copy Data activity is one of the most common uses of a data pipeline. Many pipelines consist of a single Copy Data activity that is used to ingest data from an external source into a lakehouse file or table.
You can also combine the Copy Data activity with other activities to create a repeatable data ingestion process. For example, a Delete data activity removes existing data, a Copy Data activity replaces it with a file containing data from an external source, and a Notebook activity runs Spark code that transforms the data in the file and loads it into a table.
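The three-step process described above can be modeled as a small dependency chain. The following is a minimal Python sketch for illustration only: the activity names and the "kind" and "depends_on" fields are hypothetical and are not the pipeline definition format used by the product. It only shows that each activity runs after the one it depends on.

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+

# Hypothetical activity names and fields, for illustration only;
# this is NOT the real pipeline definition schema.
activities = {
    "Delete old files": {"kind": "Delete data", "depends_on": []},
    "Copy from source": {"kind": "Copy Data", "depends_on": ["Delete old files"]},
    "Transform and load": {"kind": "Notebook", "depends_on": ["Copy from source"]},
}


def run_order(activities):
    """Return activity names in an order that respects each depends_on list."""
    # TopologicalSorter expects a mapping of node -> set of predecessors.
    graph = {name: set(spec["depends_on"]) for name, spec in activities.items()}
    return list(TopologicalSorter(graph).static_order())


order = run_order(activities)
for step, name in enumerate(order, start=1):
    print(f"{step}. {name} ({activities[name]['kind']})")
```

Because every step depends on the previous one, the pipeline can be rerun as a whole: each run starts by clearing the old data before copying and transforming the new data.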
The Copy Data tool

When you add a Copy Data activity to a pipeline, a graphical tool takes you through the steps required to configure the data source and destination for the copy operation. A wide range of source connections is supported, making it possible to ingest data from most common sources. Within OneLake, supported sources include lakehouse, warehouse, SQL Database, and others.

Copy Data activity settings
After you’ve added a Copy Data activity to a pipeline, you can select it in the pipeline canvas and edit its settings in the pane underneath.

When to use the Copy Data activity
Use the Copy Data activity when you need to copy data directly between a supported source and destination without applying any transformations, or when you want to import the raw data and apply transformations in later pipeline activities.