2.5 KiB
Automated data lineage from Azure (Command Line Mode)
This article introduces how to discover the data lineage from azure scripts or the azure database and automatically update it. So the business users and developers can see the azure data lineage graph instantly.
Software used in this solution
- SQLFlow Cloud Or SQLFlow on-premise version
- Grabit tool for SQLFlow. It's free.
Install grabit tool
After download grabit tool, please check this article to see how to setup the grabit tool.
Discover data lineage in a Azure database
- Modify the
conf-template\azure-config-templateto meet your environment.
Here is a sample config file: azure-config that grabs metadata from the remote azure database
and sends the metadata to the SQLFlow Cloud to discover the data lineage.
It would help if you had a premium account to access the SQLFlow Cloud.
{
"databaseType":"azure",
"optionType":1,
"resultType":1,
"databaseServer":{
"hostname":"azure ip address",
"port":"1433",
"username":"azure user name",
"password":"your password here",
"database":"",
"extractedDbsSchemas":"",
"excludedDbsSchemas":"",
"extractedStoredProcedures":"",
"extractedViews":"",
"enableQueryHistory":false,
"queryHistoryBlockOfTimeInMinutes":30
},
"SQLFlowServer":{
"server":"https://api.gudusoft.com",
"serverPort":"",
"userId":"your sqlflow premium account id",
"userSecret":"your sqlflow premium account secret code"
},
"neo4jConnection":{
"url":"",
"username":"",
"password":""
},
"isUploadNeo4j":0
}
- Run grabit command-line tool, you may find the grabit.log under the logs directory.
./start.sh /f azure-config
-
Check out the diagram via this url: https://sqlflow.gudusoft.com/#/job/latest
-
You may save the data lineage in JSON/CSV/GRAPHML format.
The file will be saved under
data\datalineagedirectory. -
Run the grabit at a scheduled time