1. Data monitoring/metrics/alerts
2. Open source technologies
3. Kafka
4. Java
5. Spark
6. Python
7. AWS
8. Expert understanding of data flow (ETL, pipelines, etc)
9. Expert SQL skills with many data source types (RDBMS, Columnar Data stores, MPP )
10. Integration MDR/Catalog/Crawler(s)
11. Deep understanding of data schema (metadata management)
12. Alation, Apache, Amundsen