Querying, pre-processing, cleaning, engineering features from, and analyzing large amounts of structured and unstructured data (terabytes to petabytes) across multiple data sources using Structured Query Language (SQL), Python, PyTorch, PySpark, R, Spark, and Scala in a cloud-native AWS environment. Coll...