* Apache Spark with Scala – proven experience in building and optimizing distributed data processing applications
* Strong proficiency in SQL in big data contexts
Nice-to-Have:
* Shell scripting
* Unix/Linux environments (command-line usage, scripting, automation)
* Experience working with on-premises infrastructure
* Familiarity with version control (Git), CI/CD, and data monitoring tools