Education/ Experience and Skill Requirement
4+ years of relevant experience in
Experience working with both relational and NoSQL databases.
Strong coding skills; Python (preferred) /R/Java/Scala
Experience in developing Data warehousing technologies Experience with AWS or equivalent cloud services preferred
Experience in BigData technologies (Hadoop, HDFS, MapReduce, Spark, Hive, HBase etc) will be valuable
Knowledge of Machine Learning a big plus (Random Forest, Decision Trees, SVM, NLP, Gradient Boosting, Supervised/Unsupervised Learning, Clustering, classification and regression modeling).
Responsibilities
-
Proficiency with several years’ experience in more than one of Python, R, Java, Scala, or robust Linux shell scripting
-
Implementation experience with data warehouse architecture & design, ETL design/development, and Analytics
-
Knowledge of general cloud architecture and cloud strategies especially around AWS services and concepts such as S3 object stores, RDS databases, EC2, Glacier, Lambda, IAM, enterprise security, data security, DevOps, replication and disaster recovery
-
Well versed with data mining & exploration, NLP and visualization
-
Understanding of data modeling, data integration, and data representation (metadata, OWL, ontologies)
-
Developing data marts and data management using SQL
-
Creating powerful visual outcomes
-
Independently manage daily client communication, especially over calls
-
Manage client deadlines, ensure quality of the deliverables, attention to detail
-
Experience/understanding of corporate finance data from company filings is desirable