- Work with stakeholders throughout the organization to identify opportunities for leveraging company data to drive business solutions
- Undertaking data collection, pre-processing and analysis
- Identify valuable data sources and automate collection processes
- Assess the effectiveness and accuracy of new data sources and data gathering techniques
- Develop custom data models and algorithms to apply to data sets
- Use predictive modelling to increase and optimize customer experiences, revenue generation, ad targeting and other business outcomes
- Develop company A/B testing framework and test model quality
- Coordinate with different functional teams to implement models and monitor outcomes
- Develop processes and tools to monitor and analyze model performance and data accuracy
- Building models to address business problems and then present findings using data visualization techniques (Exploratory Data Analysis)
- Help build mechanisms to analyse large amounts of information to discover trends and patterns (Scale Handling)
- Build predictive models and machine-learning algorithms
- Combine models through ensemble modelling
- Collaborate with product management and engineering departments to understand company needs and devise possible solutions
- Keep up-to-date with latest technology trends
- Communicate results and ideas to key decision makers
- Implement new statistical or other mathematical methodologies as needed for specific models or analysis
- Optimize joint development efforts through appropriate database use and project design
If you are someone with:
- 5+ years of experience manipulating data sets and building statistical models, has a Masters or PHD in Statistics, Mathematics, Computer Science or another quantitative field, and is familiar with the following software/tools:
a. Coding knowledge and experience with several languages: Python, C, C++, Java, JavaScript, etc.
b. Knowledge and experience in statistical and data mining techniques: GLM/Regression, Random Forest, Boosting, Trees, text mining, social network analysis, etc.
c. Deep understanding of Classification, Forecasting & Optimization based use-cases and implementation
d. Experience querying databases and using statistical computer languages: R, Python, SLQ, etc.
e. Experience using web services: Postqresql, Redshift, S3, Spark, DigitalOcean, etc.
f. Experience creating and using advanced machine learning algorithms and statistics: regression, simulation, scenario analysis, modeling, clustering, decision trees, neural networks, etc.
g. Experience with distributed data/computing tools: Map/Reduce, Hadoop, Hive, Spark, Gurobi, MySQL, etc.
h. Experience visualizing/presenting data for stakeholders using: Periscope, Business Objects, D3, ggplot, etc.
- Strong problem solving skills with an emphasis on product development
- Experience using statistical computer languages (R, Python, SLQ, etc.) to manipulate data and draw insights from large data sets
- Experience working with and creating data architectures
- Knowledge of a variety of machine learning techniques (clustering, decision tree learning, artificial neural networks, etc.) and their real-world advantages/drawbacks
- Knowledge of advanced statistical techniques and concepts (regression, properties of distributions, statistical tests and proper usage, etc.) and experience with applications
- Excellent written and verbal communication skills for coordinating across teams.
- A drive to learn and master new technologies and techniques
Interested candidates can share their resumes on silvina.lobo@3scsolution.com