data science full time tech

Job Details

Cape Analytics provides instant property intelligence for buildings across the United States. Cape Analytics enables insurers and other property stakeholders to access valuable property attributes at time of underwriting, with the accuracy and detail that traditionally required an on-site inspection, but with the speed and coverage of property record pre-fill. Founded in 2014, Cape Analytics is backed by leading venture firms and innovative insurers and is comprised of computer vision, data science, and risk analysis experts.

Positions Summary:
We are looking for a Data Scientist with a scientific approach who has strong expertise in data analysis, statistics, programming, and machine learning to join our Data Science team. You will collaborate with Data Scientists, Computer Vision/Machine Learning Engineers, Data Engineers, and members across Software Engineering, Product, and Sales teams to build robust, scalable machine learning models for identification and annotation of the... built world. Additionally, you will develop expertise in ground truth generation, model performance analysis, iterative model development, and unsupervised mapping of the feature space to bring scientific rigor, scalability, and robust performance to our core product offerings.

What You'll Do:

  • Develop scientifically rigorous, creative methodologies to continuously improve our machine learning models
  • Incorporate machine learning and data-driven decisioning into the core of our infrastructure
  • Contribute to design and automation of model training, model post-processing and evaluation pipelines at scale
  • Leverage the extensive data generated by Cape in addition to data from external sources to generate structured knowledge about our feature space
  • Present your results internally and externally;
  • Defend your methodology and incorporate feedback from internal teams as well as customers.
  • Improve model performance by identifying failure modes using supervised and unsupervised learning techniques
  • Ideate and implement data-driven methodologies to help scale model performance across geographical, climatic, and temporal dimensions
  • Implement automated solutions for ensuring data quality and delivery
  • Explore and mine new data sources that will help optimize and validate our models
  • Link model capabilities to market needs by customizing models, designing and running validation studies
  • Contribute to peer mentorship, knowledge bases, and skills transfer
    Skills/Requirements:
  • PhD in a STEM field and 1-3 years of hands-on industry experience
  • Solid knowledge of statistical techniques, including hypothesis testing, statistical sampling, significance testing, statistical inference, maximum likelihood estimation, and experimental design, among others.
  • Mastery of, supervised and unsupervised algorithms and their implementations, machine learning concepts including regularization, learning curves, optimizing hyperparameters, cross-validation, among others.
  • Advanced knowledge and significant programming experience in Python programming or other scripting language including relevant libraries like numpy, pandas, SciPy, matplotlib
  • Familiarity with the Linux environment including shell scripting, Git and tools for reproducibility (e.g. virtual environments, Docker )
  • Demonstrated expertise in building data tools for ETL and data analysis
  • Experience in building meaningful data visualizations using at least one scripting-based visualization tool such as matplotlib, d3.js or bokeh
  • Nice to haves:Experience designing data schemas and extracting data from SQL and NoSQL databases. Experience with GIS systems. Experience with modern data technologies, e.g. Spark, Tensorflow, Jupyter Notebook, DockerExperience with cloud computing on AWS or GCP

We believe:

  • Talent is critical, but best when tempered with humility
  • Self-motivation leads to the best outcomes
  • Open, direct communication is a sign of respect
  • Teamwork drives success
  • Having fun together is an important part of the job
  • **Cape Analytics is an E-verify participant

See something wrong with this listing?

Contact support