Data Engineer (Python Specialist)

Employer
Nautical Crime Investigation Services (NCIS)
Location
Vancouver, BC
Job Year Level
Graduated Undergrad
Job Type
Full Time

Compensation: $75,000 - $90,000 annually
About us: NCIS (Nautical Crime Investigation Services) is a startup providing technological solutions for national security and maritime defense. NCIS leverages AI and tracking technologies for risk assessment and the detection of criminal activity at sea and on land.
Reports to: Director, Responsible Development of Emerging Technologies


About Us:

Nautical Crime Investigation Services is a leading provider of technological solutions and services for national security and maritime defense. Grounded in principles of justice, equality, and the democratization of technology, we are committed to revolutionizing crime intelligence technology while fostering fairness and inclusivity in our solutions.

Overview

NCIS is seeking a Data Engineer with strong Python skills to join our team. This role focuses on data parsing, manipulation, and preparation for machine learning models and other applications. The ideal candidate will be proficient in Python, able to handle data in a variety of formats, and ready to contribute to the development of AI-based solutions.

Key Responsibilities

  • Parse and manipulate data in various formats (JSON, CSV, XML).
  • Perform string manipulation to clean and process text data.
  • Familiarize yourself with existing work, specifically:
    • LangSmith: including datasets, testing, and setting up a custom evaluator.
    • LangChain: with a focus on its expression language.
  • Prepare training and evaluation datasets by scraping the content of relevant articles.
  • Understand and apply basic classification metrics such as F1 Score, Accuracy, and Confusion Matrix when evaluating models.

Required Technical Skills

  • Proficient in Python for data manipulation and parsing.
  • Experience working with data in formats such as JSON, CSV, and XML.
  • Strong skills in string manipulation and text processing.
  • Familiarity with machine learning and AI concepts, specifically data preparation and evaluation.

Preferred Skills

  • Experience with LangSmith, particularly working with datasets and custom evaluators.
  • Familiarity with LangChain’s expression language.
  • Basic knowledge of scraping tools to extract content from online articles.
  • Understanding of classification metrics (F1 Score, Accuracy, Confusion Matrix).

Bonus Skills

  • Strong interest in exploring new technologies and contributing to AI-based solutions.
  • Prior experience in machine learning pipelines, from data preparation to model evaluation.

Soft Skills & Non-Technical Requirements

  • Clear communication skills, with the ability to explain complex data processes.
  • Problem-solving mindset and attention to detail in data handling.
  • Experience working with Agile methodologies.
  • Willingness to collaborate with cross-functional teams.

Experience & Education

  • 3+ years of experience in data engineering or a similar role with a focus on Python.
  • Degree in Computer Science, Data Science, or a related field.
  • Proven ability to parse, manipulate, and clean data for use in AI or machine learning models.