As a Vision Machine Learning / Software Engineer, you will play a key role in enhancing Mentium's deep learning library for vision AI models on our in-memory compute ultra-low power hardware. Collaborating closely with a small team of software and hardware experts, you will design and refine a client-facing codebase that enables seamless compilation and deployment of vision models for real-time applications. Additionally, you will support the training and testing of models for demonstrations.
A strong understanding of vision model components is essential, including convolutional layers, transformers, activation functions, pooling, normalization, and non-maximum suppression. Familiarity with the structure, training and quantization of models such as YOLO, Mask R-CNN, MobileNet, ConvNeXt, EfficientNet, and Vision Transformers is required. You will work with libraries like Torchvision, Ultralytics, MMCV, Keras, ONNX, and OpenCV, adapting to client needs. Awareness of hardware fundamentals—such as memory transfer, inference time, model size, and precision—is critical for real-time edge deployments.
Proficiency in Python is essential, including expertise in OOP principles, decorators, context managers, and numerical computing libraries like NumPy and pandas. You should demonstrate strong skills in modular development, collaborative coding using GitHub, and producing clear, well-structured documentation to enhance the customer experience.
This role offers an exciting opportunity to work with cutting-edge vision AI technology and make a meaningful impact on the development of a robust, client-facing software library.
Requirements and Qualifications:
Why Join Mentium?
At Mentium, you'll be at the forefront of vision AI technology, working alongside a talented team dedicated to innovation. We offer a collaborative environment where your contributions directly influence the development of cutting-edge solutions. Join us to make a meaningful impact in the field of vision AI.
Benefits: