Software/Research Engineer
Clearance Required: Active Top Secret with Polygraph
Location: Emerson (Laurel, MD)
Anticipated Start Date: Spring 2018
PCI is actively interviewing candidates for several upcoming positions on a new Delivery Order on our
Prime contract, supporting the Research Directorate of our IC customer.
This team is tasked to perform research, development, and operations support in areas of
Human Language Technology (HLT), natural language processing (NLP), computational linguistics,
computer vision (CV), and machine learning.
Scope/Description of Work:
This delivery order addresses the increasingly broad spectrum of integrated analytics for multimedia content exploitation encompassing image, video, audio, text, and cross-modal information correlation. This is a large delivery order that will staff up to forty-five (45) researchers and engineers in multiple areas.
Imagine building a system that can take the digital version of your favorite TV show and analyze everything about it. From developing a transcript of everything that’s said and attributing it to a specific character, to doing OCR scrapes of all the text that might appear on the screen. If you see an image of a castle on the screen, automatically search to see where in the world its located, and then searching the rest of your home TV database to see where else that castle appeared, or a specific phrase was said, or a particular actor spoke. Let your imagination run wild for a few minutes and then come to PCI and help develop something that does all that and more.
Human Language Technologies (HLT) and Natural Language Processing (NLP) includes technologies to provide for a variety of language-based processing, including language identification and translation from audio, video, and text-based sources. In addition to word-for-word translations, also included is ‘concept’ translation which requires additional machine learning and statistical model building. Speech-to-text and text-to-speech also fall under the purview of NLP. Computer Vision (CV) seeks to automate tasks that the human visual system can do. This includes understanding digital video/images for scene reconstruction, event detection, object recognition, OCR, etc.
Specific Languages and Technologies:
- Java, C/C++, Python, Golang (Go), Kotlin, JavaScript, Scala, Ruby
- Docker, Mesos, Kubernetes,
- AngularJS, React, HTML5
- Cloud processing (Hadoop, Map/Reduce, Accumulo, Big Table)
- Relational and/or non-relational database experience (eg, MySQL, Oracle, Accumulo, MongoDB, HDFS, etc.)
- Statistical Machine Learning, deep learning, neural networks (eg, TensorFlow, Caffe, etc.)
- Elasticsearch
- Speech recognition (eg, Pyspeech, Mozilla DeepSpeech, Kaldi, CMUSphinx)
- OpenCV, dlib, OCR
If you don’t have them all, this is the team to work on to learn them!