Big Data Architect

Alexandria, VA

Share This Job

Job Description

Octo Consulting Group is seeking a Big Data Architect. The Big Data Architect will help identify, design, develop, and configure to establish a distributed and scalable architecture that supports both public and private data exploration, discovery, dissemination in support of data-driven decision making/ improvements across Government’s functions and business units.  In addition, candidate will perform the following:

  • Work with the project team (e.g. business analyst, User Interface (UI) designer, Product Owner, technical lead) to refine the business requirements and complete the implementation and production deployment of these requirements. These requirements are agile in nature and may evolve through collaboration between cross functional teams and the Product Owner
  • Create and document the Big Data Architecture, design and all pertinent configuration, execution instructions, and artifacts to establish and update the Big Data Reservoir to provide data-driven organizational capabilities. Big Data Architecture will:
  • Leverage open source technologies consistent with the Agency and Federal Source Code Policies
  • Have an ingestion tier that consumes raw data (i.e. real-time and batch) from sources to stores them with purpose-driven ingestion paradigms including ingestion of flat file data, stream data, and/or via API and/or Open Database Connectivity (ODBC)
  • Have a compute tier to support data processing (i.e. Map Reduce, Data Transformation process, data loading to structured sources, etc.) and staging and preparing of the data for the Analytics Tier
  • Have a storage tier (i.e. Hadoop Distributed File System (HDFS)) to support accept and store both raw data and derived data aggregated from the compute tier
  • Have an analysis tier that allows developers and data scientists to create/manage models, derived data and prototypes
  • Provide a data management/governance tier for access control and allow for database views and rules to be set up to manage data access
  • Provide a BDR API to serve as an Application Endpoint supporting a RESTful GET operation on various resources
  • Provide business end users with an interface to run visualization reports/dashboards, develop custom queries, and export/download outputs
  • Follow Government’s processes, procedures, policy, and standards including the System Development Life Cycle (SDLC) and Section 508 compliance
  • Provide Data Science expertise to develop analytical models/prototypes using the Big Data Architecture to support data-driven decision making/improvement.
  • Provide Data Visualization services that view multiple disparate sources of data and illuminate Government data and analytics through visualization tools and analysis thus making the data easier to understand.
  • Develop stories that allow the public to tell/share stories about Government data analytics through interactive visualizations and foster discussion.
  • Through visualizations and understanding of the data articulate the implications of the data including trends, correlation, and identify actionable intelligence and how it relates to other data points.
  • Support Executive Briefings when requested. This will include recommendations and justification of Architecture and Strategy, solutions to current issues and any recommended alternative approaches and justification
  • Version control and code check-in using the Government’s configuration management repository. Work collaboratively with Government content management personnel to comply with their needs. Build management and version control will be integrated with the Government Enterprise CM tools to ensure compliance
  • Create all necessary SDLC documentation or updates including Solution/System Design Document (SDD) for the Big Data Reservoir, CM Plan, Dynamic Operation Support Plan (DOSP), create and document  the approved technologies/technical stack, integration with core business systems and services and other supporting systems, enterprise initiatives and infrastructure

Skills & Requirements

  • Fifteen (15)+ years of experience in architecture and design of large, complex systems that are highly available
  • Five (5)+ years of experience in targeted technologies including Big Data and Mobile technologies
  • Five (5)+ years of experience in and utilizing the core technologies, subject to augmentation as the project proceeds to meet additional business needs and version updates. The core technologies include Red Hat Linux, JBoss 7 (SOA-P, EWS and EAP), Java 1.7, Spring/RestEasy, JavaScript, JQuery, XML, Active Directory, MySQL Enterprise/MariaDB, Oracle, and Solr/Lucene
  • Performance on this project requires in-depth working knowledge and extensive experience using the following software and hardware:
  • C/C++, JAVA, JDK, J2SE, J2EE, Perl and UNIX shell script, Apache/Tomcat, TCP/IP, Spring Framework, REST APl, SOA, JBoss.
  • Standard data formats including XML, XML schema definition (XSD), extensible stylesheet language transformations (XSLT), and hyper-text markup language (HTML)
  • Standard hardware and operating systems now used by the USPTO for its production AIS, including: HP/UX, IBM/AIX, REDHAT/LINUX and MS operating systems, VMware servers, EMC and Network storage devices
  • Extensive expertise and hands-on experience with:
  • Pig, Hive, R/Python, Spark, HDFS, Kafka, Hue/Apache, Java, JavaScript
  • Experience installing, configuring and testing the Hadoop ecosystem components
  • Hadoop information Architecture, data modeling and data mining, machine learning and advanced data processing
  • Displaying data using open source visualization (ie. D3)
  • Performance of this task order requires knowledge and previous experience using the following quality management tools: Rally and SVN

Highly desirable:

  • HBase, MariaDB, MongoDB, YARN
  • Hadoop Certification (Cloudera Certified Hadoop Developer (CCHD) etc.)


  • Alexandria, VA 

About Octo Consulting Group, Inc.

Octo Consulting Group (Octo) is an industry-leading, award-winning provider of digital services for the federal government. Octo specializes in providing agile software engineering, user experience design, cloud services, and digital strategy services that address government’s most pressing missions. Octo delivers intelligent solutions and rapid results, yielding lower costs and measurable outcomes.

Our team is what makes Octo great. At Octo you'll work beside some of the smartest and most accomplished staff you'll find in your career.  Octo offers fantastic benefits and an amazing workplace culture where you will feel valued while you perform mission critical work for our government.  Voted one of the region’s best places to work multiple times, Octo is an employer of choice!

Octo Consulting Group is an Equal Opportunity/Affirmative Action employer.  All qualified candidates will receive consideration for employment without regard to disability, protected veteran status, race, color, religious creed, national origin, citizenship, marital status, sex, sexual orientation/gender identity, age, or genetic information. Octo Consulting Group’s commitment to diversity and inclusive selection practices includes ensuring qualified long-term unemployed job seekers receive equal consideration for employment. 

Other Considerations: Relocation will not be offered. Selected applicants will be subject to a background investigation. Individual’s primary workstation is located in an office area. The noise level in this environment is low to moderate. Regularly required to sit for extended periods up to 80% of time; frequently required to move about to access file cabinets and use office equipment such as PC, copier, fax, telephone, cell phone, etc. Occasionally required to reach overhead, bend, and lift objects of up to 10 lbs. Specific vision abilities required by this job include use of computer monitor screens up to 80% of time.

For more information please visit: