Data Scientist, Big Data Job Description
Data Scientist, Big Data Duties & Responsibilities
To write an effective data scientist, big data job description, begin by listing detailed duties, responsibilities and expectations. We have included data scientist, big data job description templates that you can modify and use.
Sample responsibilities for this position include:
Data Scientist, Big Data Qualifications
Qualifications for a job description may include education, certification, and experience.
Licensing or Certifications for Data Scientist, Big Data
List any licenses or certifications required by the position: GCP
Education for Data Scientist, Big Data
Typically a job would require a certain level of education.
Employers hiring for the data scientist, big data job most commonly would prefer for their future employee to have a relevant degree such as Bachelor's and Master's Degree in Computer Science, Statistics, Mathematics, Engineering, Science, Math, Technical, Machine Learning, Physics, Economics
Skills for Data Scientist, Big Data
Desired skills for data scientist, big data include:
Desired experience for data scientist, big data includes:
Data Scientist, Big Data Examples
Data Scientist, Big Data Job Description
- Apply expertise in qualitative analysis and data mining
- Create and design BigData Architectures and Machine Learning processes
- Bringing strategic vision to the team and serving as a subject matter expert in the capabilities of Data Science
- Defines new approaches to feature engineering, extraction and learning
- Collaborates with the data architects of various data platforms (big data, relational and non-relational) to define requirements for data architecture enhancements and new data ingestion
- Utilizes Big Data Analytical tools and packages to design and build highly scalable analytical models
- Leads the analysis and formalization of the business problems
- Leads the team in adoption of new analytical algorithms, tools and technologies
- Lead and participate in the design and implementation of Big data and BI solutions using Hadoop and Apache open source components enterprise grade BI and analytics solutions from leading vendors
- Develops data architectures strategies, principles, standards and frameworks
- Experience manipulating large data sets through statistical software (ex
- Drive client engagements focused on Big Data and Advanced Business Analytics, in diverse domains such as product development, marketing research, public policy, optimization, and risk management
- Five years of professional experience working as a Data Scientist
- Proficiency in analysis
- Define solutions to solve the data processing and analysis
- Experience with data analysis tools including Tableau, R
Data Scientist, Big Data Job Description
- Assist with technical design for solutions to business requirements
- Document designs and implementation
- Planning, preparing and executing proof-of-concepts of the solution using the prototype and refining the solution based on external feedback
- Partner with clients and Think Big Delivery Leads to successfully delivery data science solutions, including associated documentation and presentations
- Open-source big data frameworks such as Apache Drill and Hadoop
- Advise the analytics modelling managers on best practice methodology for Big Data Analytics
- Support a community of data scientists across OpCos
- Build Data Products (50% technical hands-on work and 50% business design)
- Bring expertise with regards to functional and technical building
- Bring knowledge of real time applications (Erlang, Scala) and BigData / NoSQL DB (ElasticSearch)
- STRONG programming background (R /Python/Scala preferred)
- Knowledge of pricing and revenue management theory and practice
- Experience in developing pricing decision science project
- Hands on experience working with Big Data ecosystem (Hadoop/Spark)
- MS degree in Computer Science, Operations Research, Mathematics or Physics required
- 5+ years of professional experience in this area
Data Scientist, Big Data Job Description
- Support specific Data Products projects (integration with billing, provisioning)
- Support customers in Data Products integration and delivery activities
- Develop and enhance the product roadmap
- Design standard and processes for Data Management
- Use Hadoop to performce analysis of data
- Use ETL/MetaData tools to process data from various sources
- Comfortable working in a fast paced agile environment, closely interacting with some of industry’s best software engineers and data scientists, along with business partners
- You will closely work with our “Big Data for Space” transverse group of engineers
- You will also support all the Big Data activities of the Telecom domain
- Research and development of big data analytics platform for data integration, extraction, segmentation, visualization and analysis of large amount of unstructured data
- Knowledge of machine learning techniques (supervised and unsupervised), optimization modeling (convex optimization), reinforcement learning or dynamic programming
- Knowledge of basic statistic testing and descriptive statistics
- Demonstrated track record of architecting, developing and delivering large-scale software solutions
- Ability to work in different database technologies including Big Data / Hadoop (HDFS, MapReduce, Hive, Shark, Spark, ), RDBMS and NoSQL
- Excellent communication skills are a must (writing, presenting)
- Proficiency in analysis packages
Data Scientist, Big Data Job Description
- Implementing any Big Data tools and frameworks required to provide requested capabilities
- Implementing ETL process taking data from different sources (relational databases, logs )
- Motivate, lead, and manage a team of data scientists to help the Think Big Analytics organization meet revenue goals and deliver high quality and high value solutions
- Take responsibility, drive new developments, and work creatively on challenging and groundbreaking development tasks in accordance with the highest technical standards
- Transfer innovative solutions from a piloting phase into a sustainable and scalable software solution for global and cross-business usage
- Mentor and coach other team members and support their functions by providing expertise and insights into data and computer science
- Liaise with internal business and corporate units, and translate their analytical needs into initiatives
- Create meaningful reports and dashboards using traditional and visualization tools and execute daily, weekly, monthly and ad-hoc reports
- Deliver and present reports and data to management and the entire organization
- Assist in identifying and delivering on new reporting capabilities
- Master’s or Ph
- Strong proficiency in parallel computing & distributed algorithms
- Data Science candidates shall have a Master’s or PhD in a data science field
- Experience and passion for finding solutions to real world, applied problems
- Expertise working with large industrial scale data sets, and the ability to use software to manipulate data, prototype new tools, and extract actionable insights from that data
- Capable of presenting outcomes of analytic solutions in a format easily understood by a non-technical audience
Data Scientist, Big Data Job Description
- Manage and guide a team of ML engineers that work with data scientists to build end-to-end machine learning and analytics solution to solve business challenges
- Work on NTTDATA’s Big Data accelerators and Machine Learning Framework
- Develops parallel data-intensive systems using Big Data technologies
- Works with the full open source Hadoop stack from cluster management, to data repositories, to analytics software, to schedulers
- Works in on-premises or public cloud environments to build scalable systems
- Determines the appropriate database given the data and analytics needs, whether file structures such as HDFS, relational databases including NewSQL, non-relational NoSQL databases including in-memory databases
- Optimizes the distribution of data across nodes and the performance of NoSQL repositories
- Identifies performance bottle-necks and evaluates scaling benchmarks
- Developing new technologies in the areas of predictive modeling, simulation, and outcome management
- Conducting research, designing, developing and evaluating first-of-kind data analytics solutions to provide real time diagnostics and therapeutic decision support applications
- Expert-level skills and abilities within field of knowledge, and demonstrated ability to develop new ideas from inception to prototype
- Accomplished R programmer, and experience coding with other languages such as C/C++, Python, and Java
- Experience with modern data management systems like Hadoop, NoSQL, and Spark
- Candidates with 5+ years of relevant experience may be considered for a more senior position
- Ph.D degree in computer science or related fields and with 3+ years working experience
- PhD in Computer Science or Computer Engineering with an emphasis on Distributed