Roles and responsibilities
As a Full Stack Data Engineer Intern, you will extract data from publicly available websites and store it in a structured format for analysis and market intelligence, leveraging the existing Big Data infrastructure. You will work in a modern cloud-based data warehousing environment and learn from a diverse, talented team of engineers, among the best in the region, gaining invaluable experience.
Requirements
- BSc in Computer Science or a related field
- Knowledge of version control systems such as Git, and of CI/CD
- Knowledge of terminal operations and Linux workflows
- Able to manipulate and visualize data using SQL, Excel, or a data visualization tool such as Tableau
- Good knowledge of data modeling
- Knowledge of SQL and Python
- Knowledge of data structures and algorithms
- Familiarity with DBMS technologies, including MySQL and PostgreSQL
- Basic knowledge of web development (HTML, CSS, JS)
- Strong communication and interpersonal skills are essential, as lively debates are a part of the day-to-day.
- Ability to maintain a professional and collaborative spirit
Benefits
- Opportunity to contribute to a platform used by more than 5M users in the UAE, as well as to other platforms in the region
- Strengthen your resume and build your network
- Opportunity to find a full-time career with the region's leading organization
- Working in a multicultural environment with over 50 different nationalities
Desired candidate profile
1. Data Ingestion and Processing
- Ingesting Data: Learn how to work with various data sources (e.g., databases, APIs, data lakes) to collect data efficiently.
- Data Transformation: Help in developing data pipelines to transform raw data into a structured format suitable for analysis. This often involves working with ETL (Extract, Transform, Load) processes.
- Working with Big Data Technologies: Gain exposure to technologies like Apache Spark, Hadoop, Kafka, or Airflow for managing large-scale data processing.
2. Back-End Development
- API Development: Learn to design and build APIs that allow different systems to interact with data. This might include creating endpoints for querying data, uploading or manipulating data, or integrating with other systems.
- Database Design and Management: Work with relational (e.g., MySQL, PostgreSQL) and NoSQL (e.g., MongoDB, Cassandra) databases. Develop and optimize queries to extract and manipulate data.
- Server-Side Logic: Gain experience writing server-side code to handle complex data processing tasks. You might work with languages like Python, Java, or Scala for back-end development.
3. Front-End Development
- User Interfaces: Gain exposure to designing and developing front-end interfaces for interacting with data. This can involve creating dashboards or visualization tools.
- Data Visualization: Learn how to display data in user-friendly formats using libraries like D3.js, Plotly, or Chart.js.
- Web Technologies: Get hands-on experience with front-end technologies like HTML, CSS, and JavaScript, and frameworks like React or Angular for building dynamic, data-driven web applications.
4. Collaboration and Code Development
- Version Control: Use version control systems like Git and GitHub for managing code changes and collaborating with other team members.
- Agile Practices: Participate in agile development practices such as Scrum or Kanban, including meetings like daily stand-ups, sprint planning, and retrospectives.
- Code Review: Engage in code reviews with mentors or senior developers to receive feedback and improve coding practices.
5. Data Storage and Optimization
- Data Warehousing: Work with cloud-based data warehousing solutions like Google BigQuery, Amazon Redshift, or Snowflake to store and organize large volumes of structured data.
- Performance Optimization: Learn how to optimize data storage and processing for performance, scalability, and efficiency.
- Data Quality: Work on ensuring data integrity, consistency, and quality through validation and cleaning techniques.