About the Company: Founded in 2023, our client’s vision is to transform the legal and financial industry by creating a platform that empowers law firms and investors to make a positive impact on society. To harness the power of AI and data analytics, our client’s mission is to empower law firms with unparalleled insights and precision in claims assessment About the Role: This internship is a unique opportunity to work on cutting-edge projects in legal tech and develop hands-on experience in data science and software engineering. As a Data Science Engineer , you will be instrumental in developing the core functionalities of the platform, focusing on data preprocessing and integration . This role offers exposure to real-world challenges in handling large datasets and building scalable solutions. Key Responsibilities: Data Preprocessing : Process large volumes of unstructured data (e.g., text files, PDFs, Excel) to prepare for analysis. Database Integration : Structure data into graph databases like Neo4j and optimise relational, vector, and data lake systems. Programming : Develop efficient tools and scripts primarily in Python, with the potential to use TypeScript and React for Front End. Cloud Solutions : Implement scalable solutions using AWS services. Team Collaboration : Work closely with a lead developer and key stakeholders to deliver impactful solutions. Documentation : Use Jira and Confluence for task tracking and knowledge sharing. More Info: Location: Sydney Type: Full-time internship of 5 days per week Duration: 6 months Work Arrangement: hybrid Start Date: ASAP Allowance : monthly allowance will be provided to support some expenses during internship Requirements About You: Currently studying or recently graduated in Computer Science, Data Science, or a related field. Proficient in Python (mandatory) and familiarity with TypeScript (optional). Experience with data preprocessing , particularly handling large unstructured datasets. Knowledge of NLP techniques is a plus. Familiarity with AWS cloud services. Strong understanding of database systems , especially graph databases like Neo4j . Excellent communication and teamwork skills. Highly detail-oriented and proactive. Adaptable to a dynamic startup environment.