
Hello, I'm Haoyue Xu 👋

I'm a data science enthusiast currently pursuing a Master of Science in Applied Data Science at the University of Southern California, set to graduate in December 2024. With a strong background in computer science from the University of Nottingham Ningbo China, I am passionate about leveraging data to solve real-world problems.

🛠 Skills

  • Programming Languages: Python, Java, C/C++, MATLAB, HTML5, CSS3, JavaScript, Swift, Haskell
  • Frameworks & Libraries: Pandas, NumPy, SciPy, TensorFlow, PyTorch, Keras, XGBoost, Matplotlib, Seaborn
  • Big Data & Cloud: Apache Spark, Hadoop, AWS, Google Cloud
  • Tools: Tableau, Power BI

🌱 I’m currently learning

  • Advanced machine learning techniques and cloud services optimization to enhance data-driven solutions.

💼 Experience

  • Data Developer Intern at Lark AI ByteDance, where I spearheaded annotation accuracy initiatives and participated in cutting-edge NLP projects.
  • Research Assistant Intern at Zhejiang Future Petrochemical Co., where I used Python for data analysis and automated data collection processes.

📈 Projects

  • Hadoop MapReduce Emulator: Developed an emulator to understand and optimize the MapReduce process.
  • Music-driven Dance Generation: Created a model for generating dance sequences based on music dynamics using deep learning.
  • Scalable Data Systems Project: Applied principles from DSCI 553 at USC to develop scalable systems using Apache Spark for large-scale data analysis, focusing on practical applications in data mining and machine learning.
  • Advanced Algorithm Solutions: Studied and applied complex algorithmic techniques in CSCI 570, tackling problems with divide and conquer, dynamic programming, and network flow, among other methods, to improve problem-solving and computational efficiency in practical scenarios.
  • Data Science for Business and Society: Explored applications of data science in DSCI 599 to solve real-world problems across domains such as healthcare, public safety, and marketing using advanced machine learning techniques.
  • Data Science Professional Practicum:
    • Stock Portfolio Management System: Developed a system using the Yahoo Finance API to manage stock portfolios, implementing forecasting methods like ARIMA and LSTM for trading algorithms.
    • Reddit Post Scraper and Analyzer: Created tools to scrape, process, and analyze data from Reddit, employing batch processing and machine learning techniques like Doc2Vec and KMeans for content analysis.
    • Web Data Extraction for Well-data Analysis: Implemented a system to extract and preprocess data from web and PDF sources, integrating various data into a MySQL database for further analysis.
    • PDF Chatbot with GPT-3.5: Built a custom chatbot interface using LangChain and GPT-3.5 to interact with information extracted from PDF documents, enhancing user engagement with document content.
    • Expedition Bot: Developed a sophisticated travel planning bot that integrates multiple technologies and APIs to provide personalized travel solutions.

📫 How to reach me:

Feel free to check out my repositories and don't hesitate to connect!

Hayley XU's Projects

dsci560-n.a.h

This repository contains projects from the DSCI 560 course at USC. Projects range from stock portfolio management using advanced forecasting methods to a chatbot that uses GPT-3.5 to interact with PDF content.

dsci599

Applications of data science and machine learning techniques for solving business, economic, and societal problems, including marketing, econometrics, education, public safety, healthcare, and social services.

emotion-detector

Built an emotion detection application with the Watson NLP library, formatting its output and deploying it with Flask.

haoyuexu99

This repository hosts the README file for my GitHub profile, showcasing my skills, experiences, projects, and contact information.

passlawtest

PassLawTest is a learning app that uses generative AI to help users prepare efficiently for the legal profession qualification exam.

usc-csci-570-sp24

CSCI 570 is a comprehensive course focused on the design and analysis of algorithms. Students learn key algorithmic techniques like divide and conquer, greedy, and dynamic programming. The course also delves into network flow, NP-completeness, approximation algorithms, and linear programming.

usc-dsci-553-fall23

This repository covers DSCI 553, focusing on data mining, a crucial skill for analyzing massive datasets. The course explores algorithms for uncovering patterns in data, with a practical emphasis. Students will learn to apply data mining techniques to solve real-world problems.
