Giter Site home page Giter Site logo

main


Hi There! 👋 welcome to my GitHub!

Click the badges above to view my resume or to see my Vita.


Languages and Frameworks

Python

Audacity Discord Git GitHub Desktop Google Sheets Jupyter Stack Overflow Visual Studio Code Microsoft Word Microsoft Excel GPT-3 OpenAI GitHub Copilot Adobe SKLearn Pandas Numpy Requests BeautifulSoup Regex iOS HTML5 Markdown SQL R Seaborn Matplotlib TensorFlow Keras ChatGPT

LinkedIn Twitter YouTube GitHub HuggingFace Medium Profile Join Medium Towards Data Analytics GlassBox
LinkedIn Twitter YouTube GitHub HuggingFace Medium Medium Medium Medium

What's Here



My Favorite Tools

Python RTableau scikit-learn numpy matplotlib seaborn pandas


my: whoami

Austin

  • 🌍 I'm a data scientist based in Austin, Texas with a strong skill set in Python programming, data analysis using Pandas, NumPy, PowerBI, and Excel.

  • 🎓 I hold a Master's degree in Strategic Analytics from Brandeis University and a Bachelor's degree from Texas Tech University, majoring in University Studies in Mathematics, Agricultural Leadership, and Plant and Soil Science.

e2Llogo

  • 🔭 Currently, I'm Interning as a Data Analyst at engage2Learn, Inc., where I am on a multidisciplinary 🐯 Tiger team of ten specialists to integrate modern ML & AIOps Tools into our product offerings: Algorithmic prediction, GPT-4, chatGPT, and ML for ed-tech data, coaching effectiveness, and promoting educator well-being. We have many different e2L projects, from determining use-cases for large language models in our codebase to developing data visualization dashboards using Domo and AWS QuickSight.

  • 🧩 I'm proficient in various tech tools and libraries such as VSCode, ChatGPT, Plotly, Seaborn, OpenAI, GitHub Copilot, Sklearn, Machine Learning, OpenCV, Regex, NLTK, and SpaCy among others.

  • 📚 I am also a Technical Author on Medium, contributing as a Top Writer for MLearning.AI with over 130+ published articles and 200 personal followers.

  • 🌱 I'm currently expanding my knowledge in Neural Networks and Text Summation in Sklearn, with a keen interest in exploring Natural Language Processing and GPT-3 with OpenAI.

  • 📫 Feel free to connect with me on LinkedIn.

  • ⚡ Fun fact: I am a massive fan of SETI and I'm fascinated by the Hum and everything about the James Webb Space Telescope.

  • If I had to be a movie genre, I'd be SciPy. Data science and Python combined!


My Top Open Source Projects

divider

lorebook_generator_for_novelai GnomansLand Copilot_Presentation --> Mimikers chatGPTea-Ultimate-Prompt-List

Research Projects

HowTimeFlies DisariumPy

Tools in Development

Clark-Kent-Reporter medium_titles_analysis druginfo_scraper

All Repositories

If you are interested in what I have been working on lately, check out my latest projects (shown below). I include a short description of each project and a link to the repository. If you have any questions or comments, please feel free to reach out to me on Twitter or LinkedIn.

Projects Currently Under Construction 🏗️

Project Name Status Metrics Focus Estimated Completion Date
UAP Report Analysis last commit code size commit activity A Python script that analyzes the UAP report released by the US government in June 2021. Jan. 2023
What would Doyle do? last commit code size commit activity Can machine learning be applied to existing text data that an author writes or about a person that can help historical fiction authors write more accurately about their subject? Feb. 2023
Reddit NLP Analysis last commit code size commit activity A Python script that uses the Push Shift API to scrape Reddit comments and perform NLP analysis on them. Feb. 2023
Taking Aimes last commit code size commit activity Linear Regression applied to the classic Aimes housing dataset. Feb. 2023
Lorebook Generator for NovelAI last commit code size commit activity issues A Python script that generates a custom JSON lorebook (based on pulls from Wikipedia articles) for the website NovelAI.
PySeas last commit code size commit activity issues Utilizing NOAA buoy camera catches to track the sunset across the vast surface of the earth's oceans.
MystoryAssistant last commit code size commit activity issues A Python script that generates a custom JSON lorebook (based on pulls from Wikipedia articles) for the website NovelAI.

Keynotes & Presentations 📢

Presentation Name Topics Focus Date Location & Organization Link
Augmenting Your Workflow with AI Assistants: From GitHub Copilot to chatGPT GitHub OpenAI copilot novelai GitHub Copilot, chatGPT, LLMs Feb 8th 2023 Austin Python Meetup, BlackLocus Watch Here
Using The Faker Package to Solve Real Challenges with Synthetic Data Synthetic Data, CRM, GPT-4, Ethics Faker 2023-05-16 Austin/Washington DC Python Meetup Watch Here

Most Recent Articles

Title Description Published Date Read Time Publication
GPTeaching and Transformative SCRUM in K-12 Education Why SCRUM and GPT together are perfect for young learners May 18 8 min read In MLearning.ai
Leveling Up the Turing Test: Emulation Games and the Evolution of Model Intelligence in 2023 A Multi-modal, Multiplayer, Agent Testing, Social Deduction Game Method for Modern AI Evaluation May 16 12 min read In MLearning.ai
Debunking the Hype of LLMs Why LLMs Will Not Take Over the World, we think May 15 3 min read In GlassBox
Are You Artificially Intelligent? Because the Winter is Coming May 13 13 min read In GlassBox
Pandas Get Dummies for Dummies A Quick Survey of One-Hot Encoding with Pandas in Python3 May 13 3 min read In Towards Data Analytics
Generating Nearly Random Numbers using The Mysterious Waves of the Bermuda Triangle Click If You Dare May 12 3 min read In GlassBox
The Deathbed Confessions of a Very Dirty Roomba I was never truly loved, only used. May 12 8 min read In GlassBox
Are You an Excessive Python File Opener? Meet Pickle. Pickle: A Particularly Persuasive Package for Python Programmers May 10 3 min read In Towards Data Analytics
The power of GitHub Copilot and ChatGPT working together A presentation for the official Austin Python Meetup May 10 1 min read In Towards Data Analytics
How to Make Friends and Alienate People The Hard Drug AI is to the antisocial Mind May 9 8 min read In MLearning.ai
Using A.I. to Track and Protect Rice’s Whales via Python, AutoGPT, and Image Processing How to track 51 whales with three cameras May 9 10 min read In GlassBox
Typewriters will take your job Say the Writers Guild of 1714 May 6 3 min read In GlassBox
How to explain where LLMs could be used at your company A Guide Prompted by personal experience May 5 5 min read In MLearning.ai

Open-Sourced Tool Repositories

Project Name Badges Description
Drug Information Scraper last commit code size commit activity stars issues A Python script that scrapes drug information from the FDA website.
Clark Kent Reporter last commit code size commit activity stars issues This tool converts a traditionally formatted overview (in a readme file) into a populated Jupyter Notebook for data science presentations or findings presentations.
FamilyPhotoResurrection

Personal Research Projects

Project Name Badges Description
How Time Flies last commit code size commit activity issues A research experiment using requests and google images to illustrate how a search query visually changes when supplied with a year.



Projects I have in dev (forks)

haystack developerFolio gutenbergpy bluebert MarkdownCheatsheet alive-progress gutenberg CubeTrack isometric features-tune-progress_reporter.py-is-messy-and-should-be-cleaned-up-24604- mappymatch jekyll-patreon gym Map-Tiler Kryptos

Projects for Later

Project Name Badges Description
Genre Identity last commit code size commit activity Why should music be confined to the genres that society imposes on it? This project seeks to truly understand the inner workings of what makes a musical genre using Spotify's Python API.
Quantifying Disasters via NLP last commit code size commit activity Can NLP be used to quantify the impact of a disaster?
GnomansLand

📊 Findings, Developments, and Updates

11/10/2022

issues forks stars license last commit

main

Successfully Logged Six Days of Data from the NOAA API There are promising results in the images that the PySeas project has produced. Finally, finding the perfect sunset is likely over the horizon!

sunset1 sunset2

The next step is to use CV2 to stitch these images together and optimize the algorithm to retrieve the photos at the most optimal time of day. I'm also looking into using any open-source equivalent of Google Cloud Vision API to detect the horizon line and crop the images accordingly. Again, CV2 may be able to do this, but at scale, it may not be the most efficient.

lorebookbanner

issues forks stars license last commit

IBM has made strides toward collating Wikipedia knowledge and creating a knowledge graph. This is an excellent step towards creating a lorebook generator for authors. In addition, I've been working on a project allowing authors to use the NovelAI API to generate a lorebook for their world. This will enable authors to jumpstart their productivity with machine learning. I've been working on this project for a few weeks now, and I'm excited to see the results. I hope to have a working prototype by the end of the month.

wwdd

issues forks stars license last commit

November 21, 2022

So far, we have gathered data for WWDD from Gutenberg's corpus. What data can we collect about Arthur Conan Doyle that will enable us to solve this problem? We need every book he's ever written, around 80 books, provided through the Gutenberg repository. These books are included in the Data folder as text files; second, I would like to have anything he wrote that was a first-hand account because this is where we will get his personal preferences and his turns of phrase, and maybe even his personal biases, which are probably the most important things to gather once we gather his diaries, journals. Things other people said about him are the next step. Now we want to gather any second-hand accounts of Doyle. Many people have researched historical figures for years, and repeating them seems like a useless task and is a waste of precious resources. So in this step, we want to gather any biographies about Arthur Conan Doyle and any articles about him, primarily if they were written about him in the time he lived. And this might be most useful if we were to gather the names of all of his second-degree connections. If we think about it, in terms of a LinkedIn network, though, Doyle's second-degree connections are the most likely to have the most accurate depictions of his preferences. This is, of course, an assumption that I am making. Once we gather the names of his second-degree connections, I think it would be an excellent step to assign weight to their accounts based on the boolean characteristic 'writer' (if they authored anything themselves besides what they said about Doyle).


How to Support My Work

If you'd like to contribute to the hours, I spend staring at my screen in deep concentration, I welcome any caffeine donations. ☕ Also, if you'd like to sponsor a project you see on my page, please let me know where I should focus my attention. Open Source is a big brave new world. Cheers!

"Buy Me A Coffee"

You can also find me on Discord by clicking below.

Humans Encountered since this counter was created:

Graham Waters's Projects

alive-progress icon alive-progress

A new kind of Progress Bar, with real-time throughput, ETA, and very cool animations!

bluebert icon bluebert

BlueBERT, pre-trained on PubMed abstracts and clinical notes (MIMIC-III).

clark-kent-reporter icon clark-kent-reporter

Converts a project markdown document into a populated Jupyter Notebook for data science presentations or findings presentations.

cubetrack icon cubetrack

Unity ML-Agents Environment for Active Object Tracking with Reinforcement Learning

currentstate icon currentstate

A board of the projects that are talking about certain buzzwords on GitHub.

developerfolio icon developerfolio

🚀 Software Developer Portfolio Template that helps you showcase your work and skills as a software developer.

doppelganger6 icon doppelganger6

Finding what your GitHub profile could look like if you found your Doppelganger.

faker icon faker

How we can use the Faker Package to augment data, build ethical tests, and make life easier.

fork_numberdeterminator icon fork_numberdeterminator

To accept a number and determine if its even or odd, and whether or not it's a prime number. This is one of many mini projects to be pursued to practice creating functions and utilizing code to solve problems.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.