Giter Site home page Giter Site logo

michaelcurrin / url-manager Goto Github PK

View Code? Open in Web Editor NEW
33.0 4.0 6.0 109 KB

Centralize, view, edit, label and organize collections of your favorite URLs ๐Ÿ”— ๐Ÿ“™

License: MIT License

Python 95.81% Shell 4.19%
python3 sqlobject sql url bookmarks organiser links urls python onetab

url-manager's Introduction

URL Manager ๐Ÿ”— ๐Ÿ“™

Centralize, view, edit, label and organize collections of your favorite URLs

GitHub tag License

Made with Python

dependency - SQLObject dependency - beautifulsoup4

A meta bookmark manager to rule them all.

About

The aim is to help you create and find your URL more efficiently and within a personalised structure. The result is a tool for you to import browser bookmark-related URL data from various sources and manage them in a single structured, easy-to-search data source.

Import data from the following data sources:

  • Bookmark files
    • Firefox
    • Chrome
  • History file
  • OneTab data
  • A CSV created by hand in the required format

You can also add and manage records with a command-line tool.

Documentation

  • Read this project's docs directory.
  • plyvel Python library to access LevelDB.
    • This is used for getting OneTab data from Chrome's LevelDB storage.
    • This have proven to cause some issues and the location changed I think, so scraping the frontend (for Firefox and Chrome) or using the text export might be easier than using LevelDB.
    • See also this LevelDB Wiki page and article.

License

Released under MIT.

url-manager's People

Contributors

dependabot[bot] avatar michaelcurrin avatar snyk-bot avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar

url-manager's Issues

Split browser tools into new repo

Create a new Python repo. Handle just OneTrust and Bookmark parsing for browsers.

Use Github action to publish package?

Then install from URL manager. Use Github URL? PyPI?

Name? just-browsing. high-brow

Add unit tests for parsing content.
Add integration tests to run on each system to check file path is still valid and content can be parsed in some minimal way. Like identify format or structure or count rows or checking keys.

Write OneTab data to configed directory.

See browser_one_tab_extraction.md, Instead of writing to stdout and a full path in the below sections, write to a file in the project using detailed parameters or a filename.

Add args to transformer.py

Use args to limit to printing preview or counting items without writing.
And to restrict to read only certain file types or file name instead of everything available.
It will called on everything in some cases but one individual files in others such as for development or when new data is needed to be used which wasn't around on initial imports.

Add config

Such as to set the profile for Firefox

abcd1234.default

If Firefox OneTab integration part of the script still works.

Split transformer.py

Split transformer into lib files which can be run standalone like a util. Or create user-facing parts of utils and transformer can use those or be a util itself.

Update transformer paths

transformer.py

Add logic to ensure folder's data is written to a unique key. As duplicate folder names are lost here at

folders[subfolder_name] = child_data

Consider different DB and ORM

Lookup MongoDB example which uses links.

SQLObject gives flexibility of SQLite / Postgres for easy deployment pipeline.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.