
Comments (8)

TieWay59 commented on August 29, 2024 4

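I have written down everything I could think of. For now it stands at about 8,000 characters:

https://xlab2017.yuque.com/fo4x1u/kb4iyb/is148uliy9xfg3y8?singleDoc# "The Structure of Open Source Projects"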

from oss101.

Bruce-Jay commented on August 29, 2024 3

I would like to help add content on video subtitles + PPT

You are welcome, please find the resource here~

Here is an example: https://www.yuque.com/bifenglin/oormdi/fmgd2gwe6y321a0r?singleDoc#

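https://www.yuque.com/g/easonlin-0hgnr/ggvzqr/collaborator/join?token=a6ycXy2IiEqfDbPT# Invitation to co-edit the knowledge base "oss101"

Since I do not have a Yuque membership yet, please join the knowledge base and preview the illustrated tutorial I edited for Lecture 2 to see whether it is workable.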

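I first downloaded the video, then used the transcription feature of Feishu Minutes (飞书妙记) to turn it into text, and then used ChatGPT to strip the colloquial filler words and rewrite the result as text that is more formal, more readable, and denser in content. In ChatGPT I entered the following prompt: "The following is a fairly colloquial passage. Please rewrite it in a more formal, written style. Requirements: do not compress the original too much, keep the original examples where appropriate, and write in the speaker's voice." I then added a line break followed by the transcribed text, and the output became my illustrated lesson.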
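The prompt assembly described above can be sketched in Python. This is only a rough illustration, not the exact procedure I followed: `INSTRUCTION` is an English rendering of the prompt quoted above, and the helper names `split_transcript` and `build_prompt`, as well as the `max_chars` chunk size, are hypothetical. Splitting the transcript into chunks is an added assumption, on the guess that a full lecture transcript may be too long to paste in one go. The sketch only builds the prompt strings and does not call any ChatGPT API.

```python
# English rendering of the prompt quoted above (the wording is an assumption).
INSTRUCTION = (
    "The following is a fairly colloquial passage. Please rewrite it in a more "
    "formal, written style. Requirements: do not compress the original too much, "
    "keep the original examples where appropriate, and write in the speaker's voice."
)


def split_transcript(transcript: str, max_chars: int = 2000) -> list[str]:
    """Group non-empty lines into chunks of at most max_chars characters.

    Hypothetical helper: a single line longer than max_chars is kept whole.
    """
    chunks = []
    current = ""
    for para in (line.strip() for line in transcript.split("\n")):
        if not para:
            continue  # skip blank lines left over from the transcription
        candidate = f"{current}\n{para}" if current else para
        if current and len(candidate) > max_chars:
            chunks.append(current)
            current = para
        else:
            current = candidate
    if current:
        chunks.append(current)
    return chunks


def build_prompt(chunk: str) -> str:
    """Instruction, then a line break, then the transcribed text."""
    return f"{INSTRUCTION}\n{chunk}"


if __name__ == "__main__":
    transcript = "First spoken paragraph.\n\nSecond spoken paragraph."
    for chunk in split_transcript(transcript, max_chars=30):
        print(build_prompt(chunk))
        print("-----")
```

Each printed prompt can then be pasted into ChatGPT by hand, and the rewritten chunks joined in order afterwards.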

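In addition, I asked GPT for a Python program that converts every page of a PDF into an image, which makes inserting the slides into a Yuque document much easier. What remained was matching the images to the text, proofreading the text, and making the passages flow together. The whole process took about five hours, which was a fairly heavy task for me.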

import os
import fitz  # PyMuPDF
from PIL import Image

def pdf_to_images(pdf_file, output_folder):
    # Create the output folder if it does not exist yet.
    os.makedirs(output_folder, exist_ok=True)
    doc = fitz.open(pdf_file)

    for pg in range(doc.page_count):
        page = doc[pg]
        # Render the page to an RGB pixmap without an alpha channel.
        pix = page.get_pixmap(alpha=False)
        img = Image.frombytes("RGB", (pix.width, pix.height), pix.samples)
        # Pages are numbered from 0: page_0.png, page_1.png, ...
        img.save(os.path.join(output_folder, f"page_{pg}.png"))

if __name__ == "__main__":
    pdf_file = "Lecture02.pdf"  # path to your input PDF file
    output_folder = "output_images_02"  # folder for the output images
    pdf_to_images(pdf_file, output_folder)


Bruce-Jay commented on August 29, 2024 1

I would like to help add content on video subtitles + PPT


will-ww commented on August 29, 2024 1

I would like to help add content on video subtitles + PPT

You are welcome, please find the resource here~

Here is an example: https://www.yuque.com/bifenglin/oormdi/fmgd2gwe6y321a0r?singleDoc#


will-ww commented on August 29, 2024 1

Wow, great work, many thanks~


will-ww commented on August 29, 2024

The content to be built for this lecture includes:


will-ww commented on August 29, 2024

The repository from last year's open source course by 伍泰炜 (Wu Taiwei), a very good reference: https://github.com/X-lab2017/22-Spring-OSDD


will-ww commented on August 29, 2024

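I have written down everything I could think of. For now it stands at about 8,000 characters:

https://xlab2017.yuque.com/fo4x1u/kb4iyb/is148uliy9xfg3y8?singleDoc# "The Structure of Open Source Projects"

Impressive, really impressive~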
