
hsharp's People

Contributors

anduin2017, gitter-badger, hellooying, imba-tjd, j178, nugetninja

hsharp's Issues

May cause an infinite loop when parsing some websites.

var httpService = new HTTPService(null);
var strData = await httpService.Get(new Aiursoft.Pylon.Models.AiurUrl("https://bsaber.com/"), false);
var htmlDoc = HtmlConvert.DeserializeHtml(strData);
When parsing the site https://bsaber.com/, DeserializeHtml never returns. The longest I waited was about 10 minutes.
This is with the latest version on NuGet.
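
Until the cause of the hang is found, a caller can at least avoid blocking forever. The following is a minimal sketch of a generic timeout guard, not a fix for the parser itself. `ParseGuard` and `TryRun` are hypothetical names introduced here for illustration; `Task.Run` requires .NET Framework 4.5 or later.

```csharp
using System;
using System.Threading.Tasks;

// Hypothetical helper, not part of HSharp.
public static class ParseGuard
{
    // Runs `work` on a thread-pool thread and waits at most `timeout`.
    // Returns false on timeout. The work is NOT cancelled in that case:
    // a parser stuck in a loop keeps occupying its thread.
    // If `work` throws, Wait rethrows the failure as an AggregateException.
    public static bool TryRun<T>(Func<T> work, TimeSpan timeout, out T result)
    {
        Task<T> task = Task.Run(work);
        if (task.Wait(timeout))
        {
            result = task.Result;
            return true;
        }
        result = default(T);
        return false;
    }
}
```

With the names from the report above, a call might look like `ParseGuard.TryRun(() => HtmlConvert.DeserializeHtml(strData), TimeSpan.FromSeconds(30), out var htmlDoc)`. Because the stuck thread is never reclaimed, this only suits occasional use, for example while collecting pages that reproduce the problem.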

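Is the design of the HDoc indexers appropriate?

Hello, I often watch your videos on YouTube and have learned a great deal from them.
I am also a die-hard fan of C# and .NET.

The project caught my interest, so I read through your code.
I see the way HDoc retrieves elements a little differently, so please allow me to offer a few humble suggestions.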

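  • During parsing, should a Dictionary be used as an index in addition to the List,
    to make lookup by name more efficient?
    Every call to Children.Find(t => t.TagName == tagName)
    scans the whole list, which may perform poorly when there are many elements.

  • About the indexers HTag this[string tagName] and HTag this[int tagIndex]:
    neither of them does any exception handling. With chained calls like the one in the example (newDocument["html"]["head"]["meta",0]),
    a wrong name or index causes a null reference or out-of-range exception, and it is hard to find where the error occurred.
    Should they throw an exception with a clear message stating which name or which index was wrong?

  • This may just be my personal preference,
    but I find ["meta"][0] more readable than ["meta",0].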
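
The first two suggestions can be sketched together as follows. This is a minimal illustration under stated assumptions, not HSharp's actual implementation: `Node` is a hypothetical stand-in for HTag/HDoc, and the case-insensitive tag comparison is an assumption made for the example.

```csharp
using System;
using System.Collections.Generic;

// Hypothetical stand-in for an HTML node; NOT HSharp's HTag or HDoc.
public class Node
{
    public string TagName { get; }

    // Children in document order.
    private readonly List<Node> _children = new List<Node>();

    // Secondary index: tag name -> children with that name, in document order.
    // Case-insensitive matching is an assumption for this sketch.
    private readonly Dictionary<string, List<Node>> _byName =
        new Dictionary<string, List<Node>>(StringComparer.OrdinalIgnoreCase);

    public Node(string tagName) { TagName = tagName; }

    // Appends a child and keeps the name index in sync. Returns this node for chaining.
    public Node Add(Node child)
    {
        _children.Add(child);
        if (!_byName.TryGetValue(child.TagName, out var sameName))
        {
            sameName = new List<Node>();
            _byName[child.TagName] = sameName;
        }
        sameName.Add(child);
        return this;
    }

    // First child with the given tag name.
    public Node this[string tagName] => this[tagName, 0];

    // The index-th child with the given tag name.
    // Failures name the parent, the requested tag, and the index.
    public Node this[string tagName, int index]
    {
        get
        {
            if (!_byName.TryGetValue(tagName, out var sameName))
                throw new KeyNotFoundException(
                    $"<{TagName}> has no child <{tagName}>.");
            if (index < 0 || index >= sameName.Count)
                throw new ArgumentOutOfRangeException(nameof(index), index,
                    $"<{TagName}> has only {sameName.Count} <{tagName}> child(ren).");
            return sameName[index];
        }
    }
}
```

In a chain such as `doc["head"]["meta", 1]`, a misspelled tag now fails with a message naming both the parent and the missing child, and the lookup no longer scans every child. The cost is one extra dictionary per node and the need to update the index wherever children are added or removed.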

Unable to get the expected value of the Son property

I found that when a snippet like the following is inserted into a table,

<P>4.下列关于PC机外存储器,错误的是__________</P>
<P>A.移动硬盘通常采用USB接口或eSata接口,可随时插上或拔下(即支持"热插拔")</P>
<P>B.存储卡是闪存做成的一种固态存储器,所有存储卡的形状,大小,接口都是相同的</P>
<P>C.固态硬盘(简称SSD)是基于半导体存储器芯片的一种存储设备,其接口规范、功能及使用方法与普通硬盘几乎相同</P>

iterating over the Son of each row does not directly yield the complete content inside the p tags.

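An innertext shorter than 2 characters in an HTML tag causes "index out of range"

Thank you for your selfless contribution. I ran into a problem while using the library today and would appreciate your help in solving it. The details are as follows:

Environment: Windows 10, C#, Visual Studio 2019, project targeting .NET Framework 4.0

HTML string: <table><tr><td>6</td></tr></table>

Using Doc["table"]["tr"]["td"].Son to get the number 6 throws System.ArgumentOutOfRangeException: "Index was out of range. Must be non-negative and less than the size of the collection." After a lot of digging, I found that the error occurs whenever the innertext of an HTML tag is shorter than 2 characters. In this example, if the innertext of the td tag is 60 or 06, there is no problem.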
