UTF-8 BOM results in crash (scalaxb, closed)

eed3si9n commented on July 25, 2024
UTF-8 BOM results in crash

Comments (1)

eed3si9n commented on July 25, 2024

Fixed in c708f3a.

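override def toRawSchema(value: File) = {
  // Number of leading bytes to inspect; also the size of the push-back buffer.
  val BOM_SIZE = 4
  val EF = 0xEF.toByte
  val BB = 0xBB.toByte
  val BF = 0xBF.toByte
  val FE = 0xFE.toByte
  val FF = 0xFF.toByte
  val bom = Array.ofDim[Byte](BOM_SIZE)
  val in = new java.io.PushbackInputStream(new java.io.FileInputStream(value), BOM_SIZE)
  // read returns -1 for an empty file; clamp to 0 so unread never gets a negative length.
  val readSize = math.max(in.read(bom, 0, bom.length), 0)
  // Match only the bytes that were actually read.
  val (bomSize, encoding) = bom.take(readSize).toList match {
    case EF :: BB :: BF :: _ => (3, "UTF-8")
    case FE :: FF :: _       => (2, "UTF-16BE")
    case FF :: FE :: _       => (2, "UTF-16LE")
    case _                   => (0, "UTF-8")
  }
  // Push back everything after the BOM so the reader starts at the first real character.
  in.unread(bom, bomSize, readSize - bomSize)
  readerToRawSchema(new BufferedReader(new java.io.InputStreamReader(in, encoding)))
}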

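This goes back to Sun not fixing Bug ID 4508058 (UTF-8 encoding does not recognize initial BOM): Java's UTF-8 decoder passes a leading BOM through as the character U+FEFF instead of skipping it, so the BOM has to be detected and removed by hand before the content is parsed.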
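
To see the decoder behaviour in isolation, here is a minimal standalone sketch. It is not scalaxb code: `BomDemo` and `bomAwareReader` are hypothetical names made up for this illustration, and the input is an in-memory byte array rather than a schema file. It first shows that a plain `InputStreamReader` hands back U+FEFF for a UTF-8 stream that starts with a BOM, then applies the same push-back approach as the fix above.

```scala
import java.io.{BufferedReader, ByteArrayInputStream, InputStream, InputStreamReader, PushbackInputStream}
import java.nio.charset.StandardCharsets

object BomDemo {
  // Hypothetical helper (not part of scalaxb): returns a reader positioned
  // after any leading UTF-8 or UTF-16 BOM, using the matching encoding.
  def bomAwareReader(raw: InputStream): BufferedReader = {
    val bomSize = 4
    val buf = new Array[Byte](bomSize)
    val in = new PushbackInputStream(raw, bomSize)
    // read returns -1 on an empty stream, so clamp to 0.
    val n = math.max(in.read(buf, 0, buf.length), 0)
    // Compare as unsigned values, looking only at the bytes actually read.
    val (skip, encoding) = buf.take(n).toList.map(_ & 0xFF) match {
      case 0xEF :: 0xBB :: 0xBF :: _ => (3, "UTF-8")
      case 0xFE :: 0xFF :: _         => (2, "UTF-16BE")
      case 0xFF :: 0xFE :: _         => (2, "UTF-16LE")
      case _                         => (0, "UTF-8")
    }
    // Push back the bytes that follow the BOM.
    in.unread(buf, skip, n - skip)
    new BufferedReader(new InputStreamReader(in, encoding))
  }

  def main(args: Array[String]): Unit = {
    val bytes = Array(0xEF, 0xBB, 0xBF).map(_.toByte) ++ "<a/>".getBytes(StandardCharsets.UTF_8)

    // Plain reader: the BOM comes through as the first character.
    val plain = new InputStreamReader(new ByteArrayInputStream(bytes), StandardCharsets.UTF_8)
    println(plain.read() == 0xFEFF) // prints "true"

    // BOM-aware reader: the content starts at the first real character.
    val fixed = bomAwareReader(new ByteArrayInputStream(bytes))
    println(fixed.readLine()) // prints "<a/>"
  }
}
```

Like the fix, this sketch relies on a single `read` call filling the buffer, and it does not distinguish a UTF-32LE BOM (FF FE 00 00) from a UTF-16LE one.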
