JavaScript is disabled on your browser.
Skip navigation links
  • Summary:
  • Nested |
  • Field |
  • Constr |
  • Method
  • Detail:
  • Field |
  • Constr |
  • Method
org.htmlunit.util

Class EncodingSniffer



  • public final class EncodingSniffer
    extends Object 
    Sniffs encoding settings from HTML, XML or other content. The HTML encoding sniffing algorithm is based on the HTML5 encoding sniffing algorithm.
    Author:
    Daniel Gredler, Ahmed Ashour, Ronald Brill, Lai Quang Duong
    • Method Detail

      • sniffEncodingFromMetaTag

        public static Charset sniffEncodingFromMetaTag(InputStream is)
         throws IOException 
        Attempts to sniff an encoding from an HTML meta tag in the specified byte array.
        Parameters:
        is - the content stream to check for an HTML meta tag
        Returns:
        the encoding sniffed from the specified bytes, or null if the encoding could not be determined
        Throws:
        IOException - if an IO error occurs
      • extractEncodingFromContentType

        public static Charset extractEncodingFromContentType(String s)
        Extracts an encoding from the specified Content-Type value using the IETF algorithm; if no encoding is found, this method returns null.
        Parameters:
        s - the Content-Type value to search for an encoding
        Returns:
        the encoding found in the specified Content-Type value, or null if no encoding was found
      • sniffEncodingFromXmlDeclaration

        public static Charset sniffEncodingFromXmlDeclaration(InputStream is)
         throws IOException 
        Searches the specified XML content for an XML declaration and returns the encoding if found, otherwise returns null.
        Parameters:
        is - the content stream to check for the charset declaration
        Returns:
        the encoding of the specified XML content, or null if it could not be determined
        Throws:
        IOException - if an IO error occurs
      • sniffEncodingFromCssDeclaration

        public static Charset sniffEncodingFromCssDeclaration(InputStream is)
         throws IOException 
        Parses and returns the charset declaration at the start of a css file if any, otherwise returns null.

        e.g.

        @charset "UTF-8"
        Parameters:
        is - the input stream to parse
        Returns:
        the charset declaration at the start of a css file if any, otherwise returns null.
        Throws:
        IOException - if an IO error occurs
      • toCharset

        public static Charset toCharset(String charsetName)
        Returns Charset if the specified charset name is supported on this platform.
        Parameters:
        charsetName - the charset name to check
        Returns:
        Charset if the specified charset name is supported on this platform
      • translateEncodingLabel

        public static String translateEncodingLabel(String encodingLabel)
        Translates the given encoding label into a normalized form according to Reference.
        Parameters:
        encodingLabel - the label to translate
        Returns:
        the normalized encoding name or null if not found
Skip navigation links
  • Summary:
  • Nested |
  • Field |
  • Constr |
  • Method
  • Detail:
  • Field |
  • Constr |
  • Method

Copyright © 2002–2025 Gargoyle Software Inc.. All rights reserved.

AltStyle によって変換されたページ (->オリジナル) /