19 questions
- Bountied 0
- Unanswered
- Frequent
- Score
- Trending
- Week
- Month
- Unanswered (my tags)
0
votes
1
answer
175
views
Weird Exception from SgmlReader
I'm using SgmlReader to parse HTML files in C#. I'm using the sample code provided on their website:
using (reader = File.OpenText(fileName))
{
try
{
...
1
vote
0
answers
201
views
Why does building with Microsoft.Xml.SgmlReader 1.8.18 nuget output this warning MSB2346
I'm using the Microsoft.Xml.SgmlReader nuget in several places of my .Net Core 3.1 projects.
The build output says:
C:\Program Files (x86)\Microsoft Visual Studio2019円\Professional\MSBuild\Current\Bin\...
0
votes
1
answer
969
views
Parse SGML using beautiful soup
Below is the sample content :-
<WKEXT-META-ATTRS>
<WKEXT-META-ATTR NAME="uri" VALUE="http://sample.com/ceres/wk-us/Concept/i8148" DATA-TYPE="OTHER"></WKEXT-...
0
votes
1
answer
127
views
Is there a way to have ANY element be able to place anywhere within a SGML DTD?
I am building a DTD for a tool that will modify only attributes. I do not care if it is DTD compliant.
I am looking to have ANY element be able to be a child of any other element. Just to keep the ...
0
votes
0
answers
40
views
SgmlReader infinite loop on large document?
I've got this project to scrape data off of the SEC Edgar site. Part of the task is to get the meat of the whole filing, and I was just testing some of that today.
I ran into this somewhat large ...
0
votes
0
answers
159
views
How to get hidden data in a HTML file
I try to get comments of a Instagram post with C#. But the thing is there is this 'Load more comments' button which as it refers does its job and when i take a look a Firefox HTML codes all of the ...
user avatar
user10911520
1
vote
1
answer
181
views
Using SGML Reader to close single HTML tags
I am using SgmlReader to generate XHTML output from HTML. The question is regarding single HTML tags like
<IMG>
When using SgmlReader, this tag is generated as
<IMG></IMG>
I ...
0
votes
0
answers
68
views
Stax does not ready characters like "“" [duplicate]
I'm doing a Sgml parse with Stax. The Sgml contains characters like "“ ”" and many others that is not replaced setting the UTF-8.
The parse breaks and throws the following exception:
...
2
votes
2
answers
1k
views
Usage of SGML(Standard Generalized Markup Language) [closed]
SGML is the superset of HTML and XML. There are rich HTML and XML parsers available. Could you please explain me the
Usage of SGML (Sample business scneario) in current bussiness domains?
is it when ...
2
votes
1
answer
1k
views
Perl: Can't Build SGML::Parse::OpenSP in Mac OSX and Proxy Errors
I am running Perl 5.16.2 on OSX 10.9.1 and I am having difficulty installing modules from CPAN. I successfully installed
sudo cpan App::cpanminus
However, when I try something like
curl -L http://...
0
votes
1
answer
92
views
Use SgmlReader on Window Phone Application
Is there any way to include SgmlReader for Window Phone Application?
I can't install it by Install-package SgmlReader
incompatiable asssembly error
if this is not possible,is there any alternative ...
0
votes
1
answer
289
views
How to stop SGMLReader to add missing tags like <html> in output
I'm using SGML Reader to format my html snippets as shown here https://github.com/MindTouch/SGMLReader. Problem is it adds missing tags like in the code. Is there is any way to stop that.
1
vote
0
answers
179
views
SgmlReader embedded DTD in Nuget package
I reported a couple of days ago abut DTD schema not embeded in nuget package of SgmlReader.
The post there
But I get a responce then it's embeded and in order to use it I have to set DocType as "...
0
votes
1
answer
629
views
SgmlReader and System.Xml Windows 8
I'm trying to convert HTML stream to XML using SgmlReader for further parsing. This conversion is part of an APP i'm developing for Windows 8 Store. Below is the method that convert Html to XML:-
...
1
vote
1
answer
608
views
Removing jquery and CSS from an Xml Document
I'm using sgmlreader to convert HTML to XML. The output goes into a XmlDocument object, which I can then use the InnerText method to extract the plain text from the website. I'm trying to get the ...