Database System Concepts
Database System Concepts
7th Edition
ISBN: 9780078022159
Author: Abraham Silberschatz Professor, Henry F. Korth, S. Sudarshan
Publisher: McGraw-Hill Education
Bartleby Related Questions Icon

Related questions

Question

It should be a C++ PROGRAM INCLUDE THE HEADER FILE, MAIN CPP AND BSTREE.CPP

There is a real program developed by a computer company that reads a report ( running text ) and issues warnings on style and partially correct bad style. You are to write a simplified version of this program with the following features:

Statistics

A statistical summary with the following information:

  • Total number of words in the report
  • Number of unique words
  • Number of unique words of more than three letters
  • Average word length
  • Average sentence length
  • An index (alphabetical listing) of all the unique words (see next page for a specific format)

Style Warnings

Issue a warning in the following cases:

  • Word used too often: list each unique word of more than three letters if its usage is more than 5% of the total number of words of more than three letters
  • Sentence length : write a warning message if the average sentence length is greater than 10
  • Word length : write a warning message if the average word length is greater than 5

Input

From the keyboard: The name of the file containing the text to be analyzed

From the file: The report to be analyzed.

Output

  1. Write the following information to a file:
  • The name of the input file
  • The statistical summary of the report ( see Statistics above )
  • The style warnings ( see Style Warnings above )

Data Structures

A BST of unique words in the report, created as the file is read. If a word is not in the list, put it there. If it is, increment a counter showing how many times the word has been used.

Definitions:

Word: Sequence of letters ending in a blank, a period, an exclamation point, a question mark, a colon, a comma, a single quote, or a semicolon. You may assume that numbers do not appear in the words; they may be ignored.

Unique word: Words that are spelled the same, ignoring uppercase and lowercase distinctions.

Sentence: Words between end of markers.

SAMPLE OUTPUT

_________________________________________________________________________

FILE NAME: chapter.txt

STATISTICAL SUMMARY

TOTAL NUMBER OF WORDS: 987

TOTAL NUMBER OF "UNIQUE" WORDS: 679

TOTAL NUMBER OF "UNIQUE" WORDS OF MORE THAN THREE LETTERS: 354

AVERAGE WORD LENGTH: 8 characters

AVERAGE SENTENCE LENGTH: 12 words

STLE WARNINGS

WORDS USED TOO OFTEN: ( WORDS OF MORE THAN 3 LETTERS THAT ARE USED MORE THAN 5% OF THE TOTAL NUMBER OF WORDS OF MORE THAN 3 LETTERS )

1) Well

2) Total

3) Good

4) Since

5) Because

6) Little

AVERAGE SENTENCE LENGTH TOO LONG – 12 words

AVERAGE WORD LENGTH TOO LONG – 8 characters

INDEX OF UNIQUE WORDS

A

and

all

around

...

B

be

because

but

...

Expert Solution
Check Mark
Knowledge Booster
Background pattern image
Learn more about
Need a deep-dive on the concept behind this application? Look no further. Learn more about this topic, computer-science and related others by exploring similar questions and additional content below.
Similar questions
    Recommended textbooks for you
    Text book image
    Database System Concepts
    Computer Science
    ISBN:9780078022159
    Author:Abraham Silberschatz Professor, Henry F. Korth, S. Sudarshan
    Publisher:McGraw-Hill Education
    Text book image
    Starting Out with Python (4th Edition)
    Computer Science
    ISBN:9780134444321
    Author:Tony Gaddis
    Publisher:PEARSON
    Text book image
    Digital Fundamentals (11th Edition)
    Computer Science
    ISBN:9780132737968
    Author:Thomas L. Floyd
    Publisher:PEARSON
    Text book image
    C How to Program (8th Edition)
    Computer Science
    ISBN:9780133976892
    Author:Paul J. Deitel, Harvey Deitel
    Publisher:PEARSON
    Text book image
    Database Systems: Design, Implementation, & Manag...
    Computer Science
    ISBN:9781337627900
    Author:Carlos Coronel, Steven Morris
    Publisher:Cengage Learning
    Text book image
    Programmable Logic Controllers
    Computer Science
    ISBN:9780073373843
    Author:Frank D. Petruzella
    Publisher:McGraw-Hill Education