Counting Words and Characters in a Text File: A Java Program

  • Context: Comp Sci 
  • Thread starter Thread starter Hiche
  • Start date Start date
  • Tags Tags
    File Java Program
Click For Summary
SUMMARY

The discussion focuses on creating a Java program that counts words and characters in a text file using the provided "In" class. The initial code had issues with reading input, specifically using StdIn instead of the In class. The corrected approach involves using in0.readString() to read words and incrementing character counts accurately. The final solution successfully compiles and executes, providing accurate word and character counts when a .txt file is passed as an argument.

PREREQUISITES
  • Java programming fundamentals
  • Understanding of file I/O in Java
  • Familiarity with the In class provided by the instructor
  • Basic string manipulation techniques in Java
NEXT STEPS
  • Explore the In class documentation to understand its methods and properties
  • Learn about Java file handling using java.nio.file package
  • Investigate regular expressions for more advanced word counting
  • Study Java's Scanner class for reading input from files
USEFUL FOR

Students learning Java, educators teaching file I/O concepts, and developers needing to implement text processing in Java applications.

Hiche
Messages
82
Reaction score
0

Homework Statement



Write a program that takes a file name as argument and counts the number of words and the number of characters (without spaces) in the document.

Homework Equations


The Attempt at a Solution



How exactly should my program take a .txt file as an argument? Do I simply:

Code:
String str = args[0];

And the contents of the .txt file are stored as strings? (The file is a text file with only letters).

And how to check for each word inside that file? Let it read the string till encountering a space then (by previously declaring a count variable) increment count? I'm confused here. Is it that same as just studying a string?
 
Last edited:
Physics news on Phys.org
You can count the numbers of words by reading the lines from the .txt file and splitting them using String Tokenizers.


http://download.oracle.com/javase/1.4.2/docs/api/java/util/StringTokenizer.html
 
Last edited by a moderator:
Sadly, we are limited to using the In class for this program. I came up with this but it is not working:

Code:
public class DocumentS
{
	public static void main(String[] args)
	{
		In in0 = new In(args[0]);
		
		int wordCount = 0;
		int charCount = 0;
		
		while (!StdIn.isEmpty())
		{
			wordCount++;
			String str = in0.readAll(); 
			for (int i = 1; i <= str.length(); i++)
				charCount++; 
		}
		System.out.println("Number of words: " + wordCount);
		System.out.println("Number of characters: " + charCount);
	}
}

It compiles fine but when I pass the .txt file as an argument, nothing happens. I don't know if the algorithm is right, but I really need help in this.
 
I do see some problem in the above code but i don't know what exactly, so sorry. Also, i don't know much about "In class", although i found something on "In.class" online and modified it a bit to count the numbers of words and characters in a file and it works fine. Would you like to know about that? or you would like to stick to your version and fix it?
 
I saw there are functions like
readString() // it reads one word at a time
readChar() // it reads one character at a time



Therefore use these functions.
Also take filename as follows

String file = args[0];
In in0 = new In(file);
 
Hiche said:
Sadly, we are limited to using the In class for this program.
I don't believe that the In class is present in standard Java, but is a class that your instructor provides for you to use. Since you are limited to using just this class, you need to understand the methods and properties that are exposed by this class. We can't point you in the right direction without knowing what things are in this class.
 
Last edited:
Code:
public class DocumentS
{
	public static void main(String[] args)
	{
		In in0 = new In(args[0]);
		
		int wordCount = 0;
		int charCount = 0;
		
		while (!in0.isEmpty())
		{
			wordCount++;
			String str = in0.readString(); 
			for (int i = 1; i <= str.length(); i++)
				charCount++; 
		}
		System.out.println("Number of words: " + wordCount);
		System.out.println("Number of characters: " + charCount);
	}
}

I managed to fix this (I changed StdIn.isEmpty() to in0.isEmpty() since we are looping until the end of the file we attached rather than wait for no input which is StdIn). The In class is part of an external library our instructor provided for us. I though many people knew about it, but I guess I was wrong. The library is in this link if anyone is remotely interested: http://www.sendspace.com/file/t7wqmp

Thanks for the replies.
 

Similar threads

  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 23 ·
Replies
23
Views
9K
Replies
1
Views
3K
  • · Replies 14 ·
Replies
14
Views
4K
  • · Replies 15 ·
Replies
15
Views
3K
  • · Replies 5 ·
Replies
5
Views
4K
  • · Replies 12 ·
Replies
12
Views
2K
  • · Replies 7 ·
Replies
7
Views
2K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 7 ·
Replies
7
Views
3K