can u give me guys a logic how to count the content( number of words) of .htm files
ex.
html>
<head>
<title>Capture</title>
<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1">
</head>
<body>
hello world<br>
<table width="365" border="1">
<tr>
<td width="158">test me </td>
<td width="191">and here </td>
</tr>
<tr>
<td>anywhere</td>
<td>what else </td>
</tr>
<tr>
<td>dudes</td>
<td> </td>
</tr>
</table>
<div align="center"><br>
home | email me| about me
</div>
</body>
</html>
the result will be
number of words :15
contains :
hello world,test me ,and, here ,anywhere,what, else,dudes,about, me,email, me,home