help!!!! help!!!!!! help!!!!!

2 views
Skip to first unread message

Shashwat Shukla

unread,
Jun 10, 2009, 8:46:29 AM6/10/09
to iitr-i...@googlegroups.com
i have to create a database of 2lac entries form a site. 
Entries are in doc file.(actually they are resumes)
I have to separate them as name,email,ph,address, qualifications like various fields. so that various searches can be applied to them.

does anyone know any software or engine or method to do that?
please tell me? 
i have a rocket behind me.
even a slightest help may put off rocket.

ashish....@gmail.com

unread,
Jun 10, 2009, 8:52:33 AM6/10/09
to iitr interns
i think pdf files can be parsed into xml

look for some tool to do this

then you can easily enter data from xml to any database

Shashwat Shukla

unread,
Jun 10, 2009, 8:55:54 AM6/10/09
to iitr-i...@googlegroups.com

they are doc files and converting 2lac files to pdf????

i found that do files can also be read and used in php and then a database can be created .

how long would making such a program will take??

is there any available software for this?

vishal raja

unread,
Jun 10, 2009, 9:02:14 AM6/10/09
to iitr-i...@googlegroups.com
shashwat ye database kaise banate hai bhai

Shashwat Shukla

unread,
Jun 10, 2009, 9:02:29 AM6/10/09
to iitr-i...@googlegroups.com

actually i said that i can make it, but it is needed urgently.

Shashwat Shukla

unread,
Jun 10, 2009, 9:03:18 AM6/10/09
to iitr-i...@googlegroups.com
abe kuch nahin, mujhe bhi nahin aata.

bas pata hai ki xml aur sql se ban jaayega.

vishal raja

unread,
Jun 10, 2009, 9:05:14 AM6/10/09
to iitr-i...@googlegroups.com
batade bhai...

vishal raja

unread,
Jun 10, 2009, 9:05:45 AM6/10/09
to iitr-i...@googlegroups.com
abe matlab poore doc files save ho jati hai ?

Shashwat Shukla

unread,
Jun 10, 2009, 9:16:39 AM6/10/09
to iitr-i...@googlegroups.com
are usme se name seach kar ke uske aage se name utha lo.
email identify kar lo aur save lar lo
aise hi poore file main se jo bhi nikale utha ke save kar lo.

WebProg....

unread,
Jun 11, 2009, 2:43:57 AM6/11/09
to iitr interns
teri doc files ka naam random hai ya kisi sequence mai hai ya unka koi
database hai??





ek doc file mai se to sara content read karke database bana sakte
hai.....
if there wd be seq to loop chala denge.

pradeep yadav

unread,
Jun 11, 2009, 5:54:22 AM6/11/09
to iitr-i...@googlegroups.com
database to c# pe banega swamin e yaha banaya bhi tha
usse pooch sakte ho

ANSHUL AGRAWAL

unread,
Jun 11, 2009, 5:56:47 AM6/11/09
to iitr-i...@googlegroups.com
database yaar kisi mai bhi ban jaayega.
bas files specific pattern mai honi chahiye.
ye sashwat kaha mar gaya.........
jab aaye tab to kuchh ho..

Shashwat Shukla

unread,
Jun 11, 2009, 7:17:28 AM6/11/09
to iitr-i...@googlegroups.com
dekh bhai


2lac resumes hain. (.doc file)

usme naam , address , qualification, email id wagarah likhi hongi.

ab batao ki ye saari chezen usme se kaise nikalengi.

name , add kaise identify hoga?

ks language se hoga aur database kisme banega.

ANSHUL AGRAWAL

unread,
Jun 11, 2009, 7:23:08 AM6/11/09
to iitr-i...@googlegroups.com
mere bhai tu kisi bhi language mai ye kaam kar sakta hai............
either php or c# or any other.........
php ka code mujhe pata hai
c# mai bahut simple hota hai saare functions .net mai de rakhi hote hai.

php ka code ye hai--->
<?php


function parseWord($userDoc)
{
$fileHandle = fopen($userDoc, "r");
$line = @fread($fileHandle, filesize($userDoc));
$lines = explode(chr(0x0D),$line);
$outtext = "";
foreach($lines as $thisline)
{
$pos = strpos($thisline, chr(0x00));
if (($pos !== FALSE)||(strlen($thisline)==0))
{
} else {
$outtext .= $thisline." ";
}
}
$outtext =
preg_replace("/[^a-zA-Z0-9\s\,\.\-\n\r\t@\/\_\(\)]/","",$outtext);
return $outtext;
}

$userDoc = "testdoc.doc";

$text = parseWord($userDoc);

//print_r($text);

for($i=0;$i<strlen($text);$i++)
{echo $text[$i].' ';}

//echo $text;


?>


it would read out the whole doc file .
ab tu ismai search method laga ke names.......vagairah extract kar lena....


ye to ho gaya ek doc file ka

ab 2 lac files hai to agar unka naam special pattern mai hai to loop chala do.
vo bhi ho jaayega nd ur problem wd solve.

ab tu mujhe ye bata jo doc files hai vo kaha hai aur kya naam hai un files ka???

Shashwat Shukla

unread,
Jun 11, 2009, 8:05:34 AM6/11/09
to iitr-i...@googlegroups.com
thanks


wo doc files ek folder main padi hai.
ayr unka naam anshul_agarwal.doc jaise hai
par wo to number mein ho jayega
\

par yaar name aur address kaise identify hoga?
aur ye code keval file open karke sweep (ya jo bhi kahte hon) karta hai ya aur bhi kuch karta hai?

ANSHUL AGRAWAL

unread,
Jun 11, 2009, 8:11:00 AM6/11/09
to iitr-i...@googlegroups.com
$text to string hai jismai sara content aa jayega doc file ka.
tab tu search method laga dena
like if name occurs
tab this this ....

mai code bana dunga if u wd send me a template of resume.

ANSHUL AGRAWAL

unread,
Jun 11, 2009, 8:17:57 AM6/11/09
to iitr-i...@googlegroups.com
yaar shashwat ye bata number mai kaise karega doc files ko???

Shashwat Shukla

unread,
Jun 11, 2009, 10:24:53 AM6/11/09
to iitr-i...@googlegroups.com
are kuch nahin yar pahle sabko select karo

rename  kar do a.doc

fir saari files rename ho jayegi  a.doc

a(1).doc
a(2).doc
.
.
.
.
a(54345432).doc

fir command prompt se ja ke "a(  " aur ) hata do

bas ho gaya

 

ANSHUL AGRAWAL

unread,
Jun 11, 2009, 10:26:58 AM6/11/09
to iitr-i...@googlegroups.com
are vaah yaar
kaise socho
bond hai tu to
v v v vv v...gud.......

ANSHUL AGRAWAL

unread,
Jun 11, 2009, 10:27:18 AM6/11/09
to iitr-i...@googlegroups.com
then teri prob solve ho gayi??

ANSHUL AGRAWAL

unread,
Jun 11, 2009, 10:27:56 AM6/11/09
to iitr-i...@googlegroups.com
aur haan ye bata database kismai chahiye sql mai ya excel file mai
save karna hai??

Shashwat Shukla

unread,
Jun 11, 2009, 10:35:13 AM6/11/09
to iitr-i...@googlegroups.com
are nahin yaar  mujhe php nahin aati.
to agar code bana pada to pahle php seekhni padegi.
aur mere bo000000000000000000sssssssssssssssssssssss ko koi "kiya karaya kaam" chahiye. jo download ho jaaye
1 ,2 din me
.

aur resume to sabke alag format me honge.
waise iske baare me koi fix information nahin  hai. main pata kar ke batata hun.

aur agar koi software ya script already ho jo docs se email id exctract kar le to mera kuch to kaamho jayega.

soumya mohan

unread,
Jun 11, 2009, 10:39:08 AM6/11/09
to iitr-i...@googlegroups.com
kitni mari padi hai tum logon ki :P
--
With regards
Soumya Mohan
BTech IIIrd yr Computer Science and Engineering
IIT Roorkee
+91 9760030904

Shashwat Shukla

unread,
Jun 11, 2009, 10:38:47 AM6/11/09
to iitr-i...@googlegroups.com
kisi mein  bhi chalega.
sql better padega.
uspe search wagarah badiya chelegi

2009/6/11 ANSHUL AGRAWAL <ansh...@gmail.com>

WebProg....

unread,
Jun 12, 2009, 12:42:30 AM6/12/09
to iitr interns
only email id
tab to v v vv simple
tab to kisi pattern ki bhi jaroorat nahi padegi
tujhe php nahi sikhni padegi tujhe only run karna hoga....
tera data apne aap mysql database mai pahunch jayega...

Shashwat Shukla

unread,
Jun 12, 2009, 6:13:57 AM6/12/09
to iitr-i...@googlegroups.com
ok thanks.

abhi to musibat tal gayi hai.
agar dubara aayi to bataunga.

WebProg....

unread,
Jun 12, 2009, 6:17:05 AM6/12/09
to iitr interns
koi software mila???
Reply all
Reply to author
Forward
0 new messages