I am trying to extract the summary compensation table from the page
linked below. I have saved the page source on my drive as a text file
(table.txt).
If I use the following code to search for the string "Summary
Compensation", s is always none. From what I understand, s should be
assigned the string value "Summary Compensation".
I am new to Python and Beautiful Soup. Any help is greatly
appreciated.
import glob
import codecs
import csv
from BeautifulSoup import BeautifulSoup
soup = BeautifulSoup(open("table.txt").read())
print soup.prettify()
s = soup.findAll(text="Summary Compensation")
print s
http://www.sec.gov/Archives/edgar/data/1314102/000119312508210683/ddef14a.htm
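
One likely reason the search above comes back empty: when the text argument is a plain string, Beautiful Soup only matches text nodes that equal that string exactly, so a heading such as "Summary Compensation Table" would not match. Passing a compiled regular expression matches on a substring instead. The sketch below illustrates the difference. It assumes the newer bs4 package on Python 3 (find_all and string= rather than the findAll and text= used in the code above), and SAMPLE_HTML and find_text are hypothetical stand-ins, not the real contents of table.txt, whose markup may differ.

```python
import re

from bs4 import BeautifulSoup

# Hypothetical stand-in for table.txt; the real filing's markup may differ.
SAMPLE_HTML = """
<html><body>
<p><b>Summary Compensation Table</b></p>
<table><tr><td>Name</td><td>Salary</td></tr></table>
</body></html>
"""


def find_text(html, pattern):
    """Return the text nodes in html that match pattern (hypothetical helper)."""
    soup = BeautifulSoup(html, "html.parser")
    return soup.find_all(string=pattern)


# A plain string must equal the whole text node, so this finds nothing here.
exact = find_text(SAMPLE_HTML, "Summary Compensation")

# A compiled regex is applied with search(), so a substring is enough.
# IGNORECASE and \s+ are guesses at how the real heading might be written.
partial = find_text(
    SAMPLE_HTML, re.compile(r"summary\s+compensation", re.IGNORECASE)
)

# From a matched text node, walk forward to the next <table> element.
table = partial[0].find_next("table") if partial else None

print(exact)
print(partial)
print(table)
```

If the heading in the real file is split across several tags or lines, even the regular expression may not match a single text node, so it is worth checking the prettified output to see how the heading is actually stored.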