How to pull big data using hb.scan

hb.scan problem

Oct 22, 2013, 8:49:22 AM
to rha...@googlegroups.com
Hi All,

I ran the command below and it returned an empty list():
> row_scan <- hb.scan('clickstream_big', 20001, colspec = c("cf1:"))
> row_scan$get()
list()

But when I run the same code with start = 1, it returns the proper values:

> row_scan <- hb.scan('clickstream_big', 1, colspec = c("cf1:"))
> row_scan$get()

[[1000]]
[[1000]][[1]]
[1] "1074593131#e87181a25c798db9b94618e107f85cb8"

[[1000]][[2]]
[1] "cf1:ipaddr"   "cf1:list0"    "cf1:session"  "cf1:shop_id"  "cf1:unixtime"
[6] "cf1:vPage"

[[1000]][[3]]
[[1000]][[3]][[1]]
[1] "66.77.73.176"

[[1000]][[3]][[2]]
[1] "834"

[[1000]][[3]][[3]]
[1] "e87181a25c798db9b94618e107f85cb8"

[[1000]][[3]][[4]]
[1] "11"

[[1000]][[3]][[5]]
[1] "1074593131"

[[1000]][[3]][[6]]
[1] "dt"

Can anybody help me with this?

Thanks,
Manish

David Champagne

Oct 23, 2013, 2:21:11 PM
to rha...@googlegroups.com
Hi,

It's difficult to know what the issue is without knowing what is in your HBase table. I would suggest first using the 'scan' command in the HBase shell to see whether you can access records starting at row 20001. Here's a link to a reference guide for the shell commands: http://learnhbase.wordpress.com/2013/03/02/hbase-shell-commands/
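
One thing worth checking, offered as a sketch rather than a diagnosis: HBase orders rows by comparing row keys as byte strings, and the start row of a scan is a key, not a row count. The key shown in the output above begins with "1074593131#", so if every key in the table begins with "1", a start row of "20001" would sort after all of them and the scan would return nothing. The snippet below emulates that comparison in plain Ruby (the HBase shell is itself a JRuby console). It assumes the start argument reaches HBase as the string "20001"; only the first key comes from this thread, and the other two keys and the helper name keys_from are made up for illustration.

```ruby
# Row keys in the style of the table above. Only the first key is taken
# from the thread; the other two are hypothetical examples.
row_keys = [
  "1074593131#e87181a25c798db9b94618e107f85cb8",
  "1074593140#hypothetical-session-a",
  "1074600000#hypothetical-session-b",
]

# Hypothetical helper that emulates a scan: keep every key that sorts at
# or after the start row when keys are compared as byte strings.
def keys_from(row_keys, start_row)
  start_key = start_row.to_s # assumption: the start row is sent as a string
  row_keys.sort.select { |key| key >= start_key }
end

puts keys_from(row_keys, 1).length     # prints 3: every key sorts after "1"
puts keys_from(row_keys, 20001).length # prints 0: "2..." sorts after "1..."
```

If that is what is happening here, a command such as scan 'clickstream_big', {STARTROW => '20001', LIMIT => 5} in the HBase shell should also come back empty, while the same command with a start row taken from a real key should return rows.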

David