scribe with mapr ?

172 views
Skip to first unread message

bciceron

unread,
Apr 4, 2012, 2:16:13 PM4/4/12
to scribe...@googlegroups.com
hello,

we've got scribe working in ubuntu 10.04 and able to run all config under examples/ but when it comes to writing to hdfs:/// ( in our case maprfs:///path) using examples/hdfs_example.conf it failes with :

...
[Sat Mar 31 08:12:13 2012] "[hdfs] Connecting to HDFS for maprfs:///mapr/mycluster.com/tmp/ben/scribe/test5/hk-support-01"
[Sat Mar 31 08:12:15 2012] "[hdfs] disconnecting fileSys for maprfs:///mapr/mycluster.com/tmp/ben6/scribe/test5/hk-support-01"
[Sat Mar 31 08:12:15 2012] "[hdfs] disconnected fileSys for maprfs:///mapr/mycluster.com/tmp/ben6/scribe/test5/hk-support-01"
[Sat Mar 31 08:12:15 2012] "[test5] Failed to create/open file of type for writing"
[Sat Mar 31 08:12:15 2012] "Exception: hdfsListDirectory call failed"

1/ the maprfs uri is valide and accessible from this host.
2/ the user has r/w access to the parent directory,

MC Srivas

unread,
Apr 6, 2012, 12:43:03 AM4/6/12
to Scribe Server
Ben, did you try answers.mapr.com? Someone from there can help you
out real fast.
> 3/ also patched withhttps://github.com/traviscrawford/scribe/commit/7dbd50c496fb2924e2f7c...
> to no avail.

Leonid

unread,
Apr 6, 2012, 12:29:58 PM4/6/12
to Scribe Server
Ben,

Instead of linking with libhdfs.so, you should be linking with
libMapRClient.so to access maprfs efficiently. Please look to
this guide

"Using C to Interface with MapR-FS"
http://mapr.com/doc/display/MapR/Working+with+MapR-FS



On Apr 4, 10:16 am, bciceron <bcice...@gmail.com> wrote:
> hello,
>
> we've got scribe working in ubuntu 10.04 and able to run all config under
> examples/ but when it comes to writing to hdfs:/// ( in our case
> maprfs:///path) using examples/hdfs_example.conf it failes with :
>
> ...
> [Sat Mar 31 08:12:13 2012] "[hdfs] Connecting to HDFS for
> maprfs:///mapr/mycluster.com/tmp/ben/scribe/test5/hk-support-01"
> [Sat Mar 31 08:12:15 2012] "[hdfs] disconnecting fileSys for
> maprfs:///mapr/mycluster.com/tmp/ben6/scribe/test5/hk-support-01"
> [Sat Mar 31 08:12:15 2012] "[hdfs] disconnected fileSys for
> maprfs:///mapr/mycluster.com/tmp/ben6/scribe/test5/hk-support-01"
> [Sat Mar 31 08:12:15 2012] "[test5] Failed to create/open file of type for
> writing"
> [Sat Mar 31 08:12:15 2012] "Exception: hdfsListDirectory call failed"
>
> 1/ the maprfs uri is valide and accessible from this host.
> 2/ the user has r/w access to the parent directory,

Subhash

unread,
Apr 9, 2012, 7:32:37 PM4/9/12
to scribe...@googlegroups.com
Ben,

I have attached a patch to make Scribe work with MapR. You will have to run `bootstrap.sh && make`.
The change to README in the patch describes how to run configure for MapR, inside bootstrap.sh

Also, how do I submit this patch to the Scribe community ?

Thanks
Subhash
scribe_patch.txt

bciceron

unread,
Apr 9, 2012, 11:16:47 PM4/9/12
to scribe...@googlegroups.com

great ! was able to build and run with no error both maprfs_example*.conf . but when checking the destination directory and files in maprfs the simply dont exist. i used the same test5 that before. see server side output athttps://gist.github.com/adc9280c28ebbbc5f3d9

bciceron

unread,
Apr 10, 2012, 2:15:11 PM4/10/12
to scribe...@googlegroups.com
when testing the above with a simple client like :

echo "test888" | ./examples/scribe_cat test8

it fails to write to maprfs :

/target/ops/code/scribe/bin/scribed -c ~/FromFtp/repo/scribe/examples/maprfs_example.conf 
[Wed Apr 11 02:04:32 2012] "setrlimit error (setting max fd size)" 
[Wed Apr 11 02:04:32 2012] "STATUS: STARTING" 
[Wed Apr 11 02:04:32 2012] "STATUS: configuring" 
[Wed Apr 11 02:04:32 2012] "got configuration data from file </target/home/bensix/FromFtp/repo/scribe/examples/maprfs_example.conf>" 
[Wed Apr 11 02:04:32 2012] "CATEGORY : default" 
[Wed Apr 11 02:04:32 2012] "Creating default store" 
[Wed Apr 11 02:04:32 2012] "configured <1> stores" 
[Wed Apr 11 02:04:32 2012] "STATUS: " 
[Wed Apr 11 02:04:32 2012] "STATUS: ALIVE" 
[Wed Apr 11 02:04:32 2012] "Starting scribe server on port 1463" 
Thrift: Wed Apr 11 02:04:32 2012 libevent 1.4.14b-stable method epoll
[Wed Apr 11 02:04:45 2012] "[test8] Creating new category store from model default" 
[Wed Apr 11 02:04:45 2012] "store thread starting" 
[Wed Apr 11 02:04:45 2012] "[hdfs] Connecting to HDFS for maprfs:///ben6/scribe/test8/hk-support-01" 
[Wed Apr 11 02:04:46 2012] "[hdfs] disconnecting fileSys for maprfs:///ben6/scribe/test8/hk-support-01" 
[Wed Apr 11 02:04:46 2012] "[hdfs] disconnected fileSys for maprfs:///ben6/scribe/test8/hk-support-01" 
[Wed Apr 11 02:04:46 2012] "[hdfs] Connecting to HDFS for maprfs:///ben6/scribe/test8/hk-support-01/test8-2012-04-11_00000" 
[Wed Apr 11 02:04:47 2012] "[hdfs] opened for append maprfs:///ben6/scribe/test8/hk-support-01/test8-2012-04-11_00000" 
[Wed Apr 11 02:04:47 2012] "[test8] Opened file <maprfs:///ben6/scribe/test8/hk-support-01/test8-2012-04-11_00000> for writing" 
[Wed Apr 11 02:04:47 2012] "[test8] Opened file </target/home/bensix/FromFtp/tmp/test5/test8/test8_00000> for writing" 
[Wed Apr 11 02:04:47 2012] "[test8] Changing state from <DISCONNECTED> to <SENDING_BUFFER>" 
[Wed Apr 11 02:04:47 2012] "[test8] read <0> entries of <0> bytes from file </target/home/bensix/FromFtp/tmp/test5/test8/test8_00000>" 
[Wed Apr 11 02:04:47 2012] "[test8] No more buffer files to send, switching to streaming mode" 
[Wed Apr 11 02:04:47 2012] "[test8] Changing state from <SENDING_BUFFER> to <STREAMING>" 

then it hangs.

here is the config :
port=1463
max_msg_per_second=2000000
check_interval=1
max_queue_size=100000000
num_thrift_server_threads=2


# DEFAULT - write all messages to hadoop
<store>
category=default
type=buffer

target_write_size=20480
max_write_interval=1
buffer_send_rate=1
retry_interval=30
retry_interval_range=10

<primary>
type=file
fs_type=maprfs
file_path=maprfs:///ben6/scribe
create_symlink=no
use_hostname_sub_directory=yes
base_filename=thisisoverwritten
max_size=1000000000
rotate_period=daily
rotate_hour=0
rotate_minute=5
add_newlines=1
</primary>


<secondary>
type=file
fs_type=std
file_path=/target/home/bensix/FromFtp/tmp/test5
base_filename=thisisoverwritten
max_size=3000000
</secondary>
</store>

Subhash

unread,
Apr 10, 2012, 2:37:24 PM4/10/12
to scribe...@googlegroups.com
I figured out the problem. Due to some reason, the path maprfs:///scribedata is being interpreted as a relative path and a directory /maprfs:// is getting created.
A workaround would be to just specify /scribedata/ in maprfs_example.conf and maprfs_example2.conf. I have enclosed an updated diff.

We will investigate this further and if it is a bug, will fix this in our next release of MapR.

Thanks
-Subhash
scribe_patch.txt

bciceron

unread,
Apr 10, 2012, 3:07:38 PM4/10/12
to scribe...@googlegroups.com
YES !
Reply all
Reply to author
Forward
0 new messages