Message from discussion help wanted: scan and analyze your sp services bills

MIME-Version: 1.0
In-Reply-To: <F9519497-DA60-4CB2-B8F0-13226C94A...@odd-e.com>
References: <CAGBx7GjnLN1zwKf2f6s1TSufQTUk_aCsRzW=SfdqOUCaLkD...@mail.gmail.com>
	<E64C0E01-86FD-4670-9E7C-5D5A6096B...@gmail.com>
	<CAGBx7GjbTMqMSpsWU+qw-Ot_-aVHC4j_M0fz1rMXRXTUB5J...@mail.gmail.com>
	<F9519497-DA60-4CB2-B8F0-13226C94A...@odd-e.com>
Date: Sun, 22 Jul 2012 19:43:59 +0800
Message-ID: <CAGBx7GhSRLTYPkqA2yFwagTL+r1aA7Ntsgf3y9MtpqZ9J77...@mail.gmail.com>
Subject: Re: [SRB] help wanted: scan and analyze your sp services bills
From: Paul Gallagher <gallagher.p...@gmail.com>
To: singapore-rb@googlegroups.com
Content-Type: multipart/alternative; boundary=f46d0401236f1ac23804c569a551

--f46d0401236f1ac23804c569a551
Content-Type: text/plain; charset=UTF-8

Hola Bas,

Sorry, no, I haven't found a gem for that. I was looking for something
similar in order to redact my SP Services bills so I could add them as
samples without leaking personal information, but I didn't find anything
that quite met my needs (specifically, substituting text exactly while
keeping everything else about the document structure intact).

The closest I came was with the Multivalent tools
<http://multivalent.sourceforge.net>, but they're Java. Depending on
exactly what you need, you might be able to make use of things like
Uncompress <http://multivalent.sourceforge.net/Tools/pdf/Uncompress.html>
(I think it's similar to what you can do with PDFBox
<http://pdfbox.apache.org/commandlineutilities/WriteDecodedDoc.html>).
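
For what it's worth, here is a minimal Ruby sketch of the "substitute text
exactly" idea, assuming the PDF has already been uncompressed so that text
shows up as plain `(...) Tj` operators in the content stream. The helper name
`redact_tj_strings` and the sample stream are made up for illustration and do
not come from any gem. Masking with the same number of bytes leaves the stream
length and every byte offset after it unchanged, which is what keeps the rest
of the file structure valid.

```ruby
# Hypothetical helper (not part of any gem): mask a secret inside literal
# strings shown by the Tj operator in an already-uncompressed PDF content
# stream, keeping the byte length identical.
def redact_tj_strings(stream, secret, mask_char = "X")
  raise ArgumentError, "mask_char must be one byte" unless mask_char.bytesize == 1

  # Match "(...)" followed by the Tj operator. Escaped characters are allowed
  # inside the literal; nested unescaped parentheses are not handled.
  stream.gsub(/\((?:\\.|[^\\()])*\)(?=\s*Tj)/) do |literal|
    # Replace each occurrence of the secret with a mask of the same length.
    literal.gsub(secret) { |hit| mask_char * hit.bytesize }
  end
end

# Made-up sample content stream for illustration.
sample = "BT /F1 12 Tf 72 720 Td (Account No: 1234567890) Tj ET"
redacted = redact_tj_strings(sample, "1234567890")

puts redacted
puts "same length: #{redacted.bytesize == sample.bytesize}"
```

Real bills are messier: text is often split across `TJ` arrays or drawn with
subset fonts whose character codes are not plain ASCII, so treat this as a
starting point only.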


If you come up with anything, let us know!

On Sun, Jul 22, 2012 at 7:27 PM, Bas Vodde <b...@odd-e.com> wrote:

>
> Hola Paul,
>
> Nice job :)
>
> Slightly unrelated question. Do you know any gems other than Prawn for PDF
> generation? Especially ones that can replace text but still take care of
> spacing and things like that.
> I've been looking for one for a while and procrastinating on writing one :)
> (this is another procrastination :P)
>
> Bas
>
>
> On 22-Jul-2012, at 6:13 PM, Paul Gallagher wrote:
>
> > On Sat, Jul 21, 2012 at 9:29 PM, Meng Weng Wong <mengw...@gmail.com> wrote:
> > Today, SP. Tomorrow:
> >
> > - bank statements
> > - credit card statements
> > - other utility statements
> > - invoices
> > - and any other structured data which gets flattened to paper or PDF.
> >
> > exactly;-)
> >
> > Ideally data custodians will see the light and open up machine-readable
> > access to the data they hold about us(!!) .. but until then,
> > *re*-combobulating data with tools will help demonstrate people really do
> > want the data.
> >
> > Actually, to that end, I just extracted the PDF-parsing smarts from
> > sps_bill to another gem: https://github.com/tardate/pdf-reader-turtletext
> >
> > If anyone's interested in taking aim at parsing another PDF source,
> > pdf-reader-turtletext gives a little more of a leg up than raw pdf-reader
> > (and I have a few more DSL-like ideas I think I'll add)
> >
> >
> > There are a handful of "open and scan" mail handler services like
> > earthclassmail.com but none of them are smart. I'd love a service that
> > would convert my snailmail to data structures available over a combination
> > of push and pull APIs compatible with, say, ifttt.com. Then I could
> > script it and trigger auto-payments, etc.
> >
> > Your work also connects to a Big Data idea that came up at the
> > Quantified Self meetup last week. Working title: the Fishbowl Flag.
> >
> > I like the name;-) I should get along to a Quantified Self meetup -
> > sounds like my kind of bof
> >
> >
> > --
> > You received this message because you are subscribed to the Google
> > Groups "Singapore Ruby Brigade" group.
> > To post to this group, send email to singapore-rb@googlegroups.com.
> > To unsubscribe from this group, send email to
> > singapore-rb+unsubscribe@googlegroups.com.
> > For more options, visit this group at
> > http://groups.google.com/group/singapore-rb?hl=en.

--f46d0401236f1ac23804c569a551--