Re: dataset import failure due to time format

25 views
Skip to first unread message
Message has been deleted
Message has been deleted

Omar

unread,
Apr 9, 2014, 3:46:41 PM4/9/14
to public-d...@googlegroups.com
Hi Kim,

It looks like your PDV 2 has two populations for 2013.11. I believe that's what's causing the error, since every combination of dimension values should only appear once in a slice data table.

Best,
-Omar


On Wednesday, April 9, 2014 11:41:36 AM UTC-4, Kim Pellegrin wrote:
forgot to include the dataset :-)

On Wednesday, April 9, 2014 5:39:59 PM UTC+2, Kim Pellegrin wrote:

Here under the import error message I receive when I try to import my dataset ( my full dataset is in the document included to this message)

My statistics
Apr 8, 2014 at 5:41:02 PM
Failed
( source: /test_import/PDVID_pop_2013_slice.csv at or near line: 5 ): Exception in parsing data rows from source caused by Found duplicate time value in slice data '2013-11-01T00:00:00.000Z' for dimension values [2]. Note: Time values may have been normalized based on specified granularity.
DSPL Outline
Concepts
population
country
PDV_ID
MARQUE
TYPE_CLIENT
Slices
PDVID_pop_2013_slice
Topics
geography
social_indicators
population_indicators
Files
countries.csv (11 Rows)
POINT_DE_VENTE.csv (892 Rows)
MARQUE.csv (2 Rows)
TYPE_CLIENT.csv (3 Rows)
PDVID_pop_2013_slice.csv (0 Rows)

the table PDVID_pop_2013_slice.csv  was created as follow:

PDV_ID,month,population
2,2013.12,1
2,2013.11,1
2,2013.11,1
3,2013.11,1
3,2013.09,1
3,2013.10,1
4,2013.10,4
4,2013.10,1
5,2013.11,1
5,2013.10,3
7,2013.10,1
7,2013.09,2
8,2013.10,1
10,2013.12,1
12,2013.10,1
12,2013.10,1
13,2013.11,1
14,2013.11,1
15,2013.11,1
16,2013.11,1
18,2013.09,1
19,2013.10,1
20,2013.10,1
21,2013.08,2
22,2013.08,1
23,2013.08,2
24,2013.10,1

and xml file contains the following about the concerned slice and tables:



  <concepts>
    <!-- As noted in the tutorial, this concept should extend quantity:amount.-->
    <concept id="population" extends="quantity:amount">
      <info>
        <name>
          <value>population</value>
        </name>
        <description>
          <value>Size of the resident population.</value>
        </description>
      </info>
      <topic ref="population_indicators"/>
      <type ref="integer"/>
    </concept>

    <concept id="country" extends="geo:location">
      <info>
        <name>
          <value>country</value>
        </name>
        <description>
          <value>My list of countries</value>
        </description>
      </info>
      <type ref="string"/>
      <property id="name">
        <info>
          <name><value xml:lang="en">Country name</value></name>
          <description>
            <value xml:lang="en">The official name of the country</value>
          </description>
        </info>
        <type ref="string"/>
      </property>
      <table ref="countries_table"/>
    </concept>


    <concept id="PDV_ID" extends="geo:location">
      <info>
        <name>
          <value>Point de vente</value>
        </name>
        <description>
          <value>point de vente</value>
        </description>
      </info>
      <type ref="string"/>
      <property concept="country" isParent="true"/>
      <table ref="PDV_table"/>
   </concept>


    <concept id="MARQUE">
        <info>
          <name>
          <value>MARQUE</value>
 </name>
          <description>
            <value>name of the brand</value>
          </description>
        </info>
        <type ref="string"/>
      <table ref="MARQUE_table"/>
    </concept>
    
    
    <concept id="TYPE_CLIENT">
        <info>
          <name>
          <value>TYPE_CLIENT</value>
 </name>
          <description>
            <value>type client</value>
          </description>
        </info>
        <type ref="string"/>     
<table ref="TYPE_CLIENT_table"/>
    </concept>


   </concepts>



  <slices>


    <slice id="PDVID_pop_2013_slice">
      <dimension concept="PDV_ID"/>
      <dimension concept="time:month"/>    
      <metric concept="population"/>
      <table ref="PDVID_pop_2013_slice_table"/>
    </slice>


  </slices>



  <tables>

    <table id="countries_table">
      <column id="country" type="string"/>
      <column id="name" type="string"/>
      <column id="latitude" type="float"/>
      <column id="longitude" type="float"/>
      <data>
        <file format="csv" encoding="utf-8">countries.csv</file>
      </data>
    </table>

    <table id="MARQUE_table">
      <column id="MARQUE" type="string"/>
      <column id="name" type="string"/>
      <data>
        <file format="csv" encoding="utf-8">MARQUE.csv</file>
      </data>
    </table>

    <table id="TYPE_CLIENT_table">
      <column id="TYPE_CLIENT" type="string"/>
      <column id="name" type="string"/>
      <data>
        <file format="csv" encoding="utf-8">TYPE_CLIENT.csv</file>
      </data>
    </table>

    <table id="PDV_table">
    <column id="PDV_ID" type="string"/>
      <column id="POINT_DE_VENTE" type="string"/>
      <column id="country" type="string"/>
      <column id="MARQUE" type="string"/>
      <column id="latitude" type="float"/>
      <column id="longitude" type="float"/>
      <data>
        <file format="csv" encoding="utf-8">POINT_DE_VENTE.csv</file>
      </data>
    </table>

    <table id="PDVID_pop_2013_slice_table">
      <column id="PDV_ID" type="string"/>
      <column format="yyyy.MM" id="month" type="date"/>
      <column id="population" type="integer"/>
      <data>
        <file format="csv" encoding="utf-8">PDVID_pop_2013_slice.csv</file>
      </data>
    </table>


I don't see what's wrong with the format of time dimension. I did the same as some given examples of the tutorial which are supposed to work fine ( like European Unemployment from Eurostat to be more precise).


thank you very much for your help.

Kim

Kim Pellegrin

unread,
Apr 10, 2014, 5:03:38 AM4/10/14
to public-d...@googlegroups.com
Thank you very much it works fine now!
Reply all
Reply to author
Forward
0 new messages