Possible bug in 4.6.1

4 posts / 0 new
Last post
Nameless
Nameless's picture
Offline
Last seen: 2 weeks 5 days ago
Joined: Feb 28 2014
Junior Boarder

Posts: 3

Tomas Uko
Possible bug in 4.6.1
Hi all,
 
I've got started InfiniDb cluster with two nodes (servers), setting is 1 UM and 2PMs (one on each machine). When we try to load data this is what happens (import is runned agains UM).
 
/usr/local/Calpont/bin/colxml stats -t detections -d '\t' -E '"' -l /opt/data/indata/detections_1405926001.dat -j 24
 
Running colxml with the following parameters:
2014-07-21 10:25:52 (771) INFO :
        Schema: stats
        Tables: detections
        Load Files: /opt/data/indata/detections_1405926001.dat
        -C      \
        -E      "
        -b      0
        -c      1048576
        -d      \t
        -e      10
        -j      24
        -n
        -p      /usr/local/Calpont/data/bulk/job
        -r      5
        -s
        -u
        -w      10485760
        -x      tbl
 
Creating job description file: /usr/local/Calpont/data/bulk/job/Job_24.xml
File completed for tables:
        stats.detections
 
Normal exit.
[...]# /usr/local/Calpont/bin/cpimport -j 24
2014-07-21 10:25:58 (813) INFO : Running distributed import (mode 1) on all PMs...
Segmentation fault (core dumped)
 
 
Logs (debug.log) on both machines follows:
Master:
Jul 21 10:25:58 *-002 writeengineserver[48479]: 58.072203 |0|0|0| D 32 CAL0000: 4963 : onReceiveMode() Setting fMode = 1
Jul 21 10:25:58 *-002 writeengineserver[48479]: 58.072308 |0|0|0| D 32 CAL0000: 4963 : onReceiveMode() DbRoot Count = 2
Jul 21 10:25:58 *-002 writeengineserver[48479]: 58.072544 |0|0|0| D 32 CAL0000: 4963 : CMD LINE ARGS came in /usr/local/Calpont/bin/cpimport.bin -j 24 -E " -C \ -R /tmp/infinidb_tmp_files/BrmRpt0621102558813.rpt -m 1 -P pm1-813 -p /tmp/infinidb_tmp_files -fSTDIN
Jul 21 10:25:58 *-002 writeengineserver[48479]: 58.072618 |0|0|0| D 32 CAL0000: 4963 : Brm Rpt Filename Arrived /tmp/infinidb_tmp_files/BrmRpt0621102558813.rpt
Jul 21 10:25:58 *-002 writeengineserver[48479]: 58.072655 |0|0|0| D 32 CAL0000: 4963 : Start Cpimport command reached!!
Jul 21 10:26:14 *-002 writeengineserver[48479]: 14.331910 |0|0|0| D 32 CAL0000: 4963 : Message Queue is empty; Stopping CF Thread
Jul 21 10:26:14 *-002 writeengineserver[48479]: 14.507616 |0|0|0| D 32 CAL0000: 4963 : onCpimportSuccess BrmReport Send
 
"Slave"
Jul 21 10:24:57 *-003 cpimport.bin[42032]: 57.785491 |0|0|0| I 34 CAL0086: Initiating BulkLoad: -j 24 -E " -C \ -R /tmp/infinidb_tmp_files/BrmRpt0621102457363.rpt -m 1 -P pm1-363 -p /tmp/infinidb_tmp_files -fSTDIN
Jul 21 10:24:58 *-003 cpimport.bin[42032]: 58.966966 |0|0|0| I 34 CAL0081: Start BulkLoad: JobId-24; db-stats
Jul 21 10:25:03 *-003 cpimport.bin[42032]: 03.154972 |0|0|0| I 34 CAL0083: BulkLoad: JobId-24; finished loading table stats.detections; 360000 rows inserted
Jul 21 10:25:03 *-003 writeengine[42032]: 03.155111 |0|0|0| I 19 CAL0008: Bulkload |Job: /tmp/infinidb_tmp_files/Job_24.xml |For table stats.detections: 360000 rows processed and 360000 rows inserted.
Jul 21 10:25:03 *-003 cpimport.bin[42032]: 03.168982 |0|0|0| I 34 CAL0082: End BulkLoad: JobId-24; status-SUCCESS
Jul 21 10:25:03 *-003 writeengineserver[41806]: 03.182606 |0|0|0| I 32 CAL0000: 5692 : cpimport exit on success
Jul 21 10:25:58 *-003 cpimport.bin[42194]: 58.227992 |0|0|0| I 34 CAL0086: Initiating BulkLoad: -j 24 -E " -C \ -R /tmp/infinidb_tmp_files/BrmRpt0621102558813.rpt -m 1 -P pm1-813 -p /tmp/infinidb_tmp_files -fSTDIN
Jul 21 10:25:59 *-003 cpimport.bin[42194]: 59.287771 |0|0|0| I 34 CAL0081: Start BulkLoad: JobId-24; db-stats
Jul 21 10:26:14 *-003 cpimport.bin[42194]: 14.516369 |0|0|0| I 34 CAL0083: BulkLoad: JobId-24; finished loading table stats.detections; 3890000 rows inserted
Jul 21 10:26:14 *-003 writeengine[42194]: 14.516512 |0|0|0| I 19 CAL0008: Bulkload |Job: /tmp/infinidb_tmp_files/Job_24.xml |For table stats.detections: 3890000 rows processed and 3890000 rows inserted.
Jul 21 10:26:14 *-003 cpimport.bin[42194]: 14.530917 |0|0|0| I 34 CAL0082: End BulkLoad: JobId-24; status-SUCCESS
Jul 21 10:26:14 *-003 writeengineserver[41806]: 14.544758 |0|0|0| I 32 CAL0000: 4963 : cpimport exit on success
Jul 21 10:26:14 *-003 writeengineserver[41806]: 14.545465 |0|0|0| I 32 CAL0000: 4963 : Broken Pipe : socket write failed
Jul 21 10:26:14 *-003 writeengineserver[41806]: 14.545624 |0|0|0| I 32 CAL0000: 4963 : Broken Pipe : socket write failed
 
 
When I try to import data into that table again I receive this:
[...]# /usr/local/Calpont/bin/cpimport -j 24
2014-07-21 10:26:51 (44856) ERR  : Failed to acquire Table Lock of stats.detections;  Table currently locked by process-cpimport; pid-42765; session--1; txn--1
 
After that only thing, that works is shut system down via command console.
 
Can anyone help me with this problem?
Thanks 
Tomas
radams
radams's picture
Offline
Last seen: 2 days 11 hours ago
Joined: Jan 3 2011
Administrator

Posts: 492

Robert Adams
Possible bug in 4.6.1

 

We tested this on the 4.6.0-1 release and everything seems to work. Not sure why you are seeing an issue.

 

# /usr/local/Calpont/bin/colxml stats -t detections -d '\t' -E '"' -l /opt/data/indata/detections_1405926001.dat -j 24
 
Running colxml with the following parameters:
2014-07-23 16:53:29 (22412) INFO : 
        Schema: stats
        Tables: detections  
        Load Files: /opt/data/indata/detections_1405926001.dat  
        -C      \
        -E      "
        -b      0
        -c      1048576
        -d      \t
        -e      10
        -j      24
        -n      
        -p      /usr/local/Calpont/data/bulk/job
        -r      5
        -s      
        -u      
        -w      10485760
        -x      tbl
 
Creating job description file: /usr/local/Calpont/data/bulk/job/Job_24.xml
File completed for tables:
        stats.detections
        
Normal exit.
# /usr/local/Calpont/bin/cpimport -j 24                                                                                                                      
2014-07-23 16:53:33 (22462) INFO : Running distributed import (mode 1) on all PMs...
2014-07-23 16:53:34 (22462) INFO : For table stats.detections: 64 rows processed and 64 rows inserted.
2014-07-23 16:53:34 (22462) INFO : Bulk load completed, total run time : 1.42636 seconds
Nameless
Nameless's picture
Offline
Last seen: 2 weeks 5 days ago
Joined: Feb 28 2014
Junior Boarder

Posts: 3

Tomas Uko
Have you tried run cpimport

Have you tried run cpimport one after another, I mean start next bulk load into same table right after prevous bulk load finished? With each import loading about 4mil of rows?

Nameless
Nameless's picture
Offline
Last seen: 2 weeks 5 days ago
Joined: Feb 28 2014
Junior Boarder

Posts: 3

Tomas Uko
Have you tried it?

Have you tried, what I wrote?