--- Log opened Mon Sep 08 00:00:05 2014
00:11 -!- adahms is now known as adahms-brb
00:11 -!- adahms-brb is now known as adahms-mtg
00:56 -!- adahms-mtg is now known as adahms
01:52 -!- bala (purple): has joined #vdsm
02:01 -!- adahms: has quit [Ping timeout: 245 seconds]
02:20 -!- dougsland: has quit [Ping timeout: 264 seconds]
02:20 -!- adahms (Andrew Dahms): has joined #vdsm
02:21 -!- dougsland (Douglas): has joined #vdsm
02:25 -!- dougsland: has quit [Ping timeout: 240 seconds]
02:34 -!- nsoffer: has quit [Ping timeout: 260 seconds]
02:44 -!- dougsland (Douglas): has joined #vdsm
02:57 -!- bala: has quit [Quit: Leaving.]
03:32 -!- dougsland: has quit [Ping timeout: 252 seconds]
03:34 -!- dougsland (Douglas): has joined #vdsm
03:50 -!- nsoffer (Nir Soffer): has joined #vdsm
04:10 -!- nsoffer: has quit [Read error: Connection reset by peer]
04:44 -!- adahms: has quit [Ping timeout: 268 seconds]
04:56 -!- adahms (Andrew Dahms): has joined #vdsm
06:05 -!- adahms: has quit [Quit: Leaving]
06:16 -!- vered (Vered Volansky): has joined #vdsm
06:52 -!- bala (purple): has joined #vdsm
07:48 -!- hchiramm (Humble Chirammal): has joined #vdsm
07:49 -!- sbonazzo (purple): has joined #vdsm
07:58 -!- adahms (Andrew Dahms): has joined #vdsm
08:06 -!- mskrivanek_away is now known as mskrivanek
08:08 -!- aravindavk (Aravinda): has joined #vdsm
08:17 -!- evilissimo (Vinzenz Feenstra): has joined #vdsm
08:17 -!- fromani (Francesco Romani): has joined #vdsm
08:22 -!- fabiand (Fabian Deutsch): has joined #vdsm
08:32 -!- pkliczew (Piotr Kliczewski): has joined #vdsm
08:35 -!- ishaby (Idan Shaby): has joined #vdsm
08:50 -!- phoracek (phoracek): has joined #vdsm
08:51 -!- saggi (purple): has joined #vdsm
09:08 -!- bazulay: has quit [Quit: Leaving.]
09:11 -!- xaviern (Xavier): has joined #vdsm
09:13 -!- acanan (Aharon Canan): has joined #vdsm
09:33 -!- fsimonce (Federico): has joined #vdsm
09:49 -!- derez_ (Daniel Erez): has joined #vdsm
10:01 -!- Daviey: has quit [Ping timeout: 264 seconds]
10:01 -!- apuimedo (antoni): has joined #vdsm
10:13 < fromani> apuimedo: good morning. I've got news for you. Please take a look at http://fpaste.org/131703/ at the entries for Mon Sep 8 08:24:28 CEST 2014 and Mon Sep 8 08:29:29 CEST 2014
10:13 * apuimedo looks
10:14 < fromani> there is a usage jump of ~60 megs that I cannot really explain
10:14 < fromani> and this is compatible with your findings
10:14 < fromani> apuimedo: ^^^
10:15 -!- bazulay (purple): has joined #vdsm
10:15 < apuimedo> fromani: :(
10:16 -!- Daviey (Dave Walker): has joined #vdsm
10:19 < apuimedo> fromani: I'm still in the order of 300 MB gone every day
10:21 -!- mskrivanek is now known as mskrivanek_away
10:22 -!- mskrivanek_away is now known as mskrivanek
10:38 < fromani> apuimedo: every day? Here I lost ~75 megs in 36 hours
10:39 * fromani is even more puzzled
10:41 < apuimedo> fromani: every day
10:42 < fromani> dowser profiles are flat
10:42 < apuimedo> fromani: It could be that your leak being a one-off thing and mine being due to ssl
10:42 < fromani> apuimedo: exactly
10:42 < apuimedo> s/being/are/
10:42 < fromani> apuimedo: I'm trying to gather all the clues we found
10:42 < fromani> apuimedo: but the picture is still quite muddy
10:43 < apuimedo> I really think that the easiest thing to check is to try removing m2crypto
10:43 < apuimedo> and give it a go
10:43 < apuimedo> and then decide
10:45 -!- ykaplan (Yeela Kaplan): has joined #vdsm
10:49 < fromani> apuimedo: even though I like this idea, I'm not really comfortable with shotgun debugging, but I'm also running out of ideas :(
10:52 -!- bazulay: has quit [Ping timeout: 240 seconds]
10:57 -!- nsoffer (Nir Soffer): has joined #vdsm
11:08 -!- bazulay (purple): has joined #vdsm
11:14 -!- bazulay: has quit [Ping timeout: 246 seconds]
11:28 -!- mode/#vdsm: by ChanServ
11:28 -!- danken (purple): has joined #vdsm
11:28 -!- bazulay (purple): has joined #vdsm
11:34 -!- winfr34k (Thomas): has joined #vdsm
11:34 < winfr34k> Hello~
11:35 < winfr34k> are there any known problems with the snapshots tree that involve hosted-engine deploy? If so, are there any fixes for it? I'm on CentOS 7 right now. It worked flawlessly one week ago, now it just stops.
11:35 < winfr34k> Failed to execute stage 'Environment setup': [Errno 2] No such file or directory: '/etc/pki/vdsm/certs/cacert.pem'
11:37 < apuimedo> winfr34k: maybe sbonazzo will know
11:37 -!- mpolednik (Martin Polednik): has joined #vdsm
11:37 < winfr34k> ok?
11:37 < winfr34k> well then, @sbonazzo, any idea?
11:38 < sbonazzo> winfr34k: I'm still investigating that. You can work around it with "vdsm-tool configure --force" but you'll lose the TLS config for libvirt, so you can't use virsh to connect to the VM
11:38 < sbonazzo> winfr34k: but if you use spice or vnc it's enough
11:39 < winfr34k> omg, if this works, you are my lifesaver!
11:40 < winfr34k> thanks, at least I got further this time :D
11:44 < apuimedo> danken: I split out the topology iteration in a separate patch with tests ;-)
11:45 < apuimedo> danken: http://gerrit.ovirt.org/32606
11:47 <@danken> apuimedo: thanks
11:48 <@danken> apuimedo: could you also take a look at the ethtool_opts issue? it's a 3.5 thingy that I'd like to get done
11:48 < apuimedo> danken: I'm on it
11:49 <@danken> apuimedo: hmm, you're keeping the surprising "for dev in device" iterator. I'll need to get used to it.
11:50 < apuimedo> danken: I tried to explain in the commit message that this is a simple tree-like data structure iteration
11:50 < sbonazzo> winfr34k: let us know :-)
11:51 < apuimedo> and thus it seems legitimate to me that for dev in device shows all the devs
11:51 < apuimedo> (I agree that looking at devices like vlans that can only have a child makes __iter__ look funny
11:51 < apuimedo> )
11:55 < winfr34k> sbonazzo: Thanks, I certainly will :P
11:55 <@danken> apuimedo: my problem with that is also one of design. Unlike a dictionary, which is only a container of key-value pairs, a NetDevice has identity and devices that it depends on.
11:55 <@danken> the dependent devices are not "part of" the top level one
11:57 < apuimedo> danken: to me it's like going through a linked list getting a pointer to a node and just following the 'next' pointer
11:57 < apuimedo> or taking a tree node as a subtree
12:02 <@danken> apuimedo: but in a general-purpose container, such as list or tree, you can easily say that the head or root *are* the underlying structure. It seems much more awkward for a specific thing like our NetDevice
12:03 < apuimedo> danken: well, that could be fixed by renaming NetDevice to NetTopologyNode :P
12:08 < mpolednik> danken: any chance of a quick glance at http://gerrit.ovirt.org/#/c/29265/3 ? need to get vmtests import through in order to be able to create patches for other devices without lots of rebasing later :)
12:10 <@danken> apuimedo: I was thinking about an explicit name for the iterator, something like NetDevice.underlying_devices()
12:10 <@danken> mpolednik: I'd like to get fromani's ack for this one first.
12:12 < mpolednik> danken: http://gerrit.ovirt.org/#/c/29263/ this one would do?
12:12 < fromani> mpolednik: I have a couple of comments about the commit message (see comment to patchset 1), patch itself looks OK
12:15 < apuimedo> danken: I saw the suggestion, but it is so nice to just implement the iteration protocol :P
12:17 <@danken> mpolednik: taken
12:18 <@danken> apuimedo: yeah, I understand that urge
12:18 < apuimedo> ;-)
12:18 < mpolednik> danken: awesome, thanks
12:18 < apuimedo> I'll push the ethtool shortly
12:26 < winfr34k> sbonazzo: I got through the install, though this error appeared on the created VM: http://pastebin.com/hGHXVYUd
12:34 -!- bala: has quit [Ping timeout: 264 seconds]
12:40 < sbonazzo> winfr34k: never seen such an error, looks like your exportfs command has some missing function
12:41 < winfr34k> sbonazzo: Well, that isn't good :/ I just used the hosted-engine --deploy command!
12:43 < sbonazzo> winfr34k: well hosted-engine --deploy is a tool that relies on other tools doing their work properly :-)
12:43 < sbonazzo> winfr34k: which distribution are you running?
12:44 < sbonazzo> winfr34k: (the pastebin is from engine-setup BTW)
12:46 < winfr34k> oops :D
12:46 < winfr34k> sbonazzo: I am running Fedora 20 on the engine, but CentOS 7 on all of my nodes.
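[Editor's sketch] The NetDevice iteration debate above (11:49 to 12:15) weighs an explicit NetDevice.underlying_devices() method against implementing the iteration protocol so that "for dev in device" walks the topology. The Python below is only an illustration of the two options side by side, not the vdsm code from the gerrit patch: the class body, the lower_devices attribute, the device names, and the choice to yield the device itself first are all assumptions made for the example.

```python
class NetDevice:
    """Hypothetical stand-in for the NetDevice discussed in the channel."""

    def __init__(self, name, *lower_devices):
        self.name = name
        # Assumed attribute: the devices this one is built on top of.
        self.lower_devices = list(lower_devices)

    def underlying_devices(self):
        """Explicitly named walk (danken's suggestion), depth first.

        Assumption: the walk starts with the device itself.
        """
        yield self
        for dev in self.lower_devices:
            yield from dev.underlying_devices()

    # Iteration-protocol variant (apuimedo's preference): the same walk,
    # reachable through "for dev in device".
    __iter__ = underlying_devices


# Assumed topology: bridge -> vlan -> bond -> two nics.
eth0, eth1 = NetDevice("eth0"), NetDevice("eth1")
bond = NetDevice("bond0", eth0, eth1)
vlan = NetDevice("bond0.100", bond)
bridge = NetDevice("ovirtmgmt", vlan)

names_via_iter = [dev.name for dev in bridge]
names_via_method = [dev.name for dev in bridge.underlying_devices()]
```

Both spellings produce the same sequence here; the disagreement in the channel is about which one reads more naturally for an object that has its own identity, not about behaviour.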
12:46 < winfr34k> sbonazzo: BTW, I just tried to set up another node, but this happened: http://pastebin.com/KU37J2dP
12:48 < sbonazzo> winfr34k: maybe we should move this discussion to #ovirt channel
12:48 < winfr34k> we can.
12:48 < winfr34k> brb in the ovirt channel on the other server
12:48 < sbonazzo> winfr34k: ok
12:49 < sbonazzo> winfr34k: btw, xxx-vmengine1.xxx.xxx.edu looks like the engine vm, you need to pass first host fqdn :-)
12:50 -!- winfr34k: has quit [Quit: Leaving]
12:56 -!- ykaplan: has quit [Ping timeout: 276 seconds]
13:09 -!- vered: has quit [Ping timeout: 252 seconds]
13:25 < apuimedo> danken: wildcard '*' pushed
13:39 -!- fromani_ (Francesco Romani): has joined #vdsm
13:40 -!- mpolednik: has quit [Ping timeout: 272 seconds]
13:41 -!- fromani: has quit [Ping timeout: 272 seconds]
13:45 -!- vered (Vered Volansky): has joined #vdsm
13:54 -!- fromani_: has quit [Quit: Leaving]
13:54 -!- fromani (Francesco Romani): has joined #vdsm
14:09 -!- ykaplan (Yeela Kaplan): has joined #vdsm
14:09 < apuimedo> fromani: I'm getting build errors in el6.5
14:10 < apuimedo> error: line 317: Unknown tag: else
14:10 < apuimedo> make: ***: Error 1
14:10 < apuimedo> when doing make rpm
14:10 < apuimedo> do you know anything about that?
14:10 < fromani> nope, but I haven't built on RHEL6.5 for a couple of weeks
14:10 < fromani> I can retry within the hour, however
14:11 < apuimedo> fromani: ok, thanks
14:11 < fromani> apuimedo: sure, I'll let you know. Is that the master tip?
14:11 -!- mpolednik (Martin Polednik): has joined #vdsm
14:12 < apuimedo> I just rebased my ethtool dependency removal on top of master
14:12 < apuimedo> let me check I didn't screw up the vdsm.spec.in
14:12 < apuimedo> :P
14:13 < apuimedo> ah, yes
14:13 < apuimedo> I missed a '%'
14:13 < apuimedo> my bad, fromani: sorry
14:14 < fromani> apuimedo: np :)
14:19 -!- mbetak (purple): has joined #vdsm
14:36 -!- ishaby: has quit [Quit: Leaving]
14:36 -!- ishaby (Idan Shaby): has joined #vdsm
14:54 -!- gpadgett (Greg Padgett): has joined #vdsm
15:05 -!- derez_: has quit [Ping timeout: 245 seconds]
15:06 -!- acanan: has quit [Ping timeout: 245 seconds]
15:14 -!- derez_ (Daniel Erez): has joined #vdsm
15:22 -!- aravindavk: has quit [Ping timeout: 268 seconds]
15:37 -!- adahms: has quit [Quit: Leaving]
15:39 < apuimedo> danken: ethtool patches (except the hook) verified
15:43 -!- ybronhei (purple): has joined #vdsm
15:44 <@danken> apuimedo: thanks; I have some comments regarding the hook
15:45 < apuimedo> oh
15:45 < apuimedo> go ahead
15:45 <@danken> apuimedo: could you state HOW the verification was done? I had an unhappy occasion where I did not ask this question, but should have.
15:45 < apuimedo> danken: the hook I didn't tick as verified, did I?
15:45 < apuimedo> cause I'm verifying it now
15:46 -!- fpliger (fpliger): has joined #vdsm
15:47 <@danken> apuimedo: you have not
15:47 < apuimedo> ah, ok
15:47 <@danken> apuimedo: oh, I see that the tip of the branch you have
15:47 <@danken> Verified the branch up until this patch by running the unit tests, the functional tests and checking that the sampling keeps working.
15:47 <@danken> understood.
16:03 -!- ykaplan: has quit [Remote host closed the connection]
16:10 -!- Daviey: has quit [Ping timeout: 252 seconds]
16:14 -!- Daviey (Dave Walker): has joined #vdsm
16:20 -!- fpliger_ (fpliger): has joined #vdsm
16:23 -!- mpolednik: has quit [Ping timeout: 240 seconds]
16:24 -!- fpliger: has quit [Ping timeout: 246 seconds]
16:28 -!- xaviern: has quit [Ping timeout: 264 seconds]
16:35 -!- aravindavk (Aravinda): has joined #vdsm
16:41 -!- ishaby: has quit [Ping timeout: 255 seconds]
16:44 -!- phoracek: has quit [Quit: WeeChat 0.4.3]
16:54 -!- xaviern (Xavier): has joined #vdsm
17:01 -!- mskrivanek is now known as mskrivanek_away
17:08 <@danken> apuimedo: could you glimpse at https://bugzilla.redhat.com/show_bug.cgi?id=1138731 firewall-cmd gets stuck when run as superVDSM subsubprocess
17:08 <@danken> can you think of the selinux there?
17:08 <@danken> could you ping someone from selinux regarding this?
17:09 < apuimedo> danken: what do you mean with 'can you think of the selinux there?'
17:09 < apuimedo> ?
17:09 -!- mbetak1 (purple): has joined #vdsm
17:09 < apuimedo> I told ondra that he would likely have to visit mr grepl
17:09 -!- mbetak1: has quit [Client Quit]
17:09 < apuimedo> you want me to push them a bit?
17:11 -!- pkliczew: has quit [Ping timeout: 264 seconds]
17:12 -!- bazulay: has quit [Ping timeout: 276 seconds]
17:12 -!- mbetak: has quit [Ping timeout: 268 seconds]
17:13 -!- ishaby (Idan Shaby): has joined #vdsm
17:14 < nsoffer> danken, I think you can take http://gerrit.ovirt.org/32406 - the added comment seems safe
17:17 <@danken> nsoffer: thanks.
17:18 <@danken> nsoffer: could you also review/ack fromani profiling patch?
17:18 -!- xaviern: has quit [Ping timeout: 245 seconds]
17:18 < nsoffer> danken, not today
17:18 < fromani> danken: nsoffer already added comments, I still need to update my patch accordingly
17:18 <@danken> http://gerrit.ovirt.org/#/c/32019/
17:18 <@danken> ah
17:18 -!- ishaby: has quit [Ping timeout: 255 seconds]
17:18 < fromani> danken: I'll do it shortly
17:19 < fromani> do we still want to have dowser inside or do we want to leverage manhole?
17:19 < nsoffer> fromani, any news on cpu profiling 100 vms with latest yappi?
17:19 < fromani> nsoffer: no, because I've joined forces with apuimedo to tackle a nasty memleak he found
17:20 < fromani> nsoffer: that test is queued
17:20 < nsoffer> fromani, ok
17:20 -!- vered: has quit [Ping timeout: 252 seconds]
17:27 -!- sbonazzo: has quit [Quit: Leaving.]
17:32 -!- pkliczew (Piotr Kliczewski): has joined #vdsm
17:32 -!- derez_: has quit [Ping timeout: 255 seconds]
17:44 -!- derez_ (Daniel Erez): has joined #vdsm
17:48 -!- fpliger_: has quit [Remote host closed the connection]
17:49 -!- fpliger (fpliger): has joined #vdsm
17:54 -!- fpliger: has quit [Ping timeout: 276 seconds]
18:09 -!- pkliczew: has quit [Ping timeout: 240 seconds]
18:10 -!- derez_: has quit [Ping timeout: 240 seconds]
18:17 -!- fromani: has quit [Quit: Leaving]
18:19 -!- fpliger (fpliger): has joined #vdsm
18:21 -!- fpliger_ (fpliger): has joined #vdsm
18:21 -!- fpliger: has quit [Read error: Connection reset by peer]
18:33 < apuimedo> danken: ethtool_opts verified
18:33 < apuimedo> note that if you pass an option that is already set
18:33 < apuimedo> ethtool fails
18:33 <@danken> apuimedo: I meant "can you think of the selinux *failure* there", what could be dropping out dbus messages
18:34 < apuimedo> and the rest of subcommands are not executed
18:34 < apuimedo> danken: mgrepl thinks it's wrong labelling
18:35 -!- apuimedo is now known as apuimedo|dinner
18:35 <@danken> apuimedo: yuckie ethtool behaviour. is this the case with initscripts, too?
18:35 <@danken> apuimedo|dinner: bon appétit
18:36 <@danken> apuimedo|dinner: and when you're back, I'd love to hear your comment on my http://gerrit.ovirt.org/#/c/32508/3..4/vdsm_hooks/ethtool_options/ethtool_options.py
18:45 < ybronhei> danken: nsoffer: please revisit http://gerrit.ovirt.org/#/c/31561/
18:51 -!- mbetak (purple): has joined #vdsm
18:53 < apuimedo|dinner> I'm back
18:53 -!- apuimedo|dinner is now known as apuimedo
19:05 < apuimedo> danken: I'm not sure about it
19:05 < apuimedo> why do you assign: to nics?
19:05 < apuimedo> nics is supposed to be what we retrieve from netinfo
19:06 < apuimedo> sorry
19:06 < apuimedo> netattrs
19:23 < alitke__> Hi guys. I am getting an InquireNotSupported exception anytime getStoragePoolInfo is called on a new 3.5 host. Any ideas what I can do to fix it?
19:26 < apuimedo> alitke__: maybe nsoffer
19:27 < alitke__> apuimedo, Thanks, nsoffer any ideas?
19:28 -!- phoracek (phoracek): has joined #vdsm
19:28 < nsoffer> alitke__, what version is the storage domain?
19:28 < nsoffer> alitke__, is it the latest? v3?
19:29 < alitke__> nsoffer, interesting... It works if I switch back to the rfh oop_impl
19:29 < alitke__> yes
19:29 < alitke__> I have two things I keep doing to get a stable host: 1) disable jsonrpc, 2) use rfh
19:29 < nsoffer> :-)
19:30 < nsoffer> alitke__, any bug in jsonrpc that affects your workflow?
19:30 < alitke__> I keep getting unexpected EOF messages in the vdsm log and engine cannot bring a host up
19:31 < nsoffer> ok, I remember one of the EOF tracebacks that you posted here
19:31 < nsoffer> I think this causes the detector thread to exit, so you cannot make any new connection
19:32 < nsoffer> alitke__, but it should be the same with xmlrpc
19:32 < nsoffer> alitke__, did you open a bug for this?
19:32 < alitke__> no. I don't really have any good info about it
19:32 < alitke__> And I'm swatting away so many bugs trying to fix my own bug :)
19:33 < nsoffer> alitke__, it was unhandled SSLError I think
19:33 < alitke__> Like being caught in a cloud of gnats
19:33 < nsoffer> he, you have only one :-)
19:33 < nsoffer> about the oop one - can you create a clean vdsm log that shows how it breaks?
19:34 < nsoffer> and open a bug, or paste it somewhere?
19:34 < nsoffer> alitke__, ^^
19:34 < alitke__> sure. Let me try that
19:39 < alitke__> nsoffer, Now I can't reproduce it
19:39 < alitke__> Seems after it connected the first time now it can connect regardless of the oop_impl setting
19:40 < alitke__> If I see it again I'll try to capture the log
19:42 < apuimedo> bye!
19:42 -!- apuimedo: has quit [Quit: WeeChat 0.4.3]
19:48 < nsoffer> alitke__, do you still have the logs from the last failure?
19:48 < alitke__> no
19:58 -!- nsoffer: has quit [Read error: Connection reset by peer]
19:59 -!- nsoffer (Nir Soffer): has joined #vdsm
20:01 < nsoffer> alitke__, so we cannot do anything about it now
20:02 < alitke__> not at the moment, no.
20:09 -!- fpliger_: has quit [Remote host closed the connection]
20:09 -!- fpliger (fpliger): has joined #vdsm
20:10 -!- fabiand: has quit [Quit: Leaving]
20:14 -!- fpliger: has quit [Ping timeout: 264 seconds]
20:25 -!- bala (purple): has joined #vdsm
20:30 -!- mbetak: has quit [Quit: Leaving.]
20:38 -!- bala: has quit [Quit: Leaving.]
20:44 -!- aravindavk: has quit [Quit: Leaving]
20:51 -!- pkliczew (Piotr Kliczewski): has joined #vdsm
20:58 -!- pkliczew: has quit [Ping timeout: 260 seconds]
21:10 -!- fpliger (fpliger): has joined #vdsm
21:14 -!- fpliger: has quit [Ping timeout: 255 seconds]
21:15 -!- bazulay (purple): has joined #vdsm
21:20 -!- fpliger (fpliger): has joined #vdsm
21:24 -!- fpliger: has quit [Ping timeout: 240 seconds]
21:57 -!- fpliger (fpliger): has joined #vdsm
22:08 -!- fpliger: has quit [Remote host closed the connection]
22:08 -!- fpliger (fpliger): has joined #vdsm
22:09 -!- mbetak (purple): has joined #vdsm
22:13 -!- fpliger: has quit [Ping timeout: 246 seconds]
22:30 -!- nsoffer: has quit [Ping timeout: 255 seconds]
22:35 -!- nsoffer (Nir Soffer): has joined #vdsm
22:35 -!- phoracek: has quit [Ping timeout: 260 seconds]
22:35 -!- jbrooks_ is now known as jbrooks
22:37 -!- ishaby (Idan Shaby): has joined #vdsm
22:53 -!- fpliger (fpliger): has joined #vdsm
22:53 -!- ybronhei: has quit [Quit: Leaving.]
23:01 -!- phoracek (phoracek): has joined #vdsm
23:08 -!- ishaby: has quit [Ping timeout: 260 seconds]
23:11 -!- phoracek: has quit [Quit: WeeChat 0.4.3]
23:23 -!- adahms (Andrew Dahms): has joined #vdsm
23:28 -!- saggi: has quit [Quit: Leaving.]
23:42 -!- gpadgett: has quit [Ping timeout: 276 seconds]
23:57 -!- adahms: has quit [Ping timeout: 264 seconds]
23:59 -!- danken: has quit [Quit: Leaving.]
--- Log closed Tue Sep 09 00:00:07 2014