#pulp-meeting log

15:30:39 <fao89> #startmeeting Pulp Triage 2021-01-19
15:30:39 <fao89> #info fao89 has joined triage
15:30:39 <fao89> !start
15:30:39 <pulpbot> Meeting started Tue Jan 19 15:30:39 2021 UTC.  The chair is fao89. Information about MeetBot at http://wiki.debian.org/MeetBot.
15:30:39 <pulpbot> Useful Commands: #action #agreed #help #info #idea #link #topic.
15:30:39 <pulpbot> The meeting name has been set to 'pulp_triage_2021-01-19'
15:30:39 <pulpbot> fao89: fao89 has joined triage
15:31:42 <daviddavis> #info daviddavis has joined triage
15:31:42 <daviddavis> !here
15:31:42 <pulpbot> daviddavis: daviddavis has joined triage
15:32:30 <ggainey> #info ggainey has joined triage
15:32:30 <ggainey> !here
15:32:30 <pulpbot> ggainey: ggainey has joined triage
15:32:36 <ppicka> !here
15:32:36 <ppicka> #info ppicka has joined triage
15:32:37 <pulpbot> ppicka: ppicka has joined triage
15:33:04 <ttereshc> #info ttereshc has joined triage
15:33:04 <ttereshc> !here
15:33:04 <pulpbot> ttereshc: ttereshc has joined triage
15:33:08 * daviddavis shouts across the post apocalyptic wastelands, "QUORUM!"
15:33:30 <ttereshc> :)
15:33:34 <ggainey> heh
15:33:47 <fao89> topic: How specific can we be in Access Denied type of errors not to give away too much information but also to make it less painful for users and devs to debug permission problems?
15:33:55 <fao89> https://paste.centos.org/view/df93d584
15:34:08 <ttereshc> it was me
15:34:20 <bmbouter> #info bmbouter has joined triage
15:34:20 <bmbouter> !here
15:34:20 <pulpbot> bmbouter: bmbouter has joined triage
15:34:25 <ttereshc> I was very frustrated with debugging and I wonder how it would be for users
15:34:39 <ttereshc> to figure out what they configured wrongly
15:35:28 <ggainey> ttereshc: detailed info into the log is prob good. From the "outside", anything other than "you do not have permissions" is prob to be avoided
15:35:36 <bmbouter> the issue is that the log isn't for the user
15:35:42 <ggainey> because you don't want to give info to an attacker
15:36:02 <ggainey> bmbouter: but a 'user' can't change perms - an admin has to do that
15:36:08 <ttereshc> what attacker can do if I say you don't have access to a requested repository?
15:36:21 <bmbouter> users can over time certainly
15:36:27 <ttereshc> without saying its name
15:36:47 <bmbouter> I think the application should raise a fatal exception that has a helpful error message
15:36:51 <ttereshc> at least I know which perms got screwed up
15:37:12 <ggainey> ah ok, that sounds reasonable
15:37:53 <ttereshc> ok, maybe I can file an issue for that
15:38:07 <ttereshc> bmbouter, could you define what helpful message means
15:38:13 <ttereshc> how helpful :)
15:39:02 <dkliban> #info dkliban has joined triage
15:39:02 <dkliban> !here
15:39:02 <pulpbot> dkliban: dkliban has joined triage
15:39:43 <bmbouter> well that's the trick of it
15:40:20 <bmbouter> right now in pulp's history I think helping users figure problems out > concerns for data leakage
15:40:43 <bmbouter> over time as the setup gets easier that concern will reverse
15:40:49 <ggainey> strongly disagree - because leaking-info never changes, until *after* it causes a security problem
15:40:52 <gerrod> #info gerrod has joined triage
15:40:52 <gerrod> !here
15:40:52 <pulpbot> gerrod: gerrod has joined triage
15:41:00 <ggainey> and then it's too late
15:41:05 <bmbouter> to be clear though it's not a security problem
15:41:09 <bmbouter> it's a data leakage problem
15:41:22 <ggainey> I don't see a difference there
15:41:25 <bmbouter> which is a problem but it's not with that info the user can sidestep authorization
15:41:47 <ggainey> "learning things you're not supposed to know" *is* a security problem
15:41:47 <bmbouter> ggainey: I get where you're coming from it's my default also
15:42:19 <bmbouter> but when asked my opinion on where to draw the line on that error message this is my answer (maybe not the right answer)
15:42:46 <ggainey> sure - which is why we're havin the discussion, is to decide where the line gets drawn in pulp :)
15:44:10 <ttereshc> we can have a setting which allows to produce error messages with some details and user can consciously turn it on knowing that some info might be revealed. Not passwords but more details than the person focused on security would consider safe.
15:44:37 <bmbouter> that would resolve our difference in opinion but I worry that's too complicated
15:44:44 <dkliban> sounds complicateed
15:44:45 <bmbouter> I'd rather agree with grant and let the user just figure it out
15:44:54 <ttereshc> I don't really like this idea  exactly for that reason
15:45:01 <ttereshc> yeah
15:45:18 <ggainey> yeah, feels like asking for edge-cases, alas
15:45:21 <ttereshc> let ggainey do all the rbac and maybe he'll change his mind :P
15:45:35 <ggainey> nope - ggainey did security for a living for too many years
15:45:39 <bmbouter> I maintain that knowing the name of a repo is not allowing a user to be authorized to operate any more than they could otherwise
15:45:53 <bmbouter> and it is practically very helpful to the user
15:46:01 <ggainey> the REST call replies "You're not allowed to do that", and the server-logs have details as to why
15:46:27 <bmbouter> we're going in a circle I think we established already that the logs are not helpful to users
15:46:29 <ggainey> we have absolutely had actual-customer-cases where "leaking repo names" was a sev-1 for the customer
15:46:51 <dkliban> i was of ggainey's opinion yesterday, but after discussing ttereshc's experience yesterday, i have decided that revealing the name is fine.
15:47:23 <bmbouter> ggainey's opinion is also my default
15:47:32 <ttereshc> is it ok to have this info in server logs at least? it can be the first step
15:47:52 <ggainey> ttereshc: yes - because server-logs don't show anything that someone with server-access couldn't find anyway
15:48:31 <ggainey> as long as we keep 'secrets' (keys and pwds and such) out of logs, we can put whatever detrail we want there
15:48:40 <ggainey> detail, even
15:49:04 <ttereshc> ok, so i'll file an issue to have all details in server logs and the type of resource for a user message. Is it ok with everyone?
15:49:13 <ggainey> sounds like a fine idea to me
15:49:17 <daviddavis> +1
15:49:18 <bmbouter> I don't think it's actually helping
15:49:31 <bmbouter> sorry because I know we want to move on but the user does not have log access
15:49:50 <ggainey> then they shouldn't have detailed-info-access on poermission-failures either
15:50:03 <bmbouter> I'm ok with that
15:50:25 <ttereshc> bmbouter, I think that user performs operation a limited set of resources and if you point which resource is the problematic one, it's already helpful
15:50:27 <ggainey> I'm not. We can do something different, but I'm not going to agree it's correct or appropriate.
15:50:45 <ttereshc> operation on a limited set of resources
15:51:12 <ttereshc> e.g. I'm doing podman push, and it involves creating 3 resources
15:51:28 <ttereshc> if I'm told which one, it's easier for me to figure out what's up
15:51:32 <bmbouter> to me the choices are a) say more in the response the uesr can read or b) do not, there isn't a c) put things into the logs
15:51:44 <fao89> next topic?
15:52:18 <ttereshc> bmbouter,  the say more part for a user is the resource type
15:52:24 <ttereshc> imo
15:52:27 <dkliban> ggainey: in ttereshc's specific use case she already knows the names of things
15:52:31 <ttereshc> not ideal but a step forward
15:52:39 <bmbouter> ttereshc: oh I see what you're saying, I'm good with that
15:52:52 <dkliban> she just wanted to know which of the resources she is lacking permissions to access
15:52:55 <bmbouter> I'm basically ok with all of these options as long as the info is going back to the user not in the logs
15:53:33 <ttereshc> they are probably unrelated. Logs will help with development/debugging.
15:53:41 <ttereshc> just not to do one instead of the other
15:54:04 <ipanova> ttereshc: dkliban FYI dockerhub does not say much, it just retuns 'insufficient permissions'
15:54:39 <ttereshc> ok, I think we can move on and I'll file an issue where we can figure out more details.
15:54:41 <ttereshc> ty ipanova
15:54:57 <ttereshc> thanks for the discussion
15:54:59 <ggainey> sure - better to have discussion in an issue, so it's available for the future!
15:56:21 <dkliban> i think we need to move on
15:56:27 <ggainey> concur
15:56:38 <fao89> +1
15:57:05 <fao89> topic: Spam script removing actual users
15:57:11 <ggainey> ouch
15:57:24 <daviddavis> this came up during our pulpcore meeting
15:57:24 <dkliban> this is problematic
15:57:35 <ggainey> yupyup
15:57:39 <fao89> disable the script?
15:57:51 <dkliban> i am wondering what kind of links they are posting
15:58:00 <dkliban> that is causing them to be flagged
15:58:11 <dkliban> dalley: do you know?
15:58:16 <dalley> it's not a matter of flagged, it's new users not going through the entire process
15:58:34 <dkliban> what do you mean?
15:58:35 <dalley> also there's manual effort on our end to verify them and add them to the whitelist
15:59:00 <dkliban> dalley: the script is supposed to look for issues/comments that contain links
15:59:10 <dkliban> and we have a white list of domains where users are allowed to link to
15:59:32 <ttereshc> so the script does its job right then, it seems that the new users do not get into this whitelist
16:00:11 <ttereshc> if I understood dalley correctly
16:00:16 <dalley> the one example I specifically remember was one of AdamW's posts that ended up deleted permanently, so I don't know where the link was to
16:00:31 <dkliban> does every new user need to be added to the whitelist?
16:00:41 <dalley> I just remember he emailed us after his account got locked and I had to unlock it
16:01:04 <dalley> and AdamW is one of our most active and helpful users
16:02:05 <dalley> but the post is gone
16:03:01 <dalley> dkliban, actually, to be honest, we've gone through a few revisions of the script, I've lost track because I was never really involved with it apart from being on the mailing list
16:03:10 <dalley> I don't know what it does right at this moment
16:03:20 <ggainey> yeah same
16:03:34 <bmbouter> same here
16:04:14 <dkliban> my understanding is that when a user posts a link to a domain we have not whitelisted, their account gets flagged and they must send us an email.
16:04:29 <dkliban> once that email is received the user needs to be added to a whitelist
16:04:42 <daviddavis> I know at least one user that didn't bother to do that and their content got removed
16:04:43 <dkliban> ideally, that domain would also get added to a whitelist
16:04:45 <daviddavis> fwiw
16:05:13 <dkliban> I am nto saying that tehre is no problem, but i just want to make sure we are all on the same page
16:05:25 <daviddavis> yes, it's appreciated
16:05:35 <ggainey> daviddavis: do we know the script did that? like, can we point to the script-log that says "I'm deleting Post X"?
16:05:41 <dkliban> and i agree that we don't want legitimate users getting deleted
16:06:08 <dkliban> ggainey: i am not sure how much is logged
16:06:17 <ggainey> have we seen a legit *user* get deleted? I thought we just locked users
16:06:27 <daviddavis> this was like months ago when I believe we were still deleting accounts before locking them
16:06:29 <daviddavis> now we lock them
16:06:29 <ggainey> dkliban: yeah same, it's been forever since I looked at a run
16:06:32 <dkliban> no, i misspoke
16:06:36 <ggainey> ah kk
16:07:24 <daviddavis> do we still get a lot of spam?
16:07:37 <ggainey> can we get an issue open, to teach the spam-filter to make issues 'private' rather than deleting them? would that accomplish our spam-goals, while still letting us recover in the case of an error?
16:08:07 <ggainey> daviddavis: well, we've had ppl complaining about it recently - not sure how much we get, but it does still happen
16:08:13 <daviddavis> +1
16:08:59 <daviddavis> I agree with the issue and making stuff private instead of deleting it. I guess I wonder if we should also disable the spam script until that issue gets implemented?
16:09:27 <fao89> we still have 2 items for open floor + triage, and we only have 20 minutes more
16:09:51 <daviddavis> let's think bout this topic and discuss more on friday?
16:09:54 <dkliban> ggainey: daviddavis: i'll open an issue to make things private instead of delete
16:09:59 <daviddavis> +1
16:10:03 <ggainey> +1
16:10:23 <fao89> +1
16:10:32 <bmbouter> oh wait lol
16:10:40 <bmbouter> I want to disable all private comments on redmine lol
16:10:49 <bmbouter> so this goes against that other goal
16:11:20 <bmbouter> I can explain why etc here if you want or just assume that's a valuable goal
16:12:08 <ggainey> I'm never fond of private bugs - but if deleting a legitimate user's issue is happening, but we still want to run the antispam-script, we are stuck
16:12:44 <bmbouter> when we accept private data we have several new problems: 1) we can never make it unprivate, 2) the "private read access" is a huge unmaintained mess 3) github issues won't support private issues when we switch 4) folks in the past expressed concerns that pulp is accepting private data and it legally shouldn't
16:13:20 <bmbouter> oh also it violates our agreement with plan.io as a fully public open source project
16:13:40 <ggainey> ok - then we don't do that, and take the (hopefully rare) hit
16:14:08 <ggainey> which I'm ok with, to be clear - it's suboptimal, but so is letting pulp.plan.io be used by spammers
16:15:00 <dalley> what about security bugs?
16:15:25 <bmbouter> we never file security bugs in pulp.plan.io
16:15:38 <dalley> off topic anyways. +1 to the proposal
16:15:42 <bmbouter> they are disclosed over mailing list and then kept in bugzilla only
16:15:49 <bmbouter> while embargoed
16:16:31 <daviddavis> timecheck, we have less than 15 min left with 2 topics and a handful of issues.
16:17:25 <bmbouter> what was the conclusion on this?
16:17:44 <ggainey> I honestly don't know :)
16:19:09 <fao89> conclusion: bring it on next open floor?
16:19:46 <dalley> ggainey, we did see at least one legit user get deleted, but yes thats no longer an issue
16:20:13 <dalley> but even getting locked would be a major pain / frustration
16:20:28 <ggainey> fao89: for the sake of moving thru today's list, sure
16:20:30 <fao89> I moved the topics for next open floor, I think we need to start triage
16:20:38 <daviddavis> oh I have a blocking issue
16:20:43 <daviddavis> er topic
16:20:49 <dkliban> what is it?
16:21:01 <daviddavis> Should model validation be skipped altogether?
16:21:01 <ggainey> dalley: spam-filtering is hard - we're going to frustrate somebody somewhere :)
16:21:05 <fao89> topic: Should model validation be skipped altogether?
16:21:06 <daviddavis> e.g. https://git.io/JttPW
16:21:17 <ggainey> daviddavis: what's the problem there?
16:21:46 <daviddavis> we said all validation belongs in the serializer but I'm wondering if model validation is ok in addition to serializer validation
16:21:50 <daviddavis> belt + suspenders :)
16:22:01 <bmbouter> lol
16:22:26 <ggainey> daviddavis: I mean, in that very model we're throwing valueerrror in get_downloaders()
16:22:28 <daviddavis> that is a ggainey-ism
16:22:34 <ggainey> no it isn't
16:22:47 <daviddavis> oh I got belt + suspenders from you
16:22:59 <ggainey> oooh, the expression, I get it
16:23:27 <daviddavis> so you're saying that model validation is ok in addition to other places in the code?
16:23:30 <bmbouter> moving it out of the model would be ok w/ me
16:23:32 <ggainey> this model has all sorts of validation.
16:23:48 <bmbouter> it's tough because we don't always use the serializers...
16:23:55 <daviddavis> I'm working on some validation in the serializer and I'd like to add it to the model too to be sure
16:23:57 <daviddavis> yea
16:24:10 <bmbouter> like for example migration code
16:24:14 <ggainey> I mean, I get it, it's aspirational - but it's also *everywhere* in our models
16:24:37 <bmbouter> serializers also can derive their validation from models in some cases right?
16:24:50 <bmbouter> or am I just thinking wishfully
16:24:53 <ggainey> daviddavis: if we want to say "one should strive to...", I'm absolutely onboard - not so much for "rewrite existing code Just Because", tho
16:25:09 <daviddavis> this is new code
16:25:17 <daviddavis> I just posted an examle of some existing code
16:25:29 <daviddavis> sorry if I caused some confusion there
16:25:34 <bmbouter> for new code if you can use serializer only I'm ok w/ that
16:25:36 <ggainey> ah, kk, thanks
16:25:38 <ggainey> yeah same
16:25:52 <daviddavis> I mean I'd like to do serializer + model to be sure
16:26:01 <daviddavis> since it's possible the model is used via the plugin api
16:26:06 <bmbouter> as long as it's DRY I'm ok w/ that
16:26:10 <daviddavis> ok
16:26:12 <bmbouter> if it's not DRY then I think we're taking on risk
16:26:21 <daviddavis> yea, ok
16:26:23 <ggainey> yeah, concur
16:26:40 <daviddavis> thanks fao89
16:26:52 <ttereshc> a minor problem is that if we change anything in validation, we'd need to do it in 2 places
16:27:17 <daviddavis> right, I think that was bmbouter's point about being DRY
16:27:30 <ttereshc> ah ok
16:27:34 <ggainey> yus
16:28:33 <fao89> can I start triage?
16:28:46 <ggainey> heh - for 2 min? :)
16:29:23 <fao89> 2 issues there seems to be important
16:29:23 <daviddavis> this is the only semi-urgent issue I see: https://pulp.plan.io/issues/8116
16:29:45 <fao89> !issue 8116
16:29:45 <fao89> #topic https://pulp.plan.io/issues/8116
16:29:46 <pulpbot> RM 8116 - paji@redhat.com - NEW - Export is generating the incorrect 'next_version' causing import to fail.
16:29:47 <pulpbot> https://pulp.plan.io/issues/8116
16:29:51 <ggainey> daviddavis: yeah - worked w/paji on this yesterday
16:29:59 <ggainey> accept and add, we need to figure something out here
16:30:18 <ttereshc> what's the target release for katello?
16:30:47 <ttereshc> do we need it late January or February?
16:30:56 <fao89> #idea Proposed for #8116: accept and add to sprint
16:30:56 <fao89> !propose other accept and add to sprint
16:30:56 <pulpbot> fao89: Proposed for #8116: accept and add to sprint
16:31:27 <fao89> #agreed accept and add to sprint
16:31:27 <fao89> !accept
16:31:27 <pulpbot> fao89: Current proposal accepted: accept and add to sprint
16:31:28 <fao89> #topic https://pulp.plan.io/issues/8115
16:31:28 <pulpbot> fao89: 4 issues left to triage: 8115, 8112, 7950, 7922
16:31:29 <pulpbot> RM 8115 - bmbouter - NEW - Satellite issue sync is broken
16:31:30 <pulpbot> https://pulp.plan.io/issues/8115
16:32:14 <ttereshc> I'm not a fan adding items to the sprint without knowing relative priority, since we are already stretched thin. that's why I asked.
16:33:22 <ggainey> ttereshc: this breaks a common katello use-case for import/export. It needs to be fixed for imp/exp to be useful to tyhe katello user. I'd think 4.0? but we will ask in integration-mtg tomorrow
16:33:31 <ggainey> maybe 4.1?
16:34:45 <fao89> I'll remove it from sprint, and just skip the triage
16:34:48 <fao89> #endmeeting
16:34:48 <fao89> !end