Age | Commit message (Collapse) | Author | |
---|---|---|---|
2023-04-11 | Add JIS encodings to message loading | Célestin Matte | |
2022-04-01 | Fix bytes/str handling of secondary text parts in messages | Magnus Hagander | |
This was broken in the python 2->3 migration, but is apparently an uncommon enough case that it wasn't properly spotted until now. Reported and pointers in the right direction from Andres Freund | |||
2020-08-11 | Updaate regexp escaping syntax to please pep8 | Magnus Hagander | |
2020-08-11 | Update exception catching syntax to please pep8 | Magnus Hagander | |
2020-04-01 | Mark all ForeignKeys as on_delete=CASCADE | Magnus Hagander | |
2019-01-08 | Fix date parsing to be even more forgiving | Magnus Hagander | |
In particular, if parsing the date either fails or if it results in a date that's in the future, fall back to parsing the dates out of the Received: headers instead, because at some point there we will find a parsable date for sure (if not before then when it hit one of our own servers) | |||
2019-01-04 | Fix comparison operators | Magnus Hagander | |
2019-01-04 | Fix bad multi-command lines | Magnus Hagander | |
2019-01-04 | Whitespace fixes | Magnus Hagander | |
2019-01-04 | Fix indentation | Magnus Hagander | |
Per pep8 warnings, adjust indentation for consistency | |||
2019-01-04 | Tabs to 4 spaces | Magnus Hagander | |
pep8 standard for indentation | |||
2019-01-04 | Trap internal AssertionError from python libraries | Magnus Hagander | |
For some really broken messages, we end up in a cannot-happen codepath. Trap this one and just consider that MIME part empty, and try again later. In passing, also change it so we continue loading after failures of parsing. We continued in the mode where we just generated diffs, but not when making updates. Now continue in both cases, but of course don't do the actual update if the parsing failed. | |||
2019-01-03 | Update loader scripts to use python3 syntax | Magnus Hagander | |
Some minor cleanups as well, but mostly just the output of the 2to3 tool and some manual changes. | |||
2019-01-03 | Use "in" syntax instead of has_key() | Magnus Hagander | |
has_key() has been deprecated for a while and will be gone in Python3. The in syntax is available in both the old and the new versions. | |||
2018-11-30 | Switch to using tidylib rather than tidy | Magnus Hagander | |
tidylib (http://countergram.github.io/pytidylib/) is maintained, the old tidy one (https://cihar.com/software/utidylib/) is not. And in particular, python3 support is in the new one. Generates some minor changs in the existing archives, but it seems to be just whitespace and some actual incorrectness in the old output. | |||
2018-11-20 | Remove spaces in messageids | Magnus Hagander | |
They shouldn't be there in the first place. Sigh. But if they're there just pretend they don't exist, so we get a working messageid. | |||
2018-11-20 | One more round of header replacement fixes | Magnus Hagander | |
2018-11-20 | Add another strange timezone offset format | Magnus Hagander | |
2017-04-10 | Fix silly overlook in imports | Magnus Hagander | |
2017-04-10 | Try to decode attachment filenames when escaped | Magnus Hagander | |
Some MUAs (notably gmail at least) can generate header-escaped filenames for attachments, if non-ascii characters are included. If this happens, decode them and try to use that rather than generating filenames with escaping in them. | |||
2016-12-17 | Exclude pkcs7 signatures in attachments | Magnus Hagander | |
Treat them the same way we do with detatched pgp-signatures, which is simply don't process them as attachments. | |||
2016-03-02 | Forcibly remove \0 at the end of a decoded message | Magnus Hagander | |
This happens fairly commonly with some broken MUAs it seems. | |||
2016-02-14 | Actually store the raw data in rawtxt | Magnus Hagander | |
Previously, we would parse the message and then reconstruct it. This refolds the headers, as well as breaks From rows in the body. Oops. Instead, materalize the data into rawtxt and then parse that, instead of the other order. | |||
2013-08-17 | Properly recurse into multipart/sign:ed email parts | Cédric Villemain | |
Previously we'd only recurse into multipart/mixed, but this would miss PGP-signed attachments sent by some MUAs. | |||
2013-01-09 | Turn any non-first text/plain parts into attachments | Magnus Hagander | |
Instead of ignoring them because they're text/plain, only ignore the first one and specifically the one matching our footers. This should deal with the case when there is a textfile attached that has no name. | |||
2013-01-05 | Properly parse attachments of type=text/plain, content-disposition=attachment | Magnus Hagander | |
Previously we'd only parse them if they were given an explicit name, which is not required - instead, they can have a filename... | |||
2012-08-12 | Another typo, i think | Magnus Hagander | |
2012-08-12 | Missing object reference | Magnus Hagander | |
2012-08-09 | More forgiving parsing of emails with broken header encoding | Magnus Hagander | |
2012-07-10 | Turn rawtxt into a bytea, since w edon't know the encoding | Magnus Hagander | |
2012-07-09 | Fail date parsing on empty dates | Magnus Hagander | |
2012-07-09 | Store the raw text of messages. | Magnus Hagander | |
Also add deferred loading of all large (possibly TOASTable) columns not needed in the django views | |||
2012-07-07 | Oops.. Misisng a reference there, are we.. :) | Magnus Hagander | |
2012-07-07 | Attempt to get rid of the postgresql specific mail footer on all lists | Magnus Hagander | |
2012-07-06 | Parsing didn't work, and it's not enough messages to really care about.. | Magnus Hagander | |
2012-07-06 | Specify encoding of file | Magnus Hagander | |
2012-07-06 | Badly encoded name of encoding :O | Magnus Hagander | |
2012-07-06 | Support filtering a single email out of mbox/directory for reloading | Magnus Hagander | |
2012-07-06 | Add parameter to override the date of a message | Magnus Hagander | |
When they're so badly formatted we can't figure out a way to clean it.. | |||
2012-07-06 | typo | Magnus Hagander | |
2012-07-06 | more date stuff | Magnus Hagander | |
2012-07-06 | Strange spelling of gmt.. | Magnus Hagander | |
2012-07-06 | More timezones | Magnus Hagander | |
2012-07-06 | Silly - needs to be lowercase :S | Magnus Hagander | |
2012-07-06 | Remove dead code | Magnus Hagander | |
2012-07-06 | Handle empty bodies instead of giving an error | Magnus Hagander | |
typical case: someone sends an attachment with just a subject. this is not an error, but also has no body.. | |||
2012-07-06 | Work around more broken dates | Magnus Hagander | |
2012-07-06 | One more round of encodings | Magnus Hagander | |
2012-07-05 | Don't crash on non-multipart messages that appeaer to be multipart | Magnus Hagander | |
2012-07-05 | Clean up UTF surrogate points in unicode data | Magnus Hagander | |
They shouldn't be therein the first place, but when theyd o show up, there's a bug in python 2 (fixed in py 3) that lets them through, and postgresql barfs on them... |