Difference between revisions of "2015Q1 Reports: Info Officer"

From Admin Wiki
Jump to navigation Jump to search
Line 26: Line 26:
 
== Anthology ==
 
== Anthology ==
  
The ACL Anthology is a digital archive of research papers in computational linguistics, sponsored by the CL community, and freely available to all.  We employ a Creative Commons Attribution Non-Commercial, Share-Alike license for materials published by ACL, although dual licensing for a fee is presumably possible (although not exercised currently).  
+
The ACL Anthology is a digital archive of research papers in computational linguistics, sponsored by the CL community, and freely available to all.  We employ a Creative Commons Attribution Non-Commercial, Share-Alike license for materials published by ACL.  This makes our content usable by the general public with attribution to the ACL (although it is not mandatory for any user to inform us of their use of our materials).  Dual licensing for a fee is presumably possible (although not exercised currently).  
  
 
The Anthology now contains over 34,000 (up from 31,500 papers in the last report in Q3)  The [http://aclanthology.info new ACL Anthology] is now active and will be switched to the primary Anthology site around ACL this year, as we have had some time to sort out problems with the site.  However, we know a portion of our membership will want to still use the older version, so we are going to maintain both sites at least until the end of 2015.   
 
The Anthology now contains over 34,000 (up from 31,500 papers in the last report in Q3)  The [http://aclanthology.info new ACL Anthology] is now active and will be switched to the primary Anthology site around ACL this year, as we have had some time to sort out problems with the site.  However, we know a portion of our membership will want to still use the older version, so we are going to maintain both sites at least until the end of 2015.   
  
'''Mailing List.''' The Anthology mailing list's (http://groups.google.com/group/acl-anthology) membership pool has grown, now consisting of 479  members (up from 426 from a year ago, and 469 from six months ago).  This is an announcement-only list, where we notify members of newly listed released materials online.
+
An achievement
 +
 
 +
'''Mailing List.''' The Anthology mailing list's (http://groups.google.com/group/acl-anthology) membership pool has grown, now consisting of 533 members (up from 469 from a year ago, and 479 from the last report 6 months ago).  This is an announcement-only list, where we notify members of newly listed released materials online.
  
 
'''Plans.''' A key thrust this year will be to start assigning DOIs, as part of the ACL's initiative to take DOIs under our control.
 
'''Plans.''' A key thrust this year will be to start assigning DOIs, as part of the ACL's initiative to take DOIs under our control.
Line 41: Line 43:
  
 
* A previous discussion (with Ken Church) proposed that we create a single bibtex file for all Anthology materials.  The beta Anthology can generate such information fairly easily with its database backing; we plan to have this file available soon (before the ACL 2014 conference).
 
* A previous discussion (with Ken Church) proposed that we create a single bibtex file for all Anthology materials.  The beta Anthology can generate such information fairly easily with its database backing; we plan to have this file available soon (before the ACL 2014 conference).
 +
*
 
* collaboration with START and aclpub (also may involve the Conference Officer's work)
 
* collaboration with START and aclpub (also may involve the Conference Officer's work)
 
* collaboration with ELRA with respect to use of the LRE Map and ISLRNs, and voluntarily helping them with scanning backlog archives into a digital form.
 
* collaboration with ELRA with respect to use of the LRE Map and ISLRNs, and voluntarily helping them with scanning backlog archives into a digital form.
  
== Web Site ==
+
== Web Site and Hosting Provider ==
  
The [http://www.aclweb.org/ ACL website] continues to serve as the primary online resource for the organization. It contains the main ACL site, an ACL Wiki which serves as a resource to the general computational linguistics community, an ACL Admin wiki used to store and maintain ACL specific resources such as reports, handbooks, and policies as well as an exec wiki reserved for the use of ACL execs. We also maintain mirrors of individual ACL conference websites, membership email lists for ACL announcements and a listing of resolutions of the ACL Exec Committee.   
+
The [http://www.aclweb.org/ ACL website] continues to serve as the primary online resource for the organization. It contains the main ACL site, an ACL Wiki which serves as a resource to the general computational linguistics community, an ACL Admin wiki used to store and maintain ACL specific resources such as reports, handbooks, and policies as well as an Exec wiki reserved for the use of ACL Exec officers. We also maintain mirrors of individual ACL conference websites, membership email lists for ACL announcements and a listing of resolutions of the ACL Exec Committee.   
  
The website was previously running legacy software (Mambo), and has been recently upgraded to Drupal 7 to keep with necessary software upgrades and to prevent security risks to the system.
+
We reported in the last report that we had successfully migrated the main website from Mambo to Drupal 7, mainly to gain responsiveness for mobile clients.  We have restored all content to the site and have had minimal problems with keeping the new site managed (free from spam and updated with respect to service patches).  
  
The website was revamped from the earlier, non-responsive website in this half-year termThe resources listed on the website have largely been kept as-is in the port, pending integration with the Portal.
+
A major achievement was in migrating our hosting provider from 1and1.com to bluehost.com, due to problems with 1and1.com incorrectly auto-diagnosing our membership e-mailings as spam, and not being responsive to our customer support ticketsWe migrated to Bluehost, a well-known hosting provider and have been mildly satisfied with their services. The migration caused a several hiccups involving announcements that were not properly sent out and the Wikis having slow response times and libraries that needed installation, but these difficulties have been solved now.  We are currently on a pro shared hosting plan that has given fairly good response timesThe cost is roughly $25 per month, on par or cheaper to the old 1and1.com plan that we have now deprecated.
 
 
The previous webmaster, Joshua Herring, has voluntarily resigned as of March 2014.  However, due to obligations to integrate the Portal and the website onto Drupal 7 that have not been entirely fulfilled, he has agreed to stay on for the time being until that piece has been sorted outIn the meantime, a new webmaster at NUS has been hired (as of end May) and will be assisting in maintaining the Drupal websites and assisting with the transition to the new webhost, bluehost.
 
  
 
== Portal ==
 
== Portal ==
  
The [http://www.aclweb.org/portal ACL Portal] was created to provide a web-based platform to house facilities for the benefit of members.  The Portal currently serves little function other than maintaining a list of current members and a payment gateway for membership.  We are currently working towards integrating the Portal into the website's functionality, now that both systems are run on a common platform (Drupal 7).  Integration will involve upgrading existing custom modules developed by Ben Phelan (the previous developer) for the Portal to Drupal 7; this is still ongoing work.
+
The [http://www.aclweb.org/portal ACL Portal] was created to provide a web-based platform to house facilities for the benefit of members.  The Portal currently serves little function other than maintaining a list of current members and a payment gateway for membership.  We are currently working towards integrating the Portal into the website's functionality, now that both systems are run on a common platform (Drupal 7).  Integration will involve upgrading existing custom modules developed by Ben Phelan (the previous developer) for the Portal to Drupal 7; this is still ongoing work.  Pranav, our current webmaster, is working towards these goals.
  
We are now working in parallel on consolidating the ACL Website and the ACL Portal, and on the establishment of a central login for ACL services (something akin to a "ACL Account" a la Google or Facebook).  We are planning to use OpenID and OAuth, which would allow members to link their ACL account with other (i.e., Google, LinkedIn, Twitter, Microsoft/Hotmail) services; such that one could use login credentials from those services for ACL use.
+
A major achievement for the Portal was to consolidate our member database to eliminate spurious spammer accounts that bloated our on membership counts to 20K+ (our actual membership inclusive of past expired members is about 8.5K).  These spammer accounts recorded fake email addresses that bounced and caused the original problem with our old hosting provider.
 +
We still need to manage spam registrations on a weekly basis as the Portal allows anyone to register an account (and get a webpage listing their profile, a target for spammers to get an "endorsed" hyperlink from the Portal).
 
   
 
   
== Current Problems ==
+
We are now working in parallel on consolidating the ACL Website and the ACL Portal, which is our primary goal for our current webmasterWe also hope to resume our work on the establishment of a central login for ACL services (something akin to a "ACL Account" a la Google or Facebook).  We are planning to use OpenID and OAuth, which would allow members to link their ACL account with other (i.e., Google, LinkedIn, Twitter, Microsoft/Hotmail) services; such that one could use login credentials from those services for ACL use.
However, due to problems with the Portal software mailing (see below) and non-existent bounce processing, our internet service provider (1and1.com) has been regularly disrupting our own paid services, despite our effortsYou may have experienced temporary (1-2 day) outages of website, portal, Anthology and wikis, due to this problem.  The problem that they have cited is that the portal has two PHP scripts that send out email (for announcement services) to our membership.  However, our announcement service currently does not do bounce processing correctly, nor does our service provider give us proper notice of the bounce processing such that we can act on it.  This makes correcting this problem a serious difficulty, as the portal software is also legacy code, in the sense that the original implementer (Ben Phelan) is no longer part of ACL's staff.
 
 
 
For these reasons, limitations of services (again, concerned with the mail service), we are in the process of moving from 1and1 and migrating to bluehost.com. We hope to complete the migration by the end of 2014, but perhaps substantially sooner (end of summer 2014).
 

Revision as of 11:50, 16 February 2015

[Link to 2014 Q3 Report] [Link to 2014 Q1 Report] [Link to 2013 Q3 Report] [Link to 2013 Q1 Report]

The Information Officer (IO) portfolio includes integration of the different ACL-wide activities that are related to information dissemination; including the Anthology, website, wiki, portal and archive. Plans include provide integration of logins (through OpenID and OAuth; IN PROGRESS); update our information services to be updated and professionally-designed (PLANNED).

Long-term goals for the costs of the information services to be sponsored, movement of the aclweb.org infrastructure to a more modern webhost (DONE), accessibility and long-term maintenance of the aclweb.org and other sites, and to be cost-neutral through sponsorship by corporate interests.

IO Overview

Budget. The IO has budget to oversee part-time manpower allocated to help improve our association's websites, which includes maintenance, upgrading, migrating and backup. So far, we have incurred costs of 2,670. This is well in line with our projections.

DOIs/CrossRef.


Thompson Reuters Web of Knowledge / Elsevier Scopus Indexing: We know a good portion of our membership relies on impact assessment for promotion, ranking and tenure. We are trying to tackle this problem by investigating whether our materials can be indexed by major citation indices, namely Thompson Reuter's Web of Knowledge (Wok) and Elsevier's Scopus. We have initiated discussions and provided materials since Oct 2014 to WoK, but they have not been forthcoming with any status or updates with respect to our queries. We continue to attempt to remind them on a monthly basis, but unfortunately, we have not be able to get any response.

Elsevier requires that journals have an ISSN (largely reserved for serials / journals)

Note that the onus of journal indexing (both CL and TACL) is the responsibility of the respective journals. Currently, CL is indexed by Elsevier, but TACL is not indexed by either service.

We would appreciate help from our membership who have been successful at approaching either indexing service in helping to get the agencies to index our conference proceedings.


Anthology

The ACL Anthology is a digital archive of research papers in computational linguistics, sponsored by the CL community, and freely available to all. We employ a Creative Commons Attribution Non-Commercial, Share-Alike license for materials published by ACL. This makes our content usable by the general public with attribution to the ACL (although it is not mandatory for any user to inform us of their use of our materials). Dual licensing for a fee is presumably possible (although not exercised currently).

The Anthology now contains over 34,000 (up from 31,500 papers in the last report in Q3) The new ACL Anthology is now active and will be switched to the primary Anthology site around ACL this year, as we have had some time to sort out problems with the site. However, we know a portion of our membership will want to still use the older version, so we are going to maintain both sites at least until the end of 2015.

An achievement

Mailing List. The Anthology mailing list's (http://groups.google.com/group/acl-anthology) membership pool has grown, now consisting of 533 members (up from 469 from a year ago, and 479 from the last report 6 months ago). This is an announcement-only list, where we notify members of newly listed released materials online.

Plans. A key thrust this year will be to start assigning DOIs, as part of the ACL's initiative to take DOIs under our control.

A second thrust is to other forms of scientific knowledge that we are interested in archiving. These include software, datasets and videos. The procedures for integrating these with START and the submission process need to be worked out, and the space requirements for these services assessed. For the time being, we will concentrate on videos.

A third thrust for this year will be to incorporate the results of the R50 workshop into the Anthology, and allow third-party applications to automatically annotate articles with new metadata and papers in the Anthology, as they come available. Such an API will raise the visibility of the Anthology as a object of study, complementing our earlier work to make the Anthology's text a corpus.

We have long term plans to work on these other following problems, which are less urgent:

  • A previous discussion (with Ken Church) proposed that we create a single bibtex file for all Anthology materials. The beta Anthology can generate such information fairly easily with its database backing; we plan to have this file available soon (before the ACL 2014 conference).
  • collaboration with START and aclpub (also may involve the Conference Officer's work)
  • collaboration with ELRA with respect to use of the LRE Map and ISLRNs, and voluntarily helping them with scanning backlog archives into a digital form.

Web Site and Hosting Provider

The ACL website continues to serve as the primary online resource for the organization. It contains the main ACL site, an ACL Wiki which serves as a resource to the general computational linguistics community, an ACL Admin wiki used to store and maintain ACL specific resources such as reports, handbooks, and policies as well as an Exec wiki reserved for the use of ACL Exec officers. We also maintain mirrors of individual ACL conference websites, membership email lists for ACL announcements and a listing of resolutions of the ACL Exec Committee.

We reported in the last report that we had successfully migrated the main website from Mambo to Drupal 7, mainly to gain responsiveness for mobile clients. We have restored all content to the site and have had minimal problems with keeping the new site managed (free from spam and updated with respect to service patches).

A major achievement was in migrating our hosting provider from 1and1.com to bluehost.com, due to problems with 1and1.com incorrectly auto-diagnosing our membership e-mailings as spam, and not being responsive to our customer support tickets. We migrated to Bluehost, a well-known hosting provider and have been mildly satisfied with their services. The migration caused a several hiccups involving announcements that were not properly sent out and the Wikis having slow response times and libraries that needed installation, but these difficulties have been solved now. We are currently on a pro shared hosting plan that has given fairly good response times. The cost is roughly $25 per month, on par or cheaper to the old 1and1.com plan that we have now deprecated.

Portal

The ACL Portal was created to provide a web-based platform to house facilities for the benefit of members. The Portal currently serves little function other than maintaining a list of current members and a payment gateway for membership. We are currently working towards integrating the Portal into the website's functionality, now that both systems are run on a common platform (Drupal 7). Integration will involve upgrading existing custom modules developed by Ben Phelan (the previous developer) for the Portal to Drupal 7; this is still ongoing work. Pranav, our current webmaster, is working towards these goals.

A major achievement for the Portal was to consolidate our member database to eliminate spurious spammer accounts that bloated our on membership counts to 20K+ (our actual membership inclusive of past expired members is about 8.5K). These spammer accounts recorded fake email addresses that bounced and caused the original problem with our old hosting provider. We still need to manage spam registrations on a weekly basis as the Portal allows anyone to register an account (and get a webpage listing their profile, a target for spammers to get an "endorsed" hyperlink from the Portal).

We are now working in parallel on consolidating the ACL Website and the ACL Portal, which is our primary goal for our current webmaster. We also hope to resume our work on the establishment of a central login for ACL services (something akin to a "ACL Account" a la Google or Facebook). We are planning to use OpenID and OAuth, which would allow members to link their ACL account with other (i.e., Google, LinkedIn, Twitter, Microsoft/Hotmail) services; such that one could use login credentials from those services for ACL use.